220 likes | 437 Views
ICS- FORTH. Data Synchronization Issues in GALS SoCs. Rostislav (Reuven) Dobkin and Ran Ginosar Technion Christos P. Sotiriou FORTH. Outline. The Problem Synchronization Failures in GALS SoCs Three solutions: Timing verification Synchronizers Locally-delayed clocks Analysis.
E N D
ICS- FORTH Data Synchronization Issues in GALS SoCs Rostislav (Reuven) Dobkin and Ran GinosarTechnion Christos P. SotiriouFORTH
Outline • The Problem • Synchronization Failures in GALS SoCs • Three solutions: • Timing verification • Synchronizers • Locally-delayed clocks • Analysis
A GALS Module contains: Synchronous Island Local clock generator Self-timed wrapper (can stop the local clock) Handshake for inter-modular communications, GALS with Stoppable Clocks Moore et al., “Point to point GALS interconnect,” ASYNC 2002 Villiger et al., “Self- timed Ring for Globally- Asynchronous Locally- Synchronous Systems,” ASYNC 2003
Data Synchronization Moore et al., “Point to point GALS interconnect,” ASYNC 2002 Villiger et al., “Self- timed Ring for Globally- Asynchronous Locally- Synchronous Systems,” ASYNC 2003
Synchronization Failure Due to clock tree delay, the previous clock rise may conflict with the handshake
d Synchronization Failure: RACE ! Conflict Condition: DCLK = d + x
Solution 1:Timing Verification • Extract delays • Verify that DCLK falls inside the SAFE zones SAFE SAFE … SAFE
Solution 1: Disadvantages • Clock tree delays must be re-verified after each layout iteration • The solution is sensitive to thermal and voltage variations
Solution 2:Two-Flop Synchronizer • Low bandwidth • Resolution time: one clock cycle • Data Cycle: At least 3 clock cycles
Solution 3:Time Budget Clock Y MS DCTRL Y1 Conflict Port Wins MS Y1 DCTRL Clock Wins Asynchronous Controller Delay Clock Y1High-Phase MUTEX Metastability Resolution
How much resolution time? • Less than 50 FO4 delays needed to resolve metastability • ASIC / SoC clocks are slow: T > 100 FO4 delays • Conclusions: • Fast clocks: Half a cycle is budgeted for M/S resolution • Slower clocks (T>200 FO4): Quarter cycle REQUIRED MTBF (YEARS)
Solution 3:A. Decoupled Input Port DCTRL= D{R3+DO+ DI+L-R2-}
Solution 3:B. Decoupled Output Port DCTRL= D{A4+ A1+ A3-}
Solution 3:C. A Simpler Input Port DCTRL= DLATCH + DTX {ACK+ REQ-}
DL Solution 3:Analysis Clock Y MS DCTRL Y1 Conflict Minimal Clock High-Phase, THP ~3 FO4 gate delays T/4 for M/S Resolution Asynchronous Controller Delay • Example: T=160 FO4 gate delays. Constraint:
Solution 3:Simulations *These results are based on data bus width of 16 bits
Summary • Design of arbitrated clocks for GALS SoCs must consider clock tree delays to control the risk of synchronization failures • Presented three solutions: • Extract the delays and verify timing • Employ 2-flop synchronizers or matched-delay async ports (low bandwidth) • Employ locally-delayed ports(high bandwidth)