Advanced Fusion Compiler Synthesis and P& R Technologies To Drive Performance and
Advanced Fusion Compiler Synthesis and P& R Technologies To Drive Performance and
Keerthi Penmetsa
Synopsys
IC Compiler II
Market ML-enhanced tools, AI-driven apps
Leadership Accelerating AI, automotive, and multi-die systems
Signoff Fusion
Cloud-ready
PrimeTime, PrimeShield StarRC
PrimePower IC Validator NXT
PrimeLib Formality / Formality ECO
Tweaker ECO RedHawk Analysis Fusion
POST SILICON
7
Avg % Avg 16 %
1% 5% 10 % 15 % 20 % S malle r 1% 5% 10 % 15 % 30 % Lowe r
2-3X Id le Avg30 %
Faste r Powe r
1.5 X 2X 3X Lower
1% 5% 15 % 30 % 60 %
Un-gated Register
New clock latency
CG_A_1
CG_A
CK
E
Clock latency updated
Diminished power
returns compared
to area overhead
• To ensure QoR improvements, the self-gating algorithm takes timing and power
into consideration. A self-gating cell is inserted for a candidate registers if:
– There is enough timing slack available in the register’s data pin
– Internal dynamic power of the circuit is reduced
– Smart grouping of candidate registers
• To enable self-gating inside place_opt:
– set the place_opt.flow.enable_self_gating application option to true
D00 D00
D01 D01
D10 D10
D11 D11
• Placement
aware debanking clock_opt
(via restructuring)
• Benefits
– DCDP focuses to improve corner congestion and routability
– No need for custom placement blockages (hard or soft or partial)
at the congested corners to improve routability.
– The tool supports an Auto Density Control feature to control cell density for both
good timing and good congestion. But it still needs manual manipulation of the
density settings for better QoR
– This new density handling feature has better QoR without manual tuning
• Benefits
– Focus on mitigation of density hotspots, controlling oscillations between spreading
and clumping density objectives, and more targeted congestion expansion
– Density flattening improves local density hotspots by targeting them dynamically
– Footprint expansion increases the accuracy of cell expansion by accounting for
anticipated changes in local density
– Single-sided density cost avoids oscillations of cells spreading and clumping over
the placements in the flow
SNUG SILICON VALLEY 2023 17
Optimization Technology
IO Priority
Consistent de-prioritization of I/O through the RTL2GDS flow
• Benefits
– Consistent handling of IO paths through the flow
– Script simplification. No need for custom IO path groups and weights to drive QoR trajectory
– Transparent to the users SNUG SILICON VALLEY 2023 19
T-2022.03-SP4
IO Priority
User Interface
• Requirements
– time.enable_io_path_groups must be true (default)
• In version T-2022.03-SP3, the redundancy removal step prints a log message when a register
becomes constant or unloaded
– Spacing rules
– Cross-row VT rules
– NDR related PG-DRC rules Batch Legalization
Wire-Opt Enhancements
Better timing (R2R TNS) at end-of-flow
• Setup TNS, power, and routability are comparable or improved when low/medium hold effort used
• “low” effort is applied automatically when set_qor_strategy -mode early_design used
Next Generation -
H-tree Based Multisource Clock Trees
Fully Automated H-tree with Improved Routing and Latency Driven Tap Assignment
Placed block +
CTS setup Automatic derivation of tap Improved handshaking between
drivers and configuration
Custom Router and Detail Route
H-tree Setup
Improved pin connection Detail Route
delay
Synthesis
Tap
Latency-aware tap insertion LATENCY DRIVEN TAP ASSIGNMENT
Reduced WL for
Assignment Latency-aware tap
assignment
Latency and wire length Sinks
similar/better
latency
awareness for improved
Clock Tree No user-driven exploration sink distribution.
Synthesis flows
Tap Driver