## High-Performance SRAM Design

#### Rahul Rao

#### IBM Systems and Technology Group



# Worst case read condition



: Worst case Bitline Leakage when reading a "1"

# Data Independent Leakage Cell



Figure 9 (c): Schematic of Ten transistors with M9 and M10 added to schematic of eight transistor to lower leakage power

(Calhoun, 2010)

# **Mechanisms of Parametric Failures**

![](_page_4_Figure_1.jpeg)

### Question

#### □ Which of the following are true for the 6-T SRAM cell

- A cell with poor READ margin is unlikely to have access failure
- b) Differential read means there is no worst case data condition for read
- c) The worst case write condition is having cells with alternateOs and 1 along the column
- Access fails can be minimized by running the array at a slower frequency

### Topics

- Introduction to memory
- □ SRAM basics and bitcell array (refresher)
- Current Challenges
- □ Alternative Cell Types (6 to 10T), Asymmetric Cells, Sub-

threshold Cells, Low - leakage cells

□ Impact of Variation, Assist Circuits

- □ BTI and impact on SRAMs
- Power

![](_page_6_Picture_9.jpeg)

## **Sources of Manufacturing Variations**

#### Variation in Process Parameters

![](_page_7_Picture_2.jpeg)

## **Impact of Manufacturing Variations**

![](_page_8_Figure_1.jpeg)

#### Location of Identical Ring Oscillators on a Die

![](_page_8_Figure_3.jpeg)

#### **Frequency Correlation**

(averaged over 300 die)

## **Environmental Variations**

![](_page_9_Figure_1.jpeg)

#### **Temperature Variation**

- Switching Characteristics of Blocks
- Material Properties: Thermal Coefficient
- Cooling and Packaging Solutions
- Workload and Thermal Management Policies

# Delay and leakage increase with temperature

![](_page_9_Figure_8.jpeg)

#### **Power Supply Variation**

- IR drop: Leakage, Power grid robustness
- Ldi/dt: Transient activity, decoupling capacitors
- Power Efficient Design Strategies: Clock Gating, Power Gating

Delay increases with power supply droop

P. Restle, ICCAD 2006

# **Global and Local Variations**

![](_page_10_Figure_1.jpeg)

![](_page_11_Figure_0.jpeg)

![](_page_12_Figure_0.jpeg)

S. Mukhopadhyay, ITC 2010

![](_page_13_Figure_0.jpeg)

![](_page_14_Figure_0.jpeg)

S. Mukhopadhyay, ITC 2010

### Question

Mark worst case VT variation condition for each device for write failure

![](_page_15_Figure_2.jpeg)

# Inter-die Variation & Cell Failures

![](_page_16_Figure_1.jpeg)

![](_page_17_Figure_0.jpeg)

• *P<sub>COL</sub>*: Probability that any of the cells in a column fail

$$P_{COL} = 1 \quad (1 \quad P_F)^{N_{ROW}}$$

## Impact of Redundancy on Memory Failure

![](_page_18_Figure_1.jpeg)

Larger redundancy

Redundant Col / Total Col. [%]

- (1) more column to replace (less memory failure).
- (2) smaller cell area (larger cell failure).

# **Transistor Sizing**

![](_page_19_Figure_1.jpeg)

• Slide contributed by K. Roy, Purdue

### Question

### Array redundancy

- a) Improves cell stability
- b) Degrades cell performance (i.e increases read and write times)
- c) Does not require any change to cell peripheral circuits
- d) Row redundancy is better than column redundancy

## Example: Multi-VCC for SRAM Cell

![](_page_21_Figure_1.jpeg)

 Create differential voltage between WL and Cell to decouple the Read & Write

- Write: V\_WL > V\_Cell
- Read: V\_WL < V\_Cell</p>

# Dynamic Circuit Techniques for Variation Tolerant SRAM

![](_page_22_Figure_1.jpeg)

|                 | Read                                                                                            | Write                                                        |
|-----------------|-------------------------------------------------------------------------------------------------|--------------------------------------------------------------|
| V <sub>WL</sub> | Lower V <sub>WL</sub> =><br>lower V <sub>read</sub><br>(weak AX)                                | Higher V <sub>WL</sub> =><br>Strong AX<br>helps<br>discharge |
| V <sub>cs</sub> | Higher V <sub>cs</sub> =><br>lower V <sub>read</sub><br>(strong PD)<br>Higher V <sub>trip</sub> | Lower V <sub>cs</sub> =><br>Weak PUP                         |
| V <sub>BL</sub> | Weak impact                                                                                     | Negative V <sub>BL</sub><br>for 0 => strong<br>AX helps      |
|                 |                                                                                                 | uischarge                                                    |

# Example: Dual-Vcc based Dynamic Circuit Techniques

![](_page_23_Figure_1.jpeg)

- Dynamic VCC MUX is integrated into subarray
- VCC selection is along column direction to decouple the Read & Write

# Implementation Consideration: Half-Select Stability

![](_page_24_Figure_1.jpeg)

• Higher  $V_{WL}$ 

- Row-based scheme
- Degrades half-select read stability of the unselected columns
- Lower V<sub>cell</sub> or negative bit-line
  - + Column-based scheme
  - + Half-select read stability remains same

## **Negative Bit Line Scheme**

![](_page_25_Figure_1.jpeg)

Source: S. Mukhopadhyay, R. Rao et. al, TVLSI 2009

![](_page_26_Figure_0.jpeg)

- Various dynamic schemes have different effectiveness in improving writability for similar read stability
  - Higher V<sub>WL</sub> is most effective

Source: S. Mukhopadhyay, R. Rao et. al, TVLSI 2009

# Impact on Active Data-Retention

![](_page_27_Figure_1.jpeg)

- Column based read-write control adversely impact the active data-retention failures
  - DC negative bitline has higher active data-retention failures
  - Tran-NBL and lower V<sub>cs</sub> have comparable failure rates

Source: S. Mukhopadhyay, R. Rao et. al, TVLSI 2009

# **Assist Methods**

WLOD (WL Overdrive)

Strengthen PG

![](_page_28_Figure_3.jpeg)

VCDL (V<sub>DD,CELL</sub> Lowering)

![](_page_28_Figure_5.jpeg)

![](_page_28_Figure_6.jpeg)

**NBL** (Negative BL)

Strengthen PG

![](_page_28_Figure_9.jpeg)

SBL (Suppressed BL)

Weaken BL noise

![](_page_28_Figure_12.jpeg)

WLUD (WL Underdrive)

Weaken PG

![](_page_28_Figure_15.jpeg)

#### Proposed Dual-Transient WL (DTWL)

- Dual-Transient WL (DTWL) controls WL transiently
- DTWL provides mix-up assist for read and write
  - Covers different process-corner
  - DTWL mitigate the impact of WLOD (WA)

![](_page_29_Figure_5.jpeg)

### Question

#### □ Of the various assist methods

- a) Negative bit line scheme does not help 8-T sram cell
- b) Word line under drive does not help 8-T sram cell
- c) Word line over drive does not help 7-T conditionally decoupled sram cell
- d) VCDL does not help any kind of assymetric sram cell

![](_page_31_Figure_0.jpeg)

Slide 31