Unit4 Memory Array Structures (Part1)
Unit4 Memory Array Structures (Part1)
BL BL BL
WL WL
WL
0
GND
Figure shows ROM cell. The cell should be designed such that 0 or 1 is presented to the
bit line upon activation of its word line. Initially BL is connected to GND through resistor.
The presence or absence of a diode between WL and BL differentiates between ROM
cells storing 1 or 0 respectively.
The disadvantage of diode cell is that it does not isolate the bit line from the word line.
Other approach is to use active NMOS transistor whose drain is connected to the supply
voltage. Here all output driving current is provided by the MOS transistor in the cell.
WL [0]
GND
WL [1]
WL [2]
GND
WL [3]
word0: 010101,
word1: 011001
Fig: Pseudo-nMOS OR ROM
word2: 100101,
word3: 101010
V DD
Pull-up devices
WL [0]
WL [1]
WL [2]
WL [3]
Here word lines are active low signals, initially WL are high, to store logic 1 connect
a transistor, to store logic 0 no transistor is required
Fig: Pseudo-nMOS AND ROM
– each input and output data line connects to each cell in its
column
– when row is enabled by decoder, each cell has logic that stores
input data bit or outputs stored bit depending on rd/wr control.
SRAM
Figure shows a 6-transistor (6T) SRAM commonly used in practice.
Such a cell uses a single wordline and both true and complementary
bitlines. The complementary bit-line is often called bit_b. The cell
contains a pair of cross-coupled inverters and an access transistor for
each bitline. True and complementary versions of the data are stored
on the cross-coupled inverters. If the data is disturbed slightly, positive
feedback around the loop will restore it to VDD or GND. The wordline
is asserted to read or write the cell.
Fig: Read operation for 6T SRAM cell Fig x1: SRAM column read
Figure shows a SRAM cell being read. The both bitlines are initially
pre-charged to high. Assume A is initially '0' and thus A_b is initially
'1.' A_b and bit_b both should remain '1'. When the wordline is raised,
bit should be pulled down through transistors Nl and N2. At the same
time bit is being pulled down, node A tends to rise. A is held low by N1,
but raised by current flowing in from N2. Hence, N1 must be stronger
than N2. Specifically, the transistors must be ratioed such that node A
remains below the switching threshold of the P2/N3 inverter. This
constraint is called read stability.
Waveforms for the read operation are shown in Figure (b) as a 0 is
read onto bit. Observe that A momentarily rises, but does not glitch
badly enough to flip the cell.
Figure x1 shows the same cell in the context of a full column from the
SRAM. During phase 2, the bitlines are precharged high.
Many SRAM cells share the same bitline pair, which acts as a
distributed dual-rail footless dynamic multiplexer.
Fig: Write operation for 6T SRAM cell Fig: SRAM column write
The waveforms of Figure show the SRAM cell being written. Again,
assume A is initially '0' and that we wish to write a '1‘ into the cell, bit
is precharged high and left floating. bit_b is pulled low by a write
driver. We know on account of the read stability constraint that bit will
be unable to force A high through N2. Hence, the cell must be written
by forcing A_b low through N4. P2 opposes this operation; thus, P2
must be weaker than N4 so that A_b can be pulled low enough. This
constraint is called writeability.
Once A_b falls low, N1 turns OFF and P1 turns ON, pulling A high as
desired.
DRAM
Dynamic RAMs (DRAMs) store their contents as charge on a capacitor
rather than in a feedback loop. Thus, the basic cell is substantially
smaller than SRAM, but the cell must be periodically read and
refreshed so that its contents do not leak away.
The read disturbs the cell contents at x, so the cell must be rewritten
after each read.
On a write, the bitline is driven high or low and the voltage is forced onto
the capacitor. Some DRAMs drive the wordline to VDDP = VDD + Vt to
avoid a degraded level when writing a '1.'
The DRAM capacitor Ccell must be as small as possible to achieve good
density. However, the bitline is contacted to many DRAM cells and has
a relatively large capacitance Cbit.
Therefore, the cell capacitance is typically much smaller than the bitline
capacitance.
3-Transistor DRAM Cell
BL 1 BL 2
WWL
RWL WWL
M3 RWL
M1 X X VDD 2 VT
M2
VDD
CS BL 1
BL 2 VDD 2 VT DV
Figure shows two stage sensing approach along with the SRAM bit column structure. The
bit lines are connected to the inputs x and xb. of the amplifier.
Content-Addressable Memory (CAM)
The CAM acts as an ordinary SRAM that can be read or written given adr and data,
but also performs matching operations. Matching asserts a matchline output for each
word of the CAM that contains a specified key.
Bit Bit Bit Bit
Word Bit Bit
M8 M9
M4 M5
CAM ••• CAM
M6 M7
Word
Word S S
int
CAM ••• CAM M3 M2
Match
M1
Figure (c) shows a 10T CAM cell consisting of a normal SRAM cell with
additional transistors to perform the match. Multiple CAM cells in the
same word are tied to the same matchline.
Figure below shows a complete 4 × 4 CAM array. Like an SRAM, it consists
of an array of cells, a decoder, and column circuitry. However, each row also
produces a dynamic matchline. The matchlines are precharged with the
clocked pMOS transistors. The miss signal is produced with a distributed
pseudo-nMOS NOR.
Floating Lines
Driven Lines
If the line Y is driven with a resistance
RY, a step on line X results in a transient
on line Y. The transient decays with a
time constant tXY = RY(CXY+CY). The
actual impact on the “victim line” is a
strong function of the rise (fall) time of
the interfering signal. Fig: Capacitive coupling to a driven line
If the rise time is larger than the time constant, the peak value of disturbance
is diminished(reduced). This is illustrated in the waveforms of figure.
Obviously, keeping the driving impedance of a wire—and hence tXY —low
goes a long way towards reducing the impact of capacitive cross talk.
The keeper transistor, added to a dynamic gate or precharged wire, is an
excellent example of how impedance reduction helps to control noise.
There are several approaches to dealing with cross talk between capacitive
transmission lines:
1. Wherever possible signals on adjacent wiring layers should be routed in
perpendicular directions to minimize the vertical coupling.
(i.e. It is also used to say that if on one layer the traces follow the
direction "from north to south", on the layer adjacent to it the
traces should follow the direction "from east to west“).
4. If signals are sent differentially (e.g., like the bit lines in a SRAM), cross talk
can be made common-mode by routing the true and complement lines close to
each other and (optionally) periodically reversing their positions.
5. Sensitive signals (e.g., those that operate at a low voltage) should be well
separated from full-swing signals to minimize the capacitive coupling from
signals with large ΔV.
6. In extreme cases, a sensitive signal can be shielded by placing conductors
above, below, and on either side of it that are tied to the reference supply (Vp or
GND, depending on the signal) at a single point.
7. The interwire capacitance between signals on different layers can be further
reduced by the addition of extra routing layers.
When one device is sending data on the bus, all other sending devices should
be disconnected.
This can be achieved by putting the output buffers of those devices in a high
impedance state Z that effectively disconnects the gate from the output wire.
Such a buffer has three possible states—0, 1, and Z—and is therefore called a
tri-state buffer.
Current flowing through a resistive wire results in an ohmic voltage drop that
degrades the signal levels. This is especially important in the power
distribution network,
This is demonstrated by the circuit in Figure , where an inverter placed far
from the power and ground pins connects to a device closer to the supply. The
difference in logic levels caused by the IR voltage drop over the supply rails
might partially turn on transistor M1. This can result in an accidental
discharging of the precharged, dynamic node X, or cause static power
consumption if the connecting gate is static.
The most obvious solution to this problem is to reduce the maximum distance
between the supply pins and the circuit supply connections. This is most easily
accomplished through a structured layout of the power distribution network.
A number of onchip power-distribution networks with peripheral bonding are
shown in Figure.
(a) Single layer power grid (b) Dual layer grid; (c) Dual power plane.
Fig: On-chip power distribution networks.
The power and ground are brought onto the chip via bonding pads located on
the four sides of the chip. In the first approach (a), power and ground are
routed vertically (or horizontally) on the same layer. Power is brought in from
two sides of the chip. Local power strips are strapped to this upper grid, and
then further routed on the lower metal levels.
Method (b) uses two coarse metal layers for the power distribution, and the
power is brought in from the four sides of the die (chip).
The other method is to use two solid metal planes for the distribrution of Vdd
and GND (c).
This approach has the advantage of drastically reducing the resistance of the
network.
The metal planes also act as shields between data signalling layers, hence
reducing cross-talk.
They also help to reduce the on-chip inductance. Obviously, this approach is
only feasible when sufficient metal layers are available.
Resistance and Performance—RC Delay
The delay of a wire grows quadratically with its length (i.e ).
Doubling the length of a wire increases its delay by a factor of four.
The signal delay of long wires therefore tends to be dominated by the RC
effect. This is becoming an larger problem in modern technologies, which
feature an increasing average length of the global wires.
In this section, we discuss a number of design techniques that may help to
cope with the delay imposed by the resistance of a wire.
1. Better Interconnect Materials
Resistivity of commonly-used
A first option for reducing RC delays is to use better conductors (at 20 C)
interconnect materials. The introduction of silicides
and Copper have helped to reduce the resistance of
polysilicon and metal wires, respectively, while the
use of dielectric materials with a lower permittivity
lowers the capacitance.
But as the technology grows the new material do not solve the fundamental problem of
the delay of long wires.
2. Innovative design techniques are often the only way of coping with the latter.
Sometimes, it is hard to avoid the use of long polysilicon wires. A good example of such
circumstance are the address lines in memories, which must connect to a large number
of transistor gates. Keeping the wires in polysilicon increases the memory density by
avoiding the overhead of the extra metal contacts.
The polysilicon-only option unfortunately leads to an excessive propagation delay.
One possible solution is to drive the word line from both ends, as shown in Figure.
This effectively reduces the worst-case delay by a factor of four.
The most popular design approach to reduce the propagation delay of long wires is to
introduce intermediate buffers, also called repeaters, in the interconnect line.
Making an interconnect line m times shorter reduces its propagation delay quadratically.
Assuming that the repeaters have a fixed delay tpbuf , the delay of the partitioned wire
is given by:
5. Optimizing the Interconnect Architecture
Even with buffer insertion, the delay of a resistive wire cannot be reduced below the
minimum. Long wires hence often exhibit a delay that is longer than the clock period
of the design.
Wire pipelining is a popular performance-improvement technique in this category.
The wire is partitioned in k segments by inserting registers or latches. While this does not
reduce the delay through the wire segment— it takes k clock cycles for a signal to proceed
through the wire—, it helps to increase its throughput, as the wire is handling k signals
simultaneously at any point in time. The delay of the individual wire segments can further
be optimized by repeater insertion, and should be below a single clock period.
Inductive Parasitics
Besides having a parasitic resistance and capacitance, interconnect wires also
exhibit an inductive parasitic.
An important source of parasitic inductance is introduced by the bonding wires
and chip packages.
The source of inductive parasitics and their quantitative values are discussed
here.
Inductance and Reliability— Ldi/dt Voltage Drop
During each switching action, a transient current is sourced from (or
sunk into) the supply rails to charge (or discharge) the circuit
capacitances, as shown in Figure. Both VDD and VSS connections are
routed to the external supplies through bonding wires and package
pins and possess a nonignorable series inductance.
1. Separate power pins for I/O pads and chip core—Since the I/O drivers
require the largest switching currents, they also cause the largest current
changes. It is wise to isolate the center of the chip, where most of the logic
action occurs, from the drivers by providing different power and ground pins.
2. Multiple power and ground pins— In order to reduce the di/dt per supply
pin, we can restrict the number of I/O drivers connected to a single supply pin.
Typical numbers are five to ten drivers per supply pin.
3. Careful selection of the positions of the power and ground pins on the
package— The inductance of pins located at the corners of the package is
substantially higher due to its length
as shown in Figure
Fig: The inductance of a bonding-wire/pin
combination depends upon the pin position
4. Increase the rise and fall times of the off-chip signals to the maximum extent
allowable and distributed all over the chip.
5. Use advanced packaging technologies such as surface-mount or hybrids that
come with a substantially reduced capacitance and inductance per pin. For
instance, we can see from Table 2.2 that the bonding inductance of a chip
mounted in flip-chip style on a substrate using the solder-bump techniques is
reduced to 0.1nH, which is 50 to 100 times smaller than for standard packages.
6. Adding decoupling capacitances on the board— These capacitances, which
should be added for every supply pin, act as local supplies and stabilize the
supply voltage seen by the chip. They separate the bonding-wire inductance
from the inductance of the board interconnect (Figure). The bypass capacitor,
combined with the inductance, actually acts as a low-pass network that filters
away the high-frequency components of the transient voltage spikes on the
supply lines.
Finally, be aware that the mutual inductance between neighboring wires also
introduces cross talk. This effect is not yet a major concern in CMOS but
definitely is emerging as an issue at the highest switching speeds.
Inductance and Performance—Transmission Line Effects
When an interconnection wire becomes sufficiently long or when the circuits
become sufficiently fast, the inductance of the wire starts to dominate the
delay behavior, and transmission line effects must be considered.
In this section, we discuss some techniques to minimize the impact of the
transmission line behavior such as Termination and Shielding of wires.
Termination
Appropriate termination is the most effective way of minimizing the delay.
Matching the load impedance to the characteristic impedance of the line
results in the fastest response. This leads to the following design rule:
To avoid the negative effects of transmission-line behavior such as slow
propagation delays, the line should be terminated, either at the source (series
termination), or at the destination (parallel termination) with a resistance
matched to its characteristic impedance Z0.
The two scenarios — series and
parallel termination — are shown in
Figure.
Series termination requires that the
impedance of the signal source is
matched to the connecting wire.
The impedance of the driver inverter
can be matched to the line by careful Fig: Matched termination scenarios for
transistor sizing. wires behaving as transmission lines:
(a) series termination at the source;
(b) parallel termination at the destimation.
It is important that the impedance of the driver is closely matched to the line,
typically to within 10% or better, then excessive reflections of travelling waves
can be avoided. This can be compensated by making the resistance of the driver
transistors electrically tunable as shown in figure.
Fig: Tunable segmented driver providing matched series-termination to a transmission
line load.