Module 3 - Part 2

Pipelining is a technique that decomposes a sequential process into sub-operations executed concurrently across dedicated segments, enhancing processing speed. The document explains the structure of a four-segment pipeline, its speedup ratio compared to non-pipelined processing, and various applications such as instruction and arithmetic pipelines. It also discusses the challenges of instruction-level parallelism and the operation of supercomputers, emphasizing the importance of efficient data handling in pipelined architectures.

Uploaded by

manomitkundu1590

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views7 pages

Module 3 - Part 2

Uploaded by

manomitkundu1590

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Pipelining

Pipelining is a technique of decomposing a sequential process into sub-operations; with each

sub-process being executed in a special dedicated segment that operates concurrently with all
other segments. A pipeline can be visualized as a collection of processing segments through
which binary information flows.
General Considerations
Any operation that can be decomposed into a sequence of sub-operations of about the same
complexity can be implemented by a pipeline processor. The general structure of a four-
segment pipeline is illustrated in Fig. 46. The operands pass through all four segments in a
fixed sequence.

The space-time diagram of a four-segment pipeline is demonstrated in Fig47.

The speedup(S) of a pipeline processing over an equivalent non-pipeline processing is defined

𝑛𝑡
by the ratio: 𝑆= 𝑛
(𝑘+𝑛−1)𝑡𝑝
As the number of tasks increases, n becomes much larger than 𝑘 − 1, and 𝑘 + 𝑛 − 1
approaches the value of n. Under this condition, the speedup becomes:
𝑡𝑛
𝑆=
𝑡𝑝
numerical example: Let the time it takes to process a sub-operation in each segment be equal
to 𝑡𝑝= 20 ns. Assume that the pipeline has 𝑘 = 4 segments and executes 𝑛 = 100 tasks in
sequence. The pipeline system will take
(𝑘 + 𝑛 − 1)𝑡𝑝 = (4 + 99) × 20 = 2060𝑛𝑠
to complete. Assuming that t = ktp = 4 x 20 = 80 ns,
a non-pipeline system requires:
𝑛𝑘𝑡𝑝 = 100 × 80 = 8000𝑛𝑠
to complete the 100 tasks. The speedup ratio is equal to:
8000⁄
2060 = 3.88
Instruction Pipeline
The computer needs to process each instruction with the following sequence of steps:
1. Fetch the instruction from memory.
2. Decode the instruction.
3. Calculate the effective address.
4. Fetch the operands from memory.
5. Execute the instruction.
6. Store the result in the proper place.
Figure 48 shows how the instruction cycle in the CPU can be processed with a four-segment
pipeline. While an instruction is being executed in segment 4, the next instruction in sequence is
busy fetching an operand from memory in segment 3.
The four segments are represented in the flowchart:
1. FI is the segment that fetches an instruction.
2. DA is the segment that decodes the instruction and calculates the effective address.
3. FO is the segment that fetches the operand.
4. EX is the segment that executes the instruction.
A pipeline operation is said to have been stalled if one unit (stage) requires more time to perform
its function, thus forcing other stages to become idle. Consider, for example, the case of an
instruction fetch that incurs a cache miss. Assume also that a cache miss requires three extra time
units.

Instruction-Level Parallelism
Contrary to pipeline techniques, instruction-level parallelism (ILP) is based on the idea of
multiple issue processors (MIP). An MIP has multiple pipelined datapaths for instruction
execution. Each of these pipelines can issue and execute one instruction per cycle. Figure 49
shows the case of a processor having three pipes. For comparison purposes, we also show in the
same figure the sequential and the single pipeline case.
Arithmetic Pipeline
Pipeline arithmetic units are usually found in very high speed computers. They are used to
implement floating-point operations, multiplication of fixed-point numbers, and similar
computations encountered in scientific problems.
an example of a pipeline unit for floating-point addition and subtraction. The inputs to the
floating-point adder pipeline are two normalized floating-point binary numbers.

A, B are two fractions that represent the mantissas and a, b are the exponents. The sub-
operations that are performed in the four segments are:
1. Compare the exponents.
2. Align the mantissas.
3. Add or subtract the mantissas.
4. Normalize the result.
Numerical example may clarify the sub-operations performed in each segment. For simplicity,
we use decimal numbers, although Fig.49 refers to binary numbers. Consider the two normalized
floating-point numbers:

The two exponents are subtracted in the first segment to obtain (3 − 2 = 1). The larger exponent
3 is chosen as the exponent of the result. The next segment shifts the mantissa of Y to the right
to obtain:

This aligns the two mantissas under the same exponent. The addition of the two mantissas in
segment 3 produces the sum:
Suppose that the time delays of the four segments are 𝑡1 = 60𝑛𝑠, 𝑡2 = 70𝑛𝑠, 𝑡3 = 100𝑛𝑠,
𝑡4 = 80𝑛𝑠, and the interface registers have a delay of 𝑡𝑟 = 10𝑛𝑠. The clock cycle is chosen to be
𝑡𝑝 = 𝑡3 + 𝑡𝑟 = 110𝑛𝑠 . An equivalent non-pipeline floating point adder-subtractor will have
a delay time 𝑡𝑛 = 𝑡1 + 𝑡2 + 𝑡3 + 𝑡4 + 𝑡𝑟 = 320𝑛𝑠. In this case the pipelined adder has a speedup
of 320/110 = 2.9 over the non-pipelined adder.
Supercomputers
Supercomputers are very powerful, high-performance machines used mostly for scientific
computations. To speed up the operation, the components are packed tightly together to minimize
the distance that the electronic signals have to travel. Supercomputers also use special techniques
for removing the heat from circuits to prevent them from burning up because of their close
proximity.
A supercomputer is a computer system best known for its high computational speed, fast and
large memory systems, and the extensive use of parallel processing.
Delayed Branch
Consider now the operation of the following four instructions:

If the three-segment pipeline proceeds: (I: Instruction fetch, A:ALU operation, and E: Execute
instruction) without interruptions, there will be a data conflict in instruction 3 because the operand
in R2 is not yet available in the A segment. This can be seen from the timing of the pipeline
shown in Fig. 50(a). The E segment in clock cycle 4 is in a process of placing the memory data
into R2. The A segment in clock cycle 4 is using the data from R2, but the value in R2 will not
be the correct value since it has not yet been transferred from memory. It is up to the compiler
to make sure that the instruction following the load instruction uses the data fetched from
memory. It was shown in Fig. 50 that a branch instruction delays the pipeline operation by NOP
instruction until the instruction at the branch address is fetched.

Understanding Pipelining Techniques
No ratings yet
Understanding Pipelining Techniques
25 pages
Parallel Processing Structures in COA
No ratings yet
Parallel Processing Structures in COA
24 pages
RISC vs CISC: Architecture Overview
No ratings yet
RISC vs CISC: Architecture Overview
16 pages
Understanding Pipelining in Computing
No ratings yet
Understanding Pipelining in Computing
21 pages
Understanding Pipelining Techniques
No ratings yet
Understanding Pipelining Techniques
15 pages
Pipelining and Parallel Processing
No ratings yet
Pipelining and Parallel Processing
9 pages
Pipeline and Vector Processingvvvvvvvvvvvvv
No ratings yet
Pipeline and Vector Processingvvvvvvvvvvvvv
10 pages
Arithmetic Pipeline in Parallel Processing
No ratings yet
Arithmetic Pipeline in Parallel Processing
21 pages
Pipelining and Vector Processing Overview
No ratings yet
Pipelining and Vector Processing Overview
33 pages
Pipelining in Computer Architecture
No ratings yet
Pipelining in Computer Architecture
13 pages
Pipelining Design Architecture - 095634
No ratings yet
Pipelining Design Architecture - 095634
6 pages
Understanding Pipelining Concepts
No ratings yet
Understanding Pipelining Concepts
38 pages
Pipelining and Vector Processing Explained
No ratings yet
Pipelining and Vector Processing Explained
29 pages
ARM Chip Instruction Examples
No ratings yet
ARM Chip Instruction Examples
21 pages
Chapter-6 1-6 2-6 3
No ratings yet
Chapter-6 1-6 2-6 3
41 pages
COA-5-1&2 Notes
No ratings yet
COA-5-1&2 Notes
14 pages
Pipelining in Parallel Computer Architecture
No ratings yet
Pipelining in Parallel Computer Architecture
59 pages
Understanding Pipelining in Computing
No ratings yet
Understanding Pipelining in Computing
13 pages
Unit III Pipelining
No ratings yet
Unit III Pipelining
10 pages
Pipelining and Vector Processing Techniques
No ratings yet
Pipelining and Vector Processing Techniques
40 pages
Parallel and Pipeline Processing Techniques
No ratings yet
Parallel and Pipeline Processing Techniques
10 pages
Pipelining vs. Parallel Processing Explained
No ratings yet
Pipelining vs. Parallel Processing Explained
32 pages
Pipelining in Computer Architecture Explained
No ratings yet
Pipelining in Computer Architecture Explained
11 pages
Understanding Parallel Processing Techniques
No ratings yet
Understanding Parallel Processing Techniques
17 pages
Pipelining in Computer Architecture
100% (1)
Pipelining in Computer Architecture
33 pages
Understanding Multiprocessors and Pipelining
No ratings yet
Understanding Multiprocessors and Pipelining
11 pages
Understanding Pipelining Concepts
No ratings yet
Understanding Pipelining Concepts
20 pages
Lecture 5 Computer Architecture
No ratings yet
Lecture 5 Computer Architecture
16 pages
Understanding Pipelining Techniques
No ratings yet
Understanding Pipelining Techniques
5 pages
Pipeline and Vector Processing Overview
No ratings yet
Pipeline and Vector Processing Overview
11 pages
Pipelining and Vector Processing Overview
No ratings yet
Pipelining and Vector Processing Overview
29 pages
Understanding Pipelining in CPUs
No ratings yet
Understanding Pipelining in CPUs
8 pages
Pipeline and Vector Processing Overview
100% (1)
Pipeline and Vector Processing Overview
18 pages
Pipelining and Vector Processing Overview
No ratings yet
Pipelining and Vector Processing Overview
46 pages
Parallel Processing and Pipelining Explained
No ratings yet
Parallel Processing and Pipelining Explained
13 pages
Unit 5
No ratings yet
Unit 5
11 pages
Pipelining in Computer Architecture
No ratings yet
Pipelining in Computer Architecture
74 pages
Pipeline and Multiprocessors Overview
No ratings yet
Pipeline and Multiprocessors Overview
21 pages
Pipelining Techniques and Challenges
No ratings yet
Pipelining Techniques and Challenges
31 pages
COA Unit 5
No ratings yet
COA Unit 5
22 pages
Pipelining and Vector Processing Overview
No ratings yet
Pipelining and Vector Processing Overview
28 pages
Understanding Parallel Processing Techniques
No ratings yet
Understanding Parallel Processing Techniques
59 pages
Pipeline and Vector Processing Techniques
No ratings yet
Pipeline and Vector Processing Techniques
52 pages
Pipelining: Techniques and Hazards
No ratings yet
Pipelining: Techniques and Hazards
13 pages
Pipelining
No ratings yet
Pipelining
54 pages
Parallel Processing and Pipelining Explained
No ratings yet
Parallel Processing and Pipelining Explained
72 pages
Understanding Parallel Architecture and Pipelining
No ratings yet
Understanding Parallel Architecture and Pipelining
19 pages
Parallel Processing and Pipelining Techniques
No ratings yet
Parallel Processing and Pipelining Techniques
20 pages
Pipelining in Computer Architecture
No ratings yet
Pipelining in Computer Architecture
46 pages
Introduction to Pipelining in CPUs
No ratings yet
Introduction to Pipelining in CPUs
7 pages
Pipelining Techniques in Microprocessors
No ratings yet
Pipelining Techniques in Microprocessors
16 pages
Understanding Pipelining Concepts
No ratings yet
Understanding Pipelining Concepts
23 pages
Pipelining and Vector Processing Techniques
No ratings yet
Pipelining and Vector Processing Techniques
30 pages
Instruction Pipelining Overview
No ratings yet
Instruction Pipelining Overview
13 pages
Pipe Lining
No ratings yet
Pipe Lining
15 pages
Parellel Processing
No ratings yet
Parellel Processing
10 pages
Parallel vs Serial Processing Explained
No ratings yet
Parallel vs Serial Processing Explained
23 pages
Pipeline Hazards and Solutions in Multiprocessors
No ratings yet
Pipeline Hazards and Solutions in Multiprocessors
32 pages
Understanding Pipeline Processing in Computing
No ratings yet
Understanding Pipeline Processing in Computing
18 pages
Storage Management and File Systems Guide
No ratings yet
Storage Management and File Systems Guide
14 pages
7th Gen Core Family Desktop S Processor Lines Datasheet Vol 1
No ratings yet
7th Gen Core Family Desktop S Processor Lines Datasheet Vol 1
47 pages
Sap Suse Implementation Guide Omnistack PDF
No ratings yet
Sap Suse Implementation Guide Omnistack PDF
35 pages
ESP32-C6-WROOM-1 Datasheet Overview
No ratings yet
ESP32-C6-WROOM-1 Datasheet Overview
52 pages
AUTOSAR Communication Methodology Overview
No ratings yet
AUTOSAR Communication Methodology Overview
49 pages
Understanding the Fetch Decode Execute Cycle
No ratings yet
Understanding the Fetch Decode Execute Cycle
4 pages
GameCenter Startup Log Analysis
No ratings yet
GameCenter Startup Log Analysis
22 pages
Overview of Distributed Operating Systems
No ratings yet
Overview of Distributed Operating Systems
21 pages
IPv4 vs IPv6 and Network Topologies Guide
No ratings yet
IPv4 vs IPv6 and Network Topologies Guide
7 pages
AKD to AKD2G Migration Guide
No ratings yet
AKD to AKD2G Migration Guide
3 pages
How To Identify The Serial Number of An ONTAP Platform
No ratings yet
How To Identify The Serial Number of An ONTAP Platform
4 pages
IoT Enabling Technologies Overview
No ratings yet
IoT Enabling Technologies Overview
25 pages
Essential Run Commands for Windows
No ratings yet
Essential Run Commands for Windows
5 pages
22320 Winter 2022 Model Answer Paper
No ratings yet
22320 Winter 2022 Model Answer Paper
24 pages
Hitachi VSP 5000 Series Hardware Guide
No ratings yet
Hitachi VSP 5000 Series Hardware Guide
79 pages
Microprocessor Evolution Overview
No ratings yet
Microprocessor Evolution Overview
14 pages
Understanding RAID Levels and Benefits
No ratings yet
Understanding RAID Levels and Benefits
9 pages
Database Transaction Management Quiz
No ratings yet
Database Transaction Management Quiz
5 pages
Abrites Renault Commander User Manual
No ratings yet
Abrites Renault Commander User Manual
26 pages
Address Decoding in Microprocessors
No ratings yet
Address Decoding in Microprocessors
11 pages
ICDL Computer & Online Essentials Syllabus
No ratings yet
ICDL Computer & Online Essentials Syllabus
8 pages
Using Spark on NERSC's Cori System
No ratings yet
Using Spark on NERSC's Cori System
14 pages
BioStar 1 8 Administrator Guide - EN PDF
100% (1)
BioStar 1 8 Administrator Guide - EN PDF
291 pages
Ursalink UC11xx Control Protocol Guide
No ratings yet
Ursalink UC11xx Control Protocol Guide
9 pages
INMH Weather Data Overview
No ratings yet
INMH Weather Data Overview
2 pages
Introduction to Hive in Big Data
No ratings yet
Introduction to Hive in Big Data
17 pages
Azure Data Engineering Interview Q&A
100% (1)
Azure Data Engineering Interview Q&A
65 pages
SSH File Management Workshop Guide
No ratings yet
SSH File Management Workshop Guide
3 pages
Burst Compiler Configuration for Unity
No ratings yet
Burst Compiler Configuration for Unity
7 pages
TCP/IP Networking and Supernetting Guide
No ratings yet
TCP/IP Networking and Supernetting Guide
33 pages

Module 3 - Part 2

Uploaded by

Module 3 - Part 2

Uploaded by

Pipelining

Pipelining is a technique of decomposing a sequential process into sub-operations; with each

The space-time diagram of a four-segment pipeline is demonstrated in Fig47.

The speedup(S) of a pipeline processing over an equivalent non-pipeline processing is defined

You might also like