Parallel Processor Taxonomy and Programming

Uploaded by

sunehagumber4

Parallel Processors

1. Taxonomy and Topology

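Taxonomy means classification. In parallel computing, systems are classified
by how they handle instructions and data.
The best-known classification is Flynn’s taxonomy, which groups machines by the number
of concurrent instruction streams and data streams: SISD, SIMD, MISD, and MIMD.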

1. Shared Memory Multiprocessors

In a shared memory system, all processors access a common global memory.

Characteristics:
• Memory is shared, accessible by all processors.
• Communication is done through reading/writing shared variables.
• Suitable for fine-grained parallelism (tight synchronization).

Types:
1. Uniform Memory Access (UMA):
o All processors have equal access time to memory.
o Example: symmetric multiprocessors (SMPs).
2. Non-Uniform Memory Access (NUMA):
o Access time varies based on memory location (local vs remote).
o Example: multi-socket servers built on processors such as AMD EPYC or Intel Xeon.

2. Distributed Memory Networks

In distributed memory systems, each processor has its own local memory and
communicates via message passing.

Characteristics:
• No shared memory.
• Communication via message passing (e.g., MPI).
• Highly scalable and used in clusters or supercomputers.

Examples:
• Computer clusters
• Cloud computing servers
• GPU networks

Advantages:
• Scalable for large systems.
• Each node can operate independently.

Challenges:
• Harder to program due to explicit communication.
• Synchronization and data consistency must be manually handled.

Processor Organization
Defines how processors and memory units are structured and connected.
• Centralized organization: One main memory shared by all processors.
• Distributed organization: Each processor has its own memory and
communicates through a network.
• Hybrid organization: Combines shared and distributed memory models.

Static Interconnection Network

Static interconnection networks use fixed links between nodes that do not change
during execution. In a unidirectional static interconnection network, each link
carries data in only one direction: a node can transmit to its neighbor, but the
neighbor cannot reply over the same link. In a bidirectional static interconnection
network, each link carries data in both directions. The choice between the two
depends on the specific requirements of the parallel computing system.
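
The difference between the two link types can be illustrated with a small sketch. It assumes a ring of n nodes as the static topology (the text does not name one), and the helper names `ring_links` and `hops` are hypothetical.

```python
from collections import deque


def ring_links(n, bidirectional):
    """Return {node: set of nodes it can send to} for an n-node ring."""
    links = {i: {(i + 1) % n} for i in range(n)}  # clockwise links only
    if bidirectional:
        for i in range(n):
            links[i].add((i - 1) % n)  # add the reverse direction
    return links


def hops(links, src, dst):
    """Breadth-first search: fewest links a message crosses from src to dst."""
    dist = {src: 0}
    frontier = deque([src])
    while frontier:
        node = frontier.popleft()
        if node == dst:
            return dist[node]
        for nxt in links[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                frontier.append(nxt)
    return None  # dst is unreachable


if __name__ == "__main__":
    print(hops(ring_links(6, bidirectional=False), 0, 5))
    print(hops(ring_links(6, bidirectional=True), 0, 5))
```

In the unidirectional ring a message from node 0 to node 5 must travel all the way around, while the bidirectional ring reaches it over a single link.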
There are also two types of static interconnection network.

Embeddings and Simulations

1. Embedding in Parallel Processing

Definition
In parallel computing, embedding means mapping one network (or topology) into another
so that a parallel algorithm designed for one architecture can run efficiently on another.
In simpler terms:
“Embedding is the process of placing (mapping) the nodes and links of one parallel
computer network (called the guest) onto another network (called the host) in an efficient
way.”
An embedding is a function that maps the nodes of the guest graph (G) onto nodes of the
host graph (H), and the edges of G onto paths in H.
Embedding: G -> H

Why Embedding is Needed

• Different parallel computers have different topologies (like mesh, ring, hypercube,
etc.).
• A parallel algorithm may be optimized for one topology, but your hardware might
use another.
• Instead of redesigning the algorithm, we embed one topology into another.
Example:
If an algorithm was designed for a ring network, but your computer uses a mesh topology,
you embed the ring into the mesh and execute it there.
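
The ring-into-mesh example can be sketched as follows. This is one possible construction, not the only one: it assumes a 2D mesh with an even number of rows and at least two columns, and the function names `embed_ring_in_mesh` and `dilation` are hypothetical. Each ring node is mapped to a mesh coordinate so that consecutive ring nodes land on neighboring mesh nodes.

```python
def embed_ring_in_mesh(rows, cols):
    """Map ring node i to cycle[i], a (row, col) position in the mesh.

    Assumes rows is even and rows, cols >= 2, so a closed tour exists.
    """
    if rows < 2 or cols < 2 or rows % 2 != 0:
        raise ValueError("this sketch needs an even number of rows and cols >= 2")
    # Row 0: left to right across every column.
    cycle = [(0, c) for c in range(cols)]
    # Remaining rows: snake through columns 1..cols-1 only.
    for r in range(1, rows):
        if r % 2 == 1:
            columns = range(cols - 1, 0, -1)  # right to left
        else:
            columns = range(1, cols)          # left to right
        cycle.extend((r, c) for c in columns)
    # Column 0 is the return path back up to the start.
    cycle.extend((r, 0) for r in range(rows - 1, 0, -1))
    return cycle


def dilation(cycle):
    """Longest mesh distance between images of neighboring ring nodes."""
    n = len(cycle)
    return max(
        abs(cycle[i][0] - cycle[(i + 1) % n][0])
        + abs(cycle[i][1] - cycle[(i + 1) % n][1])
        for i in range(n)
    )


if __name__ == "__main__":
    mapping = embed_ring_in_mesh(4, 4)
    print(mapping)
    print("dilation:", dilation(mapping))
```

A dilation of 1 means every ring link maps to a single mesh link, so the ring algorithm runs on the mesh without extra communication hops.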

Applications of Embedding
• To simulate algorithms from one architecture on another.
• To design universal architectures that can support multiple algorithm types.
• To measure efficiency of network topologies for various applications.

2. Simulation in Parallel Processing

Definition
Simulation is the process of using one parallel architecture (the host) to imitate or reproduce
the behavior of another (the guest).

In short:
Simulation allows one type of parallel computer to behave as if it were another type.

Why Simulation is Needed

• Algorithms are often designed for specific architectures (e.g., hypercube, mesh,
shared memory).
• Real hardware may have a different structure.
• By simulating, we can execute algorithms meant for one model on another without
rewriting them.

Types of Simulation
1. Network Simulation – Simulating one network topology on another (e.g., hypercube
on mesh).
2. Memory Simulation – Simulating shared memory using distributed memory (or vice
versa).
3. Algorithmic Simulation – Running an algorithm designed for one model on another.

Example
Suppose we want to simulate a shared memory system on a distributed memory machine.
We can:
• Use message passing to emulate shared variables.
• Create a memory consistency protocol so that updates appear synchronized.
• Each processor acts as if it has access to all memory, even though physically, data is
exchanged by messages.
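This is the idea behind distributed shared memory (DSM) systems. Hybrid MPI + OpenMP
programs are different: they use OpenMP for real shared memory within a node and MPI
for message passing between nodes.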
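
A minimal sketch of emulating shared variables with messages is given below. It is an illustration under simplifying assumptions: a single owner thread holds all the data, Python queues stand in for the network, and the names `memory_server` and `RemoteMemory` are hypothetical. Real systems add caching and a consistency protocol, which this sketch omits.

```python
import queue
import threading


def memory_server(requests):
    """Owner of the data: serves read/write request messages one at a time."""
    store = {}
    while True:
        op, key, value, reply = requests.get()
        if op == "stop":
            break
        if op == "write":
            store[key] = value
            reply.put(None)            # acknowledge the write
        else:                          # "read"
            reply.put(store.get(key))


class RemoteMemory:
    """Client view: looks like shared variables, but every access is a message."""

    def __init__(self, requests):
        self._requests = requests

    def write(self, key, value):
        reply = queue.Queue()
        self._requests.put(("write", key, value, reply))
        reply.get()                    # wait for the acknowledgement

    def read(self, key):
        reply = queue.Queue()
        self._requests.put(("read", key, None, reply))
        return reply.get()


def demo():
    requests = queue.Queue()
    server = threading.Thread(target=memory_server, args=(requests,))
    server.start()
    mem = RemoteMemory(requests)
    mem.write("x", 41)
    mem.write("x", mem.read("x") + 1)
    result = mem.read("x")
    requests.put(("stop", None, None, None))
    server.join()
    return result


if __name__ == "__main__":
    print(demo())
```

Because the server handles one request at a time, all clients see updates in the same order, which is the simplest form of the consistency requirement mentioned above.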

In short
• Embedding = Mapping (structural relation)
• Simulation = Behavior imitation (functional execution)
• A good embedding leads to an efficient simulation.

1. Shared Memory Programming

Concept
In shared memory programming, multiple processors or threads share a single, common
memory space.
• Every processor can directly read and write to the same memory.
• Communication happens implicitly through memory — no explicit messages.
• Used in multicore processors, SMP (Symmetric Multiprocessing) systems.

Architecture
        +---------+
        | Memory  |
        +---------+
         /   |   \
        /    |    \
     CPU1  CPU2  CPU3
-> All CPUs access the same global memory via a bus or interconnection network.
-> Each CPU may also have a local cache for speed.

Advantages
• Simple to program (shared variables).
• Fast communication (no network overhead).
• Good for fine-grained parallel tasks.

Disadvantages
• Limited scalability (can’t grow beyond one machine easily).
• Synchronization overhead.
• Debugging race conditions is difficult.
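
The following sketch illustrates communication through a shared variable and the synchronization it requires. It assumes Python threads as a stand-in for processors, and the function name `count_with_threads` is hypothetical. All threads update the same counter, and a lock prevents the race condition that unsynchronized read-modify-write updates could cause.

```python
import threading


def count_with_threads(num_threads=4, increments=10_000):
    """Every thread adds to one shared counter; the lock serializes updates."""
    counter = 0
    lock = threading.Lock()

    def worker():
        nonlocal counter
        for _ in range(increments):
            with lock:          # only one thread may update at a time
                counter += 1    # read-modify-write on shared memory

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter


if __name__ == "__main__":
    print(count_with_threads())
```

No messages are exchanged: the threads communicate only by reading and writing `counter`. The lock is the synchronization overhead listed among the disadvantages.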

2. Distributed Memory Programming

Concept
In distributed memory programming, each processor has its own private local memory.
Processors communicate by explicitly sending and receiving messages.
• No global address space.
• Used in clusters, supercomputers, cloud computing, and HPC (High-Performance
Computing).

Architecture
+---------+     +---------+     +---------+
|  CPU1   |<--->|  CPU2   |<--->|  CPU3   |
| Memory1 |     | Memory2 |     | Memory3 |
+---------+     +---------+     +---------+
Each node has:
• Its own CPU(s)
• Its own memory
• A network interface to communicate with other nodes

Features
• Each process has its own memory space.
• Communication done via message passing.
• You must partition data and explicitly coordinate between nodes.

Advantages
• Highly scalable (can run on thousands of nodes).
• Each node has its own memory, so there is no contention for a shared memory.
• Fault tolerance is easier (a failed node can be restarted).

Disadvantages
• Programmer must handle communication manually.
• Higher latency (network delay).
• Harder to debug and maintain.
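
The explicit partition, send, and receive pattern can be sketched as follows. This is a model only: threads stand in for nodes and Python queues stand in for the network, whereas a real cluster program would typically use a message passing library such as MPI. The names `worker` and `distributed_sum` are hypothetical.

```python
import queue
import threading


def worker(rank, inbox, to_root):
    """A node: works only on data it received, then sends its result."""
    chunk = inbox.get()                  # receive this node's share of the data
    to_root.put((rank, sum(chunk)))      # send the partial sum to the root


def distributed_sum(data, num_nodes=3):
    inboxes = [queue.Queue() for _ in range(num_nodes)]
    to_root = queue.Queue()
    nodes = [
        threading.Thread(target=worker, args=(rank, inboxes[rank], to_root))
        for rank in range(num_nodes)
    ]
    for node in nodes:
        node.start()
    # Partition the data and send one piece to each node (slices are copies).
    for rank in range(num_nodes):
        inboxes[rank].put(data[rank::num_nodes])
    # Collect one partial result per node and combine them.
    total = sum(to_root.get()[1] for _ in range(num_nodes))
    for node in nodes:
        node.join()
    return total


if __name__ == "__main__":
    print(distributed_sum(list(range(1, 101))))
```

Each worker touches only the chunk it was sent, so the programmer has to decide how to split the data and when to exchange results, as the features above describe.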

Object-Oriented Programming (OOP)

What it is:
OOP is a style of programming where we think in terms of objects rather than just
instructions. Each object has:
• Data (called attributes or properties)
• Behavior (called methods or functions)
Main Concepts of OOP:
1. Class:
o A blueprint or template for creating objects.
o Example: class Car { public: int speed; void drive() {} };
2. Object:
o A concrete instance of a class.
o Example: Car myCar;
3. Encapsulation:
o Keeping data (attributes) safe inside the object.
o Only allow access through methods.
4. Inheritance:
o A class can reuse features (properties & methods) of another class.
o Example: class ElectricCar : public Car {};
5. Polymorphism:
o Objects can behave in multiple ways.
o Example: Method drive() works differently for Car and Bike.
6. Abstraction:
o Show only important details, hide unnecessary details.
o Example: Using a remote without knowing how it works internally.
Example in Real Life:
• Class: Car
• Object: Your red Honda
• Attributes: Color, speed
• Methods: Start(), Stop(), Honk()

Syntax (C++):

#include <iostream>
#include <string>
using namespace std;

class Car {
public:
    string color; // Attribute
    int speed;    // Attribute

    // Method (behavior): prints the car's current speed
    void drive() {
        cout << "Car is driving at speed " << speed << endl;
    }
};
Advantages of OOP:
• Makes code reusable, modular, and easy to maintain.
• Models real-world objects and protects data.
Disadvantages of OOP:
• Can be complex, may use more memory, and can be slower for small programs.

1. Data Parallel Programming

Definition:
• A type of parallel programming where the same operation is performed
simultaneously on multiple data elements.
Key Points:
• Focuses on data, not tasks.
• Each processor works on a portion of the data.
• Common in scientific computing, image processing, and simulations.
Example:
• Adding two arrays element by element in parallel.
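
The array addition example might look like the sketch below. It assumes a thread pool as the set of processors, and the names `add_chunk` and `parallel_add` are hypothetical. Every worker applies the same operation (element-wise addition) to its own portion of the data.

```python
from concurrent.futures import ThreadPoolExecutor


def add_chunk(a, b, start, stop):
    """The same operation applied to one portion of the data."""
    return [a[i] + b[i] for i in range(start, stop)]


def parallel_add(a, b, workers=4):
    if len(a) != len(b):
        raise ValueError("arrays must have the same length")
    n = len(a)
    size = max(1, -(-n // workers))  # ceiling division: elements per worker
    bounds = [(s, min(s + size, n)) for s in range(0, n, size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(lambda se: add_chunk(a, b, se[0], se[1]), bounds)
        # map() yields results in input order, so the chunks line up again.
        return [x for part in parts for x in part]


if __name__ == "__main__":
    print(parallel_add([1, 2, 3, 4, 5], [10, 20, 30, 40, 50], workers=2))
```

In standard CPython, threads do not speed up CPU-bound work like this because of the global interpreter lock; the sketch shows the decomposition, not the performance.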

2. Functional Programming
Definition:
• A programming style where programs are constructed using pure functions and
immutable data.
• Functions do not change state or have side effects.
Key Points:
• Emphasizes declarative style (what to do, not how).
• Functions can be passed as arguments, returned from other functions.
• Languages: Haskell and Scala; Python and JavaScript support the style in part.
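
A short sketch of these ideas, using Python's functional features. The helper `compose` and the sample data are hypothetical. The functions are pure, the data is an immutable tuple, and functions are passed to and returned from other functions.

```python
from functools import reduce


def compose(*funcs):
    """Return a new function that applies funcs from right to left."""
    return reduce(lambda f, g: lambda x: f(g(x)), funcs, lambda x: x)


def square(x):
    return x * x        # pure: result depends only on the argument


def increment(x):
    return x + 1        # pure: no state is read or changed


readings = (1, 2, 3)    # a tuple cannot be modified in place
square_then_increment = compose(increment, square)
result = tuple(map(square_then_increment, readings))

if __name__ == "__main__":
    print(result)       # (2, 5, 10)
    print(readings)     # the original data is unchanged: (1, 2, 3)
```

Because nothing is mutated, each element could be processed independently, which is one reason this style suits parallel execution.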

3. Data Flow Programming

Definition: A programming paradigm where the program is represented as a graph of data
flowing between operations.
• Execution depends on availability of input data, not the order of instructions.
Key Points:
• Good for parallelism and reactive systems.
• Each node performs an operation and sends output to other nodes.
• Examples: LabVIEW, Apache NiFi.
Example:
• A graph where sensor data flows → filtered → analyzed → stored.
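
The sensor example can be sketched as a tiny data flow interpreter. This is an illustration only: the function `run_dataflow`, the node names, and the sample readings are all hypothetical. A node fires as soon as all of its inputs are available, regardless of the order in which the nodes were declared.

```python
def run_dataflow(nodes, inputs):
    """nodes: {name: (function, [input names])}; inputs: {name: value}.

    Repeatedly fires every node whose inputs all have values.
    """
    values = dict(inputs)
    pending = dict(nodes)
    while pending:
        ready = [
            name for name, (_, deps) in pending.items()
            if all(d in values for d in deps)
        ]
        if not ready:
            raise ValueError("graph has a cycle or a missing input")
        for name in ready:
            func, deps = pending.pop(name)
            values[name] = func(*(values[d] for d in deps))
    return values


# Declared in reverse order on purpose: execution follows data, not text order.
graph = {
    "stored": (lambda avg: {"average": avg}, ["analyzed"]),
    "analyzed": (lambda xs: sum(xs) / len(xs), ["filtered"]),
    "filtered": (lambda xs: [x for x in xs if 0 <= x <= 100], ["sensor"]),
}

result = run_dataflow(graph, {"sensor": [20, 22, -999, 24]})

if __name__ == "__main__":
    print(result["stored"])   # {'average': 22.0}
```

Nodes that become ready in the same pass do not depend on each other, so a parallel runtime could execute them at the same time.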
In short:
• Data Parallel: Same operation on many data items simultaneously.
• Functional: Uses pure functions, avoids changing data.
• Data Flow: Computation happens as data moves through a network of operations.

1. Scheduling in Parallel Programs

Definition:
Scheduling in parallel programming is the process of deciding which processor runs each task
(or thread), and in what order, to improve efficiency and reduce execution time.
Goals:
• Minimize total execution time.
• Maximize processor utilization.
• Balance workload among processors.
Types of Scheduling:
1. Static Scheduling:
o Task assignments are made before execution.
o Works well when task sizes and execution times are predictable.
o Pros: Simple, less overhead.
o Cons: Not flexible if task times vary.
2. Dynamic Scheduling:
o Tasks are assigned during execution as processors become free.
o Handles unpredictable workloads efficiently.
o Pros: Flexible, better load balancing.
o Cons: More overhead due to runtime decision-making.
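
The trade-off between the two types can be illustrated with a small simulation. It rests on assumptions not in the text: static scheduling is modeled as round-robin pre-assignment, dynamic scheduling as handing the next task to whichever processor becomes free first, and scheduling overhead is ignored. The function names and task durations are hypothetical.

```python
import heapq


def static_makespan(durations, processors):
    """Tasks are pre-assigned round-robin before execution starts."""
    loads = [0] * processors
    for i, d in enumerate(durations):
        loads[i % processors] += d
    return max(loads)  # finish time of the busiest processor


def dynamic_makespan(durations, processors):
    """Each task goes to whichever processor becomes free first."""
    free_at = [0] * processors
    heapq.heapify(free_at)
    for d in durations:
        start = heapq.heappop(free_at)      # earliest available processor
        heapq.heappush(free_at, start + d)
    return max(free_at)


if __name__ == "__main__":
    tasks = [8, 1, 1, 1, 8, 1, 1, 1]        # uneven task times
    print(static_makespan(tasks, 2))        # 18
    print(dynamic_makespan(tasks, 2))       # 11
```

With uneven task times the static assignment leaves one processor idle while the other is overloaded; the dynamic version balances the load, at the cost of runtime decisions that this model does not charge for.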

2. Loop Scheduling in Parallel Programs

Definition:
Loop scheduling is a technique in parallel programming where iterations of a
loop are divided among multiple processors. This is important because loops
often dominate computation in scientific and data-intensive programs.
Common Loop Scheduling Methods:
1. Static (Block) Scheduling:
o Divide loop iterations into equal blocks and assign each block to a processor.
o Example: 16 iterations, 4 processors → each gets 4 iterations.
2. Cyclic (Round-Robin) Scheduling:
o Assign iterations one by one in a round-robin fashion.
o Example (4 processors): Processor 1 gets iterations 1, 5, 9, …; Processor 2 gets 2, 6, 10, …
3. Dynamic (Chunked) Scheduling:
o Divide loop into chunks and assign them to processors as they become free.
o Good for loops with varying iteration times.
Advantages of Loop Scheduling:
• Balances workload among processors.
• Reduces idle time and improves performance.
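
The block and cyclic assignments from the examples above can be computed as in this sketch. The function names are hypothetical, and iterations are numbered from 1 to match the examples. Dynamic scheduling is not shown because its assignment depends on runtime timing.

```python
def block_schedule(n, p):
    """Contiguous blocks; sizes differ by at most one if p does not divide n."""
    base, extra = divmod(n, p)
    schedule, start = [], 1
    for proc in range(p):
        size = base + (1 if proc < extra else 0)
        schedule.append(list(range(start, start + size)))
        start += size
    return schedule


def cyclic_schedule(n, p):
    """Round-robin: processor k (counting from 1) gets k, k + p, k + 2p, ..."""
    return [list(range(proc + 1, n + 1, p)) for proc in range(p)]


if __name__ == "__main__":
    print(block_schedule(16, 4)[0])    # [1, 2, 3, 4]
    print(cyclic_schedule(16, 4)[0])   # [1, 5, 9, 13]
    print(cyclic_schedule(16, 4)[1])   # [2, 6, 10, 14]
```

Block scheduling keeps neighboring iterations together, which tends to help memory locality; cyclic scheduling spreads iterations out, which helps when iteration cost grows or shrinks steadily across the loop.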

Parallelization of Sequential Programs

Definition:
Parallelization is the process of converting a sequential (one step at a time) program into a
parallel program so that multiple tasks can execute simultaneously on multiple processors.
Why Parallelize:
• To reduce execution time.
• To utilize multiple processors efficiently.
• To handle large data or complex computations faster.
Steps to Parallelize a Sequential Program:
1. Identify Independent Tasks:
o Find parts of the program that can run simultaneously without depending on
each other.
o Example: In a loop, iterations that do not affect each other.
2. Data Decomposition:
o Divide data into chunks to be processed by different processors.
o Example: Splitting an array into parts for addition.
3. Task Decomposition:
o Divide the program into independent tasks or functions that can run in
parallel.
4. Synchronization:
o Ensure proper coordination if tasks share data to avoid conflicts.
5. Implement Parallel Code:
o Use parallel programming constructs like threads, OpenMP, MPI, or GPU
kernels.
Benefits:
• Faster execution.
• Better resource utilization.
Challenges:
• Data dependencies may prevent full parallelization.
• Synchronization overhead.
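
The steps above can be sketched on a small problem, counting primes below a limit. The choice of problem and all function names are hypothetical. The sequential loop has independent iterations (step 1), so the range is split into chunks (step 2), each chunk is counted as a separate task, and the partial counts are combined at the end.

```python
from concurrent.futures import ThreadPoolExecutor


def is_prime(n):
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def count_primes_sequential(limit):
    """Original program: one loop, one step at a time."""
    return sum(1 for n in range(limit) if is_prime(n))


def count_primes_parallel(limit, workers=4):
    # Data decomposition: split [0, limit) into contiguous ranges.
    size = max(1, -(-limit // workers))  # ceiling division
    ranges = [(s, min(s + size, limit)) for s in range(0, limit, size)]

    def count_range(bounds):
        lo, hi = bounds
        return sum(1 for n in range(lo, hi) if is_prime(n))

    # Each range is an independent task; no shared data is written,
    # so the only synchronization is collecting the partial counts.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(count_range, ranges))


if __name__ == "__main__":
    print(count_primes_sequential(100), count_primes_parallel(100))
```

Both versions return the same count. For real speedup on CPU-bound work in standard CPython, a process pool would be needed instead of threads; the decomposition itself stays the same.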

How Parallel Programming Environment Supports Programs

A parallel programming environment provides the tools, libraries, and systems needed to
make writing and running parallel programs easier and more efficient. It helps in the
following ways:
1. Programming Support:
o Provides languages or extensions that let programmers write parallel code
easily.
o Example: OpenMP, MPI, CUDA.
2. Task Management & Scheduling:
o Assigns tasks to processors efficiently.
o Ensures tasks run in parallel without conflicts.
3. Communication & Synchronization:
o Provides mechanisms to share data safely among tasks or processors.
o Examples: Message passing, locks, semaphores, barriers.
4. Runtime Support:
o Handles execution of parallel tasks, monitors progress, balances workload.
5. Debugging & Profiling Tools:
o Help detect errors such as race conditions and locate performance bottlenecks.
6. Hardware Abstraction:
o Hides hardware complexity (multi-core CPUs, clusters, GPUs) so programmers
can focus on parallel logic.
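
As one illustration of the synchronization support in item 3, the sketch below uses a barrier from Python's standard `threading` module. The two-phase computation and the function name `two_phase` are hypothetical. No worker starts phase 2 until every worker has finished phase 1.

```python
import threading


def two_phase(num_workers=4):
    squares = [0] * num_workers
    totals = [0] * num_workers
    barrier = threading.Barrier(num_workers)

    def worker(rank):
        squares[rank] = (rank + 1) ** 2   # phase 1: write only our own slot
        barrier.wait()                    # block until all slots are filled
        totals[rank] = sum(squares)       # phase 2: safe to read every slot

    threads = [
        threading.Thread(target=worker, args=(rank,))
        for rank in range(num_workers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return totals


if __name__ == "__main__":
    print(two_phase())   # [30, 30, 30, 30]
```

Without the barrier, a fast worker could sum the list before slower workers had written their values. The environment supplies the primitive, and the programmer only marks where the phases meet.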
