0% found this document useful (0 votes)

129 views3 pages

Overview of Shared Memory Systems

Shared memory systems connect processors to a global shared memory. Communication between processors occurs through reading and writing to shared memory. Performance can be impacted by contention when multiple processors access memory simultaneously. Cache coherency issues can also arise when copies of data in caches become inconsistent. Uniform memory access (UMA) systems provide equal access times to all memory for all processors. Non-uniform memory access (NUMA) systems attach local memory to each processor, resulting in non-uniform access times depending on data location.

Uploaded by

Pranav Kasliwal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

129 views3 pages

Overview of Shared Memory Systems

Uploaded by

Pranav Kasliwal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Shared Memory Architecture

Figure 1: Shared memory systems.

 Shared memory systems form a major category of multiprocessors. In this category,

all processors share a global memory (See Fig. 1).
 Communication between tasks running on different processors is performed through
writing to and reading from the global memory.
 All interprocessor coordination and synchronization is also accomplished via the
global memory.
 Two main problems need to be addressed when designing a shared memory system:
1. performance degradation due to contention. Performance degradation might
happen when multiple processors are trying to access the shared memory
simultaneously. A typical design might use caches to solve the contention
problem.
2. coherence problems. Having multiple copies of data, spread throughout the
caches, might lead to a coherence problem. The copies in the caches are
coherent if they are all equal to the same value. However, if one of the
processors writes over the value of one of the copies, then the copy becomes
inconsistent because it no longer equals the value of the other copies.
 Scalability remains the main drawback of a shared memory system.

Classification of Shared Memory Systems

Figure 2: Shared memory via two ports.

 The simplest shared memory system consists of one memory module (M) that can be
accessed from two processors P1 and P2 (see Fig. 2).
o Requests arrive at the memory module through its two ports. An arbitration unit
within the memory module passes requests through to a memory controller.
o If the memory module is not busy and a single request arrives, then the arbitration
unit passes that request to the memory controller and the request is satisfied.
o The module is placed in the busy state while a request is being serviced. If a new
request arrives while the memory is busy servicing a previous request, the memory
module sends a wait signal, through the memory controller, to the processor making
the new request.
o In response, the requesting processor may hold its request on the line until the
memory becomes free or it may repeat its request some time later.
o If the arbitration unit receives two requests, it selects one of them and passes it to
the memory controller. Again, the denied request can be either held to be served
next or it may be repeated some time later.

Uniform Memory Access (UMA)

Figure 3: Bus-based UMA (SMP) shared memory system.

 In the UMA system a shared memory is accessible by all processors through an

interconnection network in the same way a single processor accesses its memory.
 All processors have equal access time to any memory location. The interconnection network
used in the UMA can be a single bus, multiple buses, or a crossbar switch.
 Because access to shared memory is balanced, these systems are also called SMP (symmetric
multiprocessor) systems. Each processor has equal opportunity to read/write to memory,
including equal access speed.
o A typical bus-structured SMP computer, as shown in Fig. 3, attempts to reduce
contention for the bus by fetching instructions and data directly from each individual
cache, as much as possible.
o In the extreme, the bus contention might be reduced to zero after the cache
memories are loaded from the global memory, because it is possible for all
instructions and data to be completely contained within the cache.
 This memory organization is the most popular among shared memory systems.
 Examples of this architecture are Sun Starfire servers, HP V series, and Compaq AlphaServer
GS, Silicon Graphics Inc. multiprocessor servers.

Nonuniform Memory Access (NUMA)

Figure 4: NUMA shared memory system.

 In the NUMA system, each processor has part of the shared memory attached (see Fig. 4).
 The memory has a single address space. Therefore, any processor could access any memory
location directly using its real address. However, the access time to modules depends on the
distance to the processor. This results in a nonuniform memory access time.
 A number of architectures are used to interconnect processors to memory modules in a
NUMA. Among these are the tree and the hierarchical bus networks.
 Examples of NUMA architecture are BBN TC-2000, SGI Origin 3000, and Cray T3E.

Common questions

In a bus-based Uniform Memory Access (UMA) system, reducing contention is achieved by fetching instructions and data directly from each processor's cache as much as possible. By doing so, after loading cache memories from the global memory, the need to access the common bus decreases significantly, potentially reducing bus contention to zero. The primary advantage of this system is providing equal access time for all processors to any memory location, making it a balanced and symmetric (SMP) shared memory system .

NUMA (Nonuniform Memory Access) systems are distinct from UMA (Uniform Memory Access) systems primarily in the way memory is accessed. In NUMA architectures, each processor is attached to its own local memory, leading to varying memory access times depending on the proximity of a processor to the memory module. This architectural feature results in non-uniform access times but allows for greater scalability by dividing memory into processor-local segments. In contrast, UMA systems feature uniform access times since memory is a centralized component shared equally among all processors. This fundamental difference impacts how efficiently a system can scale and the strategies used to minimize access latency, especially in large-scale systems .

Uniform Memory Access (UMA) systems ensure that all processors have equal access time to any memory location, which leads to a balanced performance suitable for applications where uniform speed is critical. In contrast, Nonuniform Memory Access (NUMA) systems allow processor-specific memory, resulting in variable access times depending on the processor's distance to the memory module. While UMA provides consistent access times, NUMA systems can potentially offer better scalability and performance for large systems by reducing the reliance on centralized memory structures, albeit at the requirement of careful memory locality management to maintain performance .

NUMA architecture would outperform UMA in scenarios where applications can benefit from exploiting data locality and require scalable solutions. In applications where data can be partitioned to correspond to processor-local memory, such as large databases or high-performance computing tasks, a NUMA system can reduce memory access latency by localizing data access to processors' nearby memory modules. This reduces the bottleneck effect seen in UMA systems, making NUMA more suitable for environments requiring extensive parallel processing and high scalability, despite the complexity brought by variable memory access times and the need for sophisticated memory management .

In a shared memory system with two processors accessing a common memory module, the arbitration unit plays a critical role in managing simultaneous requests. When only one request arrives, the arbitration unit passes it to the memory controller immediately. However, if two requests arrive at the same time, the arbitration unit selects one to pass on while the other waits. This helps in orderly processing of requests, ensuring one processor's request does not indefinitely block the other's, thus managing access to the shared resource effectively .

In UMA systems, the type of interconnection network, such as a single bus, multiple buses, or a crossbar switch, significantly impacts the system's ability to balance memory access. A single bus may become a bottleneck under high demand, limiting the number of processors that can efficiently access memory concurrently. By contrast, multiple buses or a crossbar switch can improve concurrency by providing additional paths for data transfer, thus reducing contention. These configurations can more effectively manage increased traffic and provide balanced access times for multiple processors, enhancing overall system throughput .

Examples of systems using UMA architecture include Sun Starfire servers, HP V series, and Compaq AlphaServer GS systems. This architecture is favored in applications requiring balanced and equal memory access times for all processors, such as symmetric multiprocessing environments. It provides uniform memory access, which simplifies the development of parallel applications by ensuring predictable access speeds and uniform resource distribution .

While caches are often used in shared memory systems to mitigate contention by reducing direct memory access, they introduce potential coherence issues as a trade-off. Multiple cache copies of data can lead to inconsistency if one copy is updated while others are not. To resolve this, mechanisms such as cache coherence protocols are essential to maintain data consistency across different processor caches. These mechanisms, while solving the inconsistencies, can also introduce additional overhead and complexity into system design and operation, influencing overall system performance and efficiency .

The two main challenges in designing a shared memory system are performance degradation due to contention and coherence problems. Contention occurs when multiple processors attempt to access the shared memory simultaneously, leading to performance issues. This can be mitigated by implementing cache memory systems to handle simultaneous accesses more efficiently. Coherence problems arise when different processors have copies of the same data in their caches, leading to inconsistencies if one processor updates its copy. Ensuring coherence requires mechanisms like cache coherence protocols to maintain consistency across caches .

The scalability issue in shared memory systems arises due to the challenges in efficiently managing and coordinating access to shared resources as the number of processors increases. This challenge becomes a significant drawback because increased contention and coherence traffic can lead to bottlenecks, severely impacting system performance. The bus or interconnection network used to link processors to shared memory can become a limiting factor if it cannot accommodate the demand, which limits the system’s ability to scale effectively as more processors are added .

Inbound 7682828671927968610
No ratings yet
Inbound 7682828671927968610
20 pages
Understanding Parallel Computing
No ratings yet
Understanding Parallel Computing
9 pages
Classification of Distributed Systems
No ratings yet
Classification of Distributed Systems
16 pages
Shared vs. Message Passing Systems
No ratings yet
Shared vs. Message Passing Systems
14 pages
Shared Memory System Design Overview
No ratings yet
Shared Memory System Design Overview
31 pages
Shared vs. Distributed Memory in Computing
No ratings yet
Shared vs. Distributed Memory in Computing
22 pages
Memory Buffering Techniques in Switches
No ratings yet
Memory Buffering Techniques in Switches
25 pages
Chapter 2 - Memory Architecture For Multiprocessing - Revised
No ratings yet
Chapter 2 - Memory Architecture For Multiprocessing - Revised
64 pages
Thread-Level Parallelism in Multiprocessors
No ratings yet
Thread-Level Parallelism in Multiprocessors
74 pages
Multiprocessor Architecture Overview
No ratings yet
Multiprocessor Architecture Overview
39 pages
Memory Architectures in Parallel Computing
No ratings yet
Memory Architectures in Parallel Computing
14 pages
Lec4 - SMP NUMA Cache Coherence
No ratings yet
Lec4 - SMP NUMA Cache Coherence
45 pages
SIMD and MIMD Architectures Explained
No ratings yet
SIMD and MIMD Architectures Explained
16 pages
Parallel Computer Architecture Overview
No ratings yet
Parallel Computer Architecture Overview
18 pages
Overview of Shared Memory Architecture
No ratings yet
Overview of Shared Memory Architecture
17 pages
Overview of Multiprocessor Systems
No ratings yet
Overview of Multiprocessor Systems
17 pages
Shared Memory System Design Overview
No ratings yet
Shared Memory System Design Overview
31 pages
SMP and Cache Coherence Overview
No ratings yet
SMP and Cache Coherence Overview
34 pages
SIMD vs MIMD: Memory Impact & Costs
No ratings yet
SIMD vs MIMD: Memory Impact & Costs
70 pages
Parallel Computing in Search Engines
No ratings yet
Parallel Computing in Search Engines
12 pages
Overview of Distributed Shared Memory
No ratings yet
Overview of Distributed Shared Memory
36 pages
Parallel 1
No ratings yet
Parallel 1
15 pages
Parallel Computer Memory Architecture
No ratings yet
Parallel Computer Memory Architecture
31 pages
VTU BCS702 Parallel Computing Notes
No ratings yet
VTU BCS702 Parallel Computing Notes
25 pages
Shared vs. Distributed Memory in Computing
No ratings yet
Shared vs. Distributed Memory in Computing
6 pages
Lecture 27 Parallel Processing UMA NUMA
No ratings yet
Lecture 27 Parallel Processing UMA NUMA
13 pages
Shared Memory Systems in Parallel Processing
No ratings yet
Shared Memory Systems in Parallel Processing
24 pages
5............ CH3 Part 1
No ratings yet
5............ CH3 Part 1
24 pages
Parallel Processing Architectures Overview
No ratings yet
Parallel Processing Architectures Overview
27 pages
Lec 3
No ratings yet
Lec 3
5 pages
Overview of Embedded System Architecture
No ratings yet
Overview of Embedded System Architecture
10 pages
Multiprocessing in Computer Architecture
No ratings yet
Multiprocessing in Computer Architecture
8 pages
Wa0003
No ratings yet
Wa0003
12 pages
Introduction to Parallel Programming
No ratings yet
Introduction to Parallel Programming
37 pages
PC Ia2
No ratings yet
PC Ia2
5 pages
Parallel Processing Architectures Explained
No ratings yet
Parallel Processing Architectures Explained
7 pages
Multicore Processor Architecture Overview
No ratings yet
Multicore Processor Architecture Overview
19 pages
Multiprocessor and Multicomputer Systems
No ratings yet
Multiprocessor and Multicomputer Systems
11 pages
Hybrid Memory Architectures in Computing
No ratings yet
Hybrid Memory Architectures in Computing
18 pages
Memory Access Architectures Explained
No ratings yet
Memory Access Architectures Explained
4 pages
Overview of Shared Memory Multiprocessors
No ratings yet
Overview of Shared Memory Multiprocessors
99 pages
Multilevel Processors & Threading Concepts
No ratings yet
Multilevel Processors & Threading Concepts
11 pages
Shared-Memory Multiprocessors Overview
No ratings yet
Shared-Memory Multiprocessors Overview
8 pages
Characteristics of Multiprocessor Systems
No ratings yet
Characteristics of Multiprocessor Systems
14 pages
V3i9201434 PDF
No ratings yet
V3i9201434 PDF
6 pages
Shared vs Distributed Memory Architectures
79% (19)
Shared vs Distributed Memory Architectures
29 pages
Shared Memory Organization in Computing
No ratings yet
Shared Memory Organization in Computing
19 pages
Unit 1
No ratings yet
Unit 1
42 pages
Distributed Shared Memory Concepts
No ratings yet
Distributed Shared Memory Concepts
39 pages
Cache Coherence in Multiprocessor Systems
No ratings yet
Cache Coherence in Multiprocessor Systems
10 pages
SMP vs. DSM: Architecture Overview
No ratings yet
SMP vs. DSM: Architecture Overview
4 pages
PDC Project Report
No ratings yet
PDC Project Report
5 pages
Overview of Parallel Processor Types
No ratings yet
Overview of Parallel Processor Types
24 pages
Overview of Multiprocessor Systems
No ratings yet
Overview of Multiprocessor Systems
36 pages
Cache Coherence in Multiprocessor Systems
No ratings yet
Cache Coherence in Multiprocessor Systems
10 pages
Overview of Shared Memory Systems
No ratings yet
Overview of Shared Memory Systems
4 pages
Parallel and Scalable Architectures Overview
No ratings yet
Parallel and Scalable Architectures Overview
9 pages
DS Notes UNIT IV
No ratings yet
DS Notes UNIT IV
22 pages
PD Lecture 2
No ratings yet
PD Lecture 2
32 pages
Document Scanning Overview
No ratings yet
Document Scanning Overview
10 pages
Logic Structures in Knowledge Representation
No ratings yet
Logic Structures in Knowledge Representation
161 pages
Microwave Semiconductor Devices Overview
No ratings yet
Microwave Semiconductor Devices Overview
74 pages
Relationship Categories and Contacts
No ratings yet
Relationship Categories and Contacts
1 page
Document Scanned with CamScanner
No ratings yet
Document Scanned with CamScanner
11 pages
Family and Friends of Vivek Pasari
No ratings yet
Family and Friends of Vivek Pasari
1 page
Document Scanned with CamScanner
No ratings yet
Document Scanned with CamScanner
7 pages
Family and Friends Contact List
No ratings yet
Family and Friends Contact List
1 page
Document Scanned with CamScanner
No ratings yet
Document Scanned with CamScanner
26 pages
Family and Neighbors List
No ratings yet
Family and Neighbors List
1 page
Document Scanned with CamScanner
No ratings yet
Document Scanned with CamScanner
16 pages
Document Scanned with CamScanner
No ratings yet
Document Scanned with CamScanner
12 pages
Fixed vs Dynamic Partitioning in OS
No ratings yet
Fixed vs Dynamic Partitioning in OS
28 pages
DM70 Reference Manual
No ratings yet
DM70 Reference Manual
37 pages
Memory Organization in Computer Systems
No ratings yet
Memory Organization in Computer Systems
24 pages
EP3000 2D Scan Module User Guide
No ratings yet
EP3000 2D Scan Module User Guide
95 pages
Pico-8 Emulator Setup on Batocera
No ratings yet
Pico-8 Emulator Setup on Batocera
2 pages
Understanding Universal Serial Bus (USB)
No ratings yet
Understanding Universal Serial Bus (USB)
1 page
UEFI Setup and AMITSE Configuration
No ratings yet
UEFI Setup and AMITSE Configuration
1,003 pages
Fuzzy Control Development System
No ratings yet
Fuzzy Control Development System
6 pages
Fiber Optic Patch Panel Overview
No ratings yet
Fiber Optic Patch Panel Overview
2 pages
COA Syllabus for B.Tech 2024-25
No ratings yet
COA Syllabus for B.Tech 2024-25
6 pages
Zybo Z7 Pcam 5C Demo Overview
No ratings yet
Zybo Z7 Pcam 5C Demo Overview
11 pages
Weintek MT8072iP HMI Specifications
100% (1)
Weintek MT8072iP HMI Specifications
2 pages
HPE J2000 Flash Enclosure Overview
No ratings yet
HPE J2000 Flash Enclosure Overview
44 pages
GE PLC Practical Manual Guide
No ratings yet
GE PLC Practical Manual Guide
32 pages
FA11929 WindowsXP - OS EN
No ratings yet
FA11929 WindowsXP - OS EN
4 pages
Panasonic ToughBook Specs and Issues
No ratings yet
Panasonic ToughBook Specs and Issues
2 pages
Communication Skills Test Overview
No ratings yet
Communication Skills Test Overview
4 pages
Tax Invoice for MAX IT WORLD NX
No ratings yet
Tax Invoice for MAX IT WORLD NX
1 page
Electrical Supply Inventory List
No ratings yet
Electrical Supply Inventory List
1 page
Smart Console
100% (2)
Smart Console
50 pages
Key Components of Virtual Reality Systems
No ratings yet
Key Components of Virtual Reality Systems
3 pages
Lenovo IdeaPad Y580 I5-3210m
No ratings yet
Lenovo IdeaPad Y580 I5-3210m
4 pages
M15E M.2 5gbe: The Future of High-Speed Networking
No ratings yet
M15E M.2 5gbe: The Future of High-Speed Networking
1 page
Employee List with NIP Numbers
No ratings yet
Employee List with NIP Numbers
2 pages
eSIM Activation Codes and Details
No ratings yet
eSIM Activation Codes and Details
16 pages
(E0x Series Controller) - (Troubleshooting Manual) - (E)
No ratings yet
(E0x Series Controller) - (Troubleshooting Manual) - (E)
456 pages
i.MX Secure Boot with HABv4 Guide
No ratings yet
i.MX Secure Boot with HABv4 Guide
20 pages
DRYPIX 7000 Service Manual Guide
100% (1)
DRYPIX 7000 Service Manual Guide
150 pages
8051 Microcontroller Notes for BEC405A
No ratings yet
8051 Microcontroller Notes for BEC405A
105 pages
COA Unit 1 Important Questions
No ratings yet
COA Unit 1 Important Questions
49 pages
Cto Underground Huawei
No ratings yet
Cto Underground Huawei
8 pages

Overview of Shared Memory Systems

Uploaded by

Overview of Shared Memory Systems

Uploaded by

Shared Memory Architecture

Figure 1: Shared memory systems.

 Shared memory systems form a major category of multiprocessors. In this category,

Classification of Shared Memory Systems

Figure 2: Shared memory via two ports.

Uniform Memory Access (UMA)

Figure 3: Bus-based UMA (SMP) shared memory system.

 In the UMA system a shared memory is accessible by all processors through an

Nonuniform Memory Access (NUMA)

Figure 4: NUMA shared memory system.

Common questions

How does the architecture of a bus-based Uniform Memory Access (UMA) system mitigate contention issues, and what is its primary advantage over other shared memory systems?

Identify the architectural features that distinguish NUMA systems from UMA systems, and describe how these impact memory access patterns.

Compare the Uniform Memory Access (UMA) and Nonuniform Memory Access (NUMA) systems in terms of memory access times and architectural influence on performance.

Evaluate the potential application scenarios where NUMA architecture would outperform UMA, considering the scalability and access patterns involved.

Discuss the role of arbitration units in a shared memory system with two processors, and how they handle simultaneous requests.

How does the use of a single bus, multiple buses, or a crossbar switch in UMA systems affect the system's ability to balance memory access?

What are some examples of systems using UMA architecture, and why is it favored in certain applications?

What are the potential trade-offs when using caches in shared memory systems to solve contention issues, and how do they influence coherence?

What are the main challenges associated with designing a shared memory system, and how can these challenges be addressed?

Explain how the scalability issue acts as a main drawback of shared memory systems and its potential impacts on system performance.

You might also like