Introduction to Distributed File Systems

A Distributed File System (DFS) allows file sharing across multiple machines while maintaining transparency for users. It features location transparency and independence, enabling clients to access files without needing to know their physical storage locations. Key components include caching for performance, file replication for availability, and protocols like NFS for remote file access.

Uploaded by

Jonathan Yitskhaq Rundjan

DISTRIBUTED FILE SYSTEMS

an introduction to general DFS

DEFINITIONS:

• A traditional Distributed File System (DFS) is simply a classical model of a file system distributed across multiple machines. The purpose is to promote sharing of dispersed files with good transparency to users.

• The resources on a particular machine are local to itself. Resources on other machines are remote.

• A file system provides a service for clients. The server interface is the normal set of file
operations: create, read, etc. on files.


Clients, servers, and storage are dispersed across machines. Configuration and implementation may vary:

a) Servers may run on dedicated machines, OR

b) Servers and clients can be on the same machines.
c) The OS itself can be distributed (with the file system a part of that distribution).
d) A distribution layer can be interposed between a conventional OS and the file system.

Clients should view a DFS the same way they would a centralized FS; the distribution is
hidden at a lower level.

Performance is concerned with throughput and response time.

DISTRIBUTED FILE SYSTEMS: Naming and Transparency

Naming is the mapping between logical and physical objects.

• In a conventional file system, it's understood where the file actually resides; the
system and disk are known.

• In a transparent DFS, the location of a file, somewhere in the network, is hidden.

Location transparency -

The name of a file does not reveal any hint of the file's physical storage location
(machine, disk, or disk blocks).

Location independence -

• The name of a file doesn't need to be changed when the file's physical storage location changes. The mapping is dynamic and one-to-many: the same name can map to different locations at different times.

• Separates the naming hierarchy from the storage devices hierarchy.

Most DFSs today support location transparency.

NAMING SCHEMES:

1. Remote directories are mounted to local directories.

• So a local system seems to have a coherent directory structure.

• The remote directories must be explicitly mounted. The files are location
independent.

• SUN NFS is a good example of this technique.

2. A single global name structure spans all the files in the system.

• The DFS is built the same way as a local file system. Location independent.
• GFS and HDFS work this way.


IMPLEMENTATION TECHNIQUES:

• Can map directories or larger aggregates rather than individual files.

• A non-transparent mapping technique:

name ----> < system, disk, cylinder, sector >

• A transparent mapping technique:

name ----> file_identifier ----> < system, disk, cylinder, sector >

• So when changing the physical location of a file, only the mapping from the file identifier to its location needs to be updated; the name-to-identifier mapping is untouched. This identifier must be "unique" in the universe.
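
The transparent two-level mapping can be sketched in a few lines of Python. This is a minimal illustration, not code from any real DFS: the `Location` and `Namespace` names, the integer identifiers, and the example path are all assumptions made for the sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Location:
    """Physical location tuple, mirroring < system, disk, cylinder, sector >."""
    system: str
    disk: int
    cylinder: int
    sector: int


class Namespace:
    """Hypothetical two-level mapping: name -> file_identifier -> location."""

    def __init__(self):
        self._name_to_id = {}       # first level: user-visible name -> identifier
        self._id_to_location = {}   # second level: identifier -> physical location
        self._next_id = 0

    def create(self, name, location):
        file_id = self._next_id     # unique within this toy namespace only
        self._next_id += 1
        self._name_to_id[name] = file_id
        self._id_to_location[file_id] = location
        return file_id

    def resolve(self, name):
        return self._id_to_location[self._name_to_id[name]]

    def migrate(self, name, new_location):
        # Only the second-level entry changes; the name stays the same.
        self._id_to_location[self._name_to_id[name]] = new_location


ns = Namespace()
ns.create("/shared/report.txt", Location("hostA", 0, 12, 3))
ns.migrate("/shared/report.txt", Location("hostB", 1, 40, 7))
```

After `migrate`, clients that resolve the same name reach the new machine without any change on their side, which is the point of the extra level of indirection.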

DISTRIBUTED FILE SYSTEMS: Remote File Access

CACHING:

• Reduce network traffic by retaining recently accessed disk blocks in a cache, so that
repeated accesses to the same information can be handled locally.
• If required data is not already cached, a copy of data is brought from the server to the
user.
• Perform accesses on the cached copy.
• Each file is identified with one master copy residing at the server machine.
• Copies of (parts of) the file are scattered in different caches.

Cache Consistency Problem -- Keeping the cached copies consistent with the master file.
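
As a rough sketch of the caching scheme and the consistency problem it creates, the following Python models a server holding the master copy and a client that caches blocks on first access. The class names (`BlockServer`, `CachingClient`) and the in-memory "network" are assumptions made for illustration only.

```python
class BlockServer:
    """Holds the master copy of each file as a list of blocks."""

    def __init__(self, files):
        self.files = files
        self.reads = 0  # counts simulated network round trips

    def read_block(self, name, index):
        self.reads += 1
        return self.files[name][index]


class CachingClient:
    """Serves repeated reads locally; fetches from the server on a miss."""

    def __init__(self, server):
        self.server = server
        self.cache = {}  # (file name, block index) -> block data

    def read_block(self, name, index):
        key = (name, index)
        if key not in self.cache:
            self.cache[key] = self.server.read_block(name, index)
        return self.cache[key]


server = BlockServer({"notes.txt": [b"block0", b"block1"]})
client = CachingClient(server)

first = client.read_block("notes.txt", 0)   # miss: goes to the server
second = client.read_block("notes.txt", 0)  # hit: handled locally

# The master copy changes, but nothing tells the client.
server.files["notes.txt"][0] = b"changed"
stale = client.read_block("notes.txt", 0)   # still the old cached data
```

The second read costs no server access, and the third read returns stale data: the cached copy is no longer consistent with the master file.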


CACHE UPDATE POLICY:

• A write-through cache has good reliability, but the user must wait for each write to reach the server. Used by NFS.

• Delayed write: write requests complete more rapidly, and data may be overwritten in the cache before it is sent, saving a remote write. Poor reliability on a crash, since writes not yet sent to the server are lost.
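
The trade-off between the two policies can be made concrete with a small Python sketch. The classes below (`CountingServer`, `WriteThroughCache`, `DelayedWriteCache`) are hypothetical and ignore everything except the number of remote writes.

```python
class CountingServer:
    """Stand-in for the file server; counts remote writes."""

    def __init__(self):
        self.blocks = {}
        self.writes = 0

    def write(self, key, data):
        self.blocks[key] = data
        self.writes += 1


class WriteThroughCache:
    def __init__(self, server):
        self.server = server
        self.cache = {}

    def write(self, key, data):
        self.cache[key] = data
        self.server.write(key, data)  # caller waits for the server every time


class DelayedWriteCache:
    def __init__(self, server):
        self.server = server
        self.cache = {}
        self.dirty = set()

    def write(self, key, data):
        self.cache[key] = data        # completes immediately
        self.dirty.add(key)           # lost if the client crashes before flush

    def flush(self):
        for key in sorted(self.dirty):
            self.server.write(key, self.cache[key])
        self.dirty.clear()


wt_server, dw_server = CountingServer(), CountingServer()
wt, dw = WriteThroughCache(wt_server), DelayedWriteCache(dw_server)

for version in (b"v1", b"v2", b"v3"):
    wt.write("f:0", version)
    dw.write("f:0", version)

writes_before_flush = dw_server.writes
dw.flush()
```

Three overwrites of the same block cost three remote writes under write-through but only one under delayed write, at the price of holding unsaved data on the client until the flush.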


FILE REPLICATION:

• Duplicating files on multiple machines improves availability and performance.

• Replicas are placed on failure-independent machines (they won't fail together).

Replication management should be "location-opaque".

• The main problem is consistency - when one copy changes, how do other copies reflect
that change? Often there is a tradeoff: consistency versus availability and performance.

• Atomic and serialized invalidation isn't guaranteed (a message could get lost, or a machine could crash).

Example: SUN Network File System
OVERVIEW:

• Runs on SunOS. NFS is both an implementation and a specification of how to access remote files: both a definition and a specific instance.
• The goal: to share a file system in a transparent way.
• Uses the client-server model (in NFS, a node can be both simultaneously). It can act between any two nodes (no dedicated server). Mount makes a server file system visible from a client.

mount server:/usr/shared client:/usr/local

• Then, transparently, a request for /usr/local/dir-server accesses a file that is on the server.
• The mount is controlled by: (1) access rights, (2) server specification of what's mountable.
• Can use heterogeneous machines: different hardware, operating systems, network protocols.
• Uses RPC for isolation, so all implementations must have the same RPC calls. These RPCs implement the mount protocol and the NFS protocol.

17: Distributed File Systems 10


THE MOUNT PROTOCOL:

The following operations occur:

1. The client's request is sent via RPC to the mount server (on the server machine).

2. The mount server checks the export list, which contains

a) file systems that can be exported,

b) legal requesting clients.

It's legitimate to mount any directory within a legal file system.

3. Server returns "file handle" to client.

4. Server maintains list of clients and mounted directories -- this is state information!
But this data is only a "hint" and isn't treated as essential.

5. Mounting often occurs automatically when client or server boots.
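
The export-list check in steps 2 to 4 might look roughly like the Python below. This is a toy model, not the real mount daemon: `MountServer`, the dictionary-based export list, and the random `fh-` handle format are all assumptions for the sketch (real file handles are opaque values chosen by the server).

```python
import posixpath
import secrets


class MountServer:
    """Toy mount server: checks an export list and hands out file handles."""

    def __init__(self, exports):
        # exports: exported file system root -> set of legal client names
        self.exports = exports
        self.mounted = []  # (client, path) pairs; a hint only, not essential state

    def mount(self, client, path):
        path = posixpath.normpath(path)
        for root, clients in self.exports.items():
            # Any directory inside an exported file system may be mounted.
            inside = path == root or path.startswith(root.rstrip("/") + "/")
            if inside and client in clients:
                self.mounted.append((client, path))
                return "fh-" + secrets.token_hex(8)  # hypothetical opaque handle
        raise PermissionError(f"{client} may not mount {path}")


mountd = MountServer({"/usr/shared": {"clientA"}})
handle = mountd.mount("clientA", "/usr/shared/docs")

try:
    mountd.mount("clientB", "/usr/shared")
    denied = False
except PermissionError:
    denied = True
```

Here `clientA` receives a handle for a subdirectory of the exported file system, while `clientB`, which is not on the export list, is refused.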

THE NFS PROTOCOL:

RPCs support these remote file operations:

a) Search for file within directory.

b) Read a set of directory entries.
c) Manipulate links and directories.
d) Read/write file attributes.
e) Read/write file data.

Note:
• Open and close are conspicuously absent from this list. NFS servers are stateless: each request must provide all the information needed to carry it out, so a server crash loses no per-client state.

• Modified data must actually get to the server disk before the client is informed the action is complete. Using a cache would imply state information.

• A single NFS write is atomic. A client write request may be broken into several atomic RPC calls, so the whole thing is NOT atomic. Since lock management is stateful, NFS doesn't do it. A higher level must provide this service.
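
To illustrate what "each request must provide all information" means in practice, here is a minimal Python sketch of a stateless read/write interface. The `StatelessFileServer` class and the string handles are hypothetical; the only point is that every call names the file handle and the absolute offset, so the server keeps no open-file table or per-client position.

```python
class StatelessFileServer:
    """Toy server: no open(), no close(), no per-client file position."""

    def __init__(self):
        self.files = {}  # file handle -> bytearray (stands in for the disk)

    def read(self, handle, offset, count):
        return bytes(self.files[handle][offset:offset + count])

    def write(self, handle, offset, data):
        buf = self.files.setdefault(handle, bytearray())
        if len(buf) < offset:
            buf.extend(b"\0" * (offset - len(buf)))  # pad a gap with zeros
        buf[offset:offset + len(data)] = data
        return len(data)


srv = StatelessFileServer()
srv.write("fh-1", 0, b"hello ")
srv.write("fh-1", 6, b"world")
srv.write("fh-1", 6, b"world")   # a retransmitted request changes nothing
data = srv.read("fh-1", 0, 11)
```

Because the offset travels with the request, repeating a write after a lost reply leaves the file unchanged, which is one reason this style of interface tolerates server restarts well.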
NFS ARCHITECTURE:

[Figure: local and remote file access paths through the NFS architecture]

1. UNIX filesystem layer - does normal open / read / etc. commands.

2. Virtual file system ( VFS ) layer -

a) Gives clean layer between user and filesystem.

b) Acts as deflection point by using global vnodes.

c) Understands the difference between local and remote names.

d) Keeps in memory information about what should be deflected (mounted directories) and how to get to these remote directories.

3. System call interface layer -

a) Presents sanitized validated requests in a uniform way to the VFS.

PATH-NAME TRANSLATION:

• Break the complete pathname into components.

• For each component, do an NFS lookup using the component name + directory vnode.

• After a mount point is reached, each component piece will cause a server access.

• Can't hand the whole operation to server since the client may have a second mount on a
subsidiary directory (a mount on a mount ).

• A directory name cache on the client speeds up lookups.
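
The component-by-component translation can be sketched as follows. This is an illustrative Python model, not the NFS client code: the `translate` function, the `TREE` table of handles, and the `lookup` callback standing in for the lookup RPC are all assumptions.

```python
def translate(path, root_handle, lookup, name_cache=None):
    """Resolve a path one component at a time, as the client does.

    lookup(dir_handle, name) stands in for one lookup request to the server.
    name_cache maps (dir_handle, name) -> handle to skip repeated lookups.
    """
    cache = {} if name_cache is None else name_cache
    handle = root_handle
    for component in (part for part in path.split("/") if part):
        key = (handle, component)
        if key not in cache:
            cache[key] = lookup(handle, component)  # one server access
        handle = cache[key]
    return handle


# Hypothetical server-side directory tree: (directory handle, name) -> handle.
TREE = {
    ("root", "usr"): "h-usr",
    ("h-usr", "local"): "h-local",
    ("h-local", "notes.txt"): "h-notes",
}
calls = []


def lookup(dir_handle, name):
    calls.append((dir_handle, name))
    return TREE[(dir_handle, name)]


cache = {}
first = translate("/usr/local/notes.txt", "root", lookup, cache)
second = translate("/usr/local/notes.txt", "root", lookup, cache)
```

The first translation performs one lookup per component; the second is answered entirely from the directory name cache.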

CACHES OF REMOTE DATA:

• The client keeps:

File block cache - ( the contents of a file )
File attribute cache - ( file header info (inode in UNIX) ).

• The local kernel hangs on to the data after getting it the first time.

• On an open, the local kernel checks with the server that the cached data is still OK.

• Cached attributes are thrown away after a few seconds.

• Data blocks use read ahead and delayed write.

• Mechanism has:
Server consistency problems.
Good performance.
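
A time-limited attribute cache of the kind described above can be sketched in Python. The `AttributeCache` class, the 3-second lifetime, and the injected clock are assumptions chosen for the example; real clients use their own tunable timeouts.

```python
import time


class AttributeCache:
    """Caches file attributes and discards them after ttl_seconds."""

    def __init__(self, fetch, ttl_seconds=3.0, clock=time.monotonic):
        self.fetch = fetch      # stands in for a get-attributes request
        self.ttl = ttl_seconds
        self.clock = clock      # injectable so the sketch can be tested
        self.entries = {}       # handle -> (attributes, time fetched)

    def get(self, handle):
        now = self.clock()
        entry = self.entries.get(handle)
        if entry is None or now - entry[1] >= self.ttl:
            entry = (self.fetch(handle), now)  # expired or missing: ask server
            self.entries[handle] = entry
        return entry[0]


now = [0.0]      # fake clock, advanced by hand
fetches = []


def fetch(handle):
    fetches.append(handle)
    return {"size": 10 * len(fetches)}  # pretend the file grows between fetches


attrs = AttributeCache(fetch, ttl_seconds=3.0, clock=lambda: now[0])
a = attrs.get("fh-1")   # fetched from the server
now[0] = 1.0
b = attrs.get("fh-1")   # still fresh: served from the cache
now[0] = 4.0
c = attrs.get("fh-1")   # older than the lifetime: fetched again
```

Between fetches the client may act on out-of-date attributes, which is the consistency cost the slide mentions in exchange for fewer server round trips.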
