Lecture Notes on Distributed File Systems | Study notes Computer Science






2002, M. T. Harandi and J. Hou (modified: I. Gupta)


Computer Science 425

Distributed Systems


Lecture 22

Distributed File Systems

Reading: Chapter 8






File Systems



A file is a collection of data with a user view (file structure) and a physical view (blocks).

A directory is a file that provides a mapping from text names to internal file identifiers.

File systems implement file management:

    Naming and locating a file

    Accessing a file: create, delete, open, close, read, write, append, truncate

    Physical allocation of a file

    Security and protection of a file

A distributed file system (DFS) is a file system with distributed storage and distributed users. Files may be located remotely on servers and accessed by multiple clients.

    E.g., Sun NFS and AFS

A DFS provides transparency of location, access, and migration of files.

DFS systems use cache replicas for efficiency and fault tolerance.






File Attributes & System Modules

File layout (figure): File attribute record | Block | Block | Block

File attribute record:

    length

    creation timestamp

    read timestamp

    write timestamp

    attribute timestamp

    reference count

    file type

    ownership

    access control list

File system modules (figure):

    Directory module

    File module

    Access control module

    File access module

    Block module

    Device module
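The file attribute record listed above can be pictured as a small data structure stored alongside a file's blocks. The following is only a sketch: the class names (FileAttributes, StoredFile), the field types, and the default values are assumptions made for illustration and are not taken from the slides.

```python
from dataclasses import dataclass, field
import time


@dataclass
class FileAttributes:
    # Field names follow the attribute list; types and defaults are assumed.
    length: int = 0
    creation_timestamp: float = field(default_factory=time.time)
    read_timestamp: float = 0.0
    write_timestamp: float = 0.0
    attribute_timestamp: float = 0.0
    reference_count: int = 1
    file_type: str = "regular"
    owner: str = ""
    access_control_list: dict = field(default_factory=dict)


@dataclass
class StoredFile:
    # A file is modelled as one attribute record plus a list of data blocks.
    attributes: FileAttributes
    blocks: list = field(default_factory=list)

    def append_block(self, data: bytes) -> None:
        # Writing data also updates the length and the write timestamp.
        self.blocks.append(data)
        self.attributes.length += len(data)
        self.attributes.write_timestamp = time.time()
```

For example, appending a 4-byte block and a 2-byte block to a new StoredFile leaves its attributes.length at 6.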






File System Modules

Directory module: relates file names to file IDs

File module: relates file IDs to particular files

Access control module: checks permission for operation requested

File access module: reads or writes file data or attributes

Block module: accesses and allocates disk blocks

Device module: disk I/O and buffering

(This describes a single-host file system. A DFS may require additional components.)

Layered architecture: each layer depends only on the layers below it.
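As a rough illustration of the layering, the sketch below wires up in-memory stand-ins for five of the modules and resolves a read from the top layer down. All class and function names, the dictionary-based storage, and the owner/readers permission scheme are hypothetical. The device module (disk I/O and buffering) is left out because everything here lives in memory.

```python
class BlockModule:
    """Accesses and allocates blocks (here: entries in a dict)."""
    def __init__(self):
        self._blocks = {}
        self._next = 0

    def allocate(self, data: bytes) -> int:
        block_no = self._next
        self._next += 1
        self._blocks[block_no] = data
        return block_no

    def read(self, block_no: int) -> bytes:
        return self._blocks[block_no]


class FileAccessModule:
    """Reads or writes file data; depends only on the block module."""
    def __init__(self, blocks: BlockModule):
        self._blocks = blocks

    def write(self, record: dict, data: bytes) -> None:
        record["blocks"].append(self._blocks.allocate(data))

    def read(self, record: dict) -> bytes:
        return b"".join(self._blocks.read(b) for b in record["blocks"])


class AccessControlModule:
    """Checks permission for the requested operation (assumed scheme)."""
    def check(self, record: dict, user: str, op: str) -> None:
        allowed = record["owner"] == user if op == "write" else user in record["readers"]
        if not allowed:
            raise PermissionError(f"{user} may not {op} this file")


class FileModule:
    """Relates file IDs to particular files."""
    def __init__(self):
        self._files = {}
        self._next_id = 1

    def create(self, owner: str) -> int:
        file_id = self._next_id
        self._next_id += 1
        self._files[file_id] = {"owner": owner, "readers": {owner}, "blocks": []}
        return file_id

    def lookup(self, file_id: int) -> dict:
        return self._files[file_id]


class DirectoryModule:
    """Relates file names to file IDs."""
    def __init__(self):
        self._names = {}

    def add(self, name: str, file_id: int) -> None:
        self._names[name] = file_id

    def resolve(self, name: str) -> int:
        return self._names[name]


def read_file(name, user, directory, files, acl, access):
    # Each step hands its result to the layer below it.
    file_id = directory.resolve(name)   # name -> file ID
    record = files.lookup(file_id)      # file ID -> file
    acl.check(record, user, "read")     # permission check
    return access.read(record)          # file data via the block module
```

The point of the sketch is that read_file only moves downward: no lower module ever calls back into a higher one.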





2002, M. T. Harandi and J. Hou (modified: I. Gupta)

Lecture 22- 5

UNIX File System Operations

filedes = open(name, mode)
    Opens an existing file with the given name.

filedes = creat(name, mode)
    Creates a new file with the given name.

    Both operations deliver a file descriptor referencing the open file. The mode is read, write or both.

status = close(filedes)
    Closes the open file filedes.

count = read(filedes, buffer, n)
    Transfers n bytes from the file referenced by filedes to buffer.

count = write(filedes, buffer, n)
    Transfers n bytes to the file referenced by filedes from buffer.

    Both operations deliver the number of bytes actually transferred and advance the read-write pointer.

pos = lseek(filedes, offset, whence)
    Moves the read-write pointer to offset (relative or absolute, depending on whence).

status = unlink(name)
    Removes the file name from the directory structure. If the file has no other links to it, it is deleted from disk.

status = link(name1, name2)
    Creates a new link (name2) for a file (name1).

status = stat(name, buffer)
    Gets the file attributes for file name into buffer.
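The operations above can be tried out through Python's os module, which exposes thin wrappers around the same calls. This is a sketch rather than a reference: the function name demo_unix_file_ops, the file names, and the use of a temporary directory are assumptions for illustration, and it assumes a POSIX system where hard links are supported.

```python
import os
import tempfile


def demo_unix_file_ops():
    workdir = tempfile.mkdtemp()
    name1 = os.path.join(workdir, "notes.txt")
    name2 = os.path.join(workdir, "alias.txt")

    # open with O_CREAT plays the role of creat: it returns a file descriptor.
    fd = os.open(name1, os.O_CREAT | os.O_RDWR, 0o600)
    written = os.write(fd, b"hello, dfs")   # advances the read-write pointer
    pos = os.lseek(fd, 7, os.SEEK_SET)      # absolute seek to byte offset 7
    data = os.read(fd, 3)                   # reads 3 bytes starting at offset 7
    os.close(fd)

    os.link(name1, name2)                   # second name for the same file
    links_before = os.stat(name1).st_nlink  # attribute: link (reference) count
    os.unlink(name1)                        # removes one name only
    still_readable = os.path.exists(name2)  # data survives under the other name
    links_after = os.stat(name2).st_nlink

    os.unlink(name2)                        # last link gone: file is deleted
    os.rmdir(workdir)
    return written, pos, data, links_before, still_readable, links_after
```

The link count reported by stat drops from 2 to 1 after the first unlink, which matches the rule that a file is deleted only when no links to it remain.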






Distributed File System (DFS) Requirements



Transparency: server-side changes should be invisible to the client side.

    Access transparency: A single set of operations is provided for access to local and remote files.

    Location transparency: All client processes see a uniform file name space.

    Migration transparency: When files are moved from one server to another, users should not notice.

    Performance transparency

    Scaling transparency
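Location and migration transparency can be illustrated with a toy name space that hides which server holds a file. This is a minimal sketch under stated assumptions: FileServer, NameSpace, and their methods are hypothetical names, servers are plain in-memory objects, and real systems such as NFS or AFS resolve names quite differently.

```python
class FileServer:
    """Stand-in for a remote server: just a named dictionary of files."""
    def __init__(self, name: str):
        self.name = name
        self.files = {}


class NameSpace:
    """Uniform name space: clients use paths and never name a server."""
    def __init__(self):
        self._home = {}  # path -> FileServer currently holding the file

    def place(self, path: str, server: FileServer, data: bytes) -> None:
        server.files[path] = data
        self._home[path] = server

    def migrate(self, path: str, target: FileServer) -> None:
        # Move the data and update the mapping; the path does not change.
        source = self._home[path]
        target.files[path] = source.files.pop(path)
        self._home[path] = target

    def read(self, path: str) -> bytes:
        # Same call whichever server holds the file.
        return self._home[path].files[path]

    def server_of(self, path: str) -> str:
        # For inspection only; ordinary clients would not need this.
        return self._home[path].name
```

A client that calls read("/docs/a.txt") before and after a migrate gets the same bytes back, even though server_of reports a different server.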



File Replication

    A file may be represented by several copies for service efficiency and fault tolerance.

Concurrent File Updates

    Changes to a file by one client should not interfere with the operation of other clients simultaneously accessing the same file.
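One common way to meet the concurrent-update requirement is mutual exclusion on each file. The slides do not prescribe a mechanism, so the sketch below is only one possibility. SharedFile and run_clients are hypothetical names, and threads stand in for remote clients.

```python
import threading


class SharedFile:
    """A file whose updates are serialized by a per-file lock."""
    def __init__(self):
        self._records = []
        self._lock = threading.Lock()

    def append(self, record) -> None:
        # Read-modify-write under the lock, so no client's update is lost.
        with self._lock:
            snapshot = list(self._records)
            snapshot.append(record)
            self._records = snapshot

    def read_all(self) -> list:
        with self._lock:
            return list(self._records)


def run_clients(shared: SharedFile, n_clients: int, appends_each: int) -> list:
    def client(client_id: int) -> None:
        for i in range(appends_each):
            shared.append((client_id, i))

    threads = [threading.Thread(target=client, args=(c,)) for c in range(n_clients)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return shared.read_all()
```

With the lock in place, every append from every client appears exactly once in the final contents. Without it, two clients could copy the same snapshot and one update would overwrite the other.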


Copyright 2001, Medic T. Harandi
