U2FILE SYSTEMS
U2FILE SYSTEMS
and flash drives. In the Microsoft Windows family of operating systems, users are presented
with several different choices of file systems when formatting such media. These choices
depend on the type of media involved and the situations in which the media is being
formatted.
The two most common file systems in Windows are as follows:
NTFS
FAT
exFAT
HFS Plus
EXT
File systems can differ between operating systems (OS), such as Microsoft Windows,
MacOS and Linux-based systems. Some file systems are designed for specific applications.
Major types of file systems include distributed file systems, disk-based file systems and
special purpose file systems.
A file system is a process that manages how and where data on a storage disk, typically a
hard disk drive (HDD), is stored, accessed and managed. It is a logical disk component that
manages a disk's internal operations as it relates to a computer and is abstract to a human
user.
FAT is an acronym for File Allocation Table and the 16 indicates a 16-bit file system.
The file system was initially developed for use on floppy
The FAT used for most older systems, used a 16-bit binary number to hold cluster
numbers. A volume using FAT16 can hold a maximum of 65,526 clusters, FAT16 was
used for hard disk volumes ranging in size from 16 MB to 2,048 MB. VFAT is a variant
of FAT16.
FAT32 is supported by Windows 95's OEM SR2 release, as well as Windows 98,
Windows ME and Windows 2000. FAT32 uses a 28-bit binary cluster number--not 32,
because 4 of the 32 bits are "reserved". FAT32 can theoretically handle volumes with
over 268 million clusters, and will support (theoretically) drives up to 2 TB in size.
First, FAT32, being a 32-bit file system, supports much larger disks than the FAT16
file system. FAT16 supports a maximum volume size of 4 GB, while FAT32 can be
used on volumes up to 16 TB.
Maximum volume size depends on FAT type: FAT12: 256MB (for 64kB clusters)
FAT16: 4GB (for 64kB clusters) FAT32: 2TB (4G of 512B sectors)
NT file system (NTFS), which is also sometimes called the New Technology File
System, is a process that the Windows NT operating system uses for storing,
organizing, and finding files on a hard disk efficiently.
NTFS supports metadata, advanced data structures, reliability, disk space utilization.
And more additional functions like encrypting file system, hard links, sparse files, and
reparse points. If you only use a Windows computer, it is recommended to use NTFS
format.
NTFS can support volumes as large as 8 petabytes on Windows Server 2019 and newer
and Windows 10, version 1709 and newer (older versions support up to 256 TB).
NTFS and FAT32 file systems differ in the following ways: The FAT32 file system
cannot store individual files larger than 4 GB, while the NTFS file system can.
Compared to FAT32, the NTFS file system has higher disk utilization and can manage
disk space more effectively.
Unix File System
UNIX file system is a logical method of organizing and storing large amounts of
information in a way that makes it easy to manage.
A file is a smallest unit in which the information is stored.
UNIX file system has several important features.
All data in UNIX is organized into files.
All files are organized into directories.
These directories are organized into a tree-like structure called the file system.
Files in Unix System are organized into multi-level hierarchy structure known as a
directory tree.
At the very top of the file system is a directory called “root” which is represented by
a “/”. All other files are “descendants” of root.
The UNIX file system is a hierarchical file system used by Unix-based operating systems
to store and organize files and directories. It is a tree-like structure that starts with a single
directory called the root directory, which is denoted by a forward slash (/) character.
The UNIX file system uses a directory hierarchy that allows for easy navigation and
organization of files. Directories can contain both files and other directories, and each
file or directory has a unique name.
UNIX file system also uses a set of permissions to control access to files and directories.
Each file and directory has an owner and a group associated with it, and permissions can
be set to allow or restrict access to these entities.
One of the most important features of the UNIX file system is its support for symbolic
links, which are pointers to other files or directories. This allows for flexible organization
of files and directories without having to physically move them around.
Overall, the UNIX file system is a robust and flexible system that has been used for
decades and continues to be the foundation for many modern operating systems.
1. Ordinary files – An ordinary file is a file on the system that contains data, text, or program
instructions.
Used to store your information, such as some text you have written or an image you
have drawn. This is the type of file that you usually work with.
Always located within/under a directory file.
Do not contain other files.
In long-format output of ls -l, this type of file is specified by the “-” symbol.
2. Directories – Directories store both special and ordinary files. For users familiar with
Windows or Mac OS, UNIX directories are equivalent to folders.
A directory file contains an entry for every file and subdirectory that it houses. If you have
10 files in a directory, there will be 10 entries in the directory. Each entry has two
components. (1) The Filename (2) A unique identification number for the file or directory
(called the inode number)
Branching points in the hierarchical tree.
Used to organize groups of files.
May contain ordinary files, special files or other directories.
Never contain “real” information which you would work with (such as text). Basically,
just used for organizing files.
All files are descendants of the root directory, (named /) located at the top of the tree.
In long-format output of ls –l, this type of file is specified by the “d” symbol.
3. Special Files – Used to represent a real physical device such as a printer, tape drive or
terminal, used for Input/output (I/O) operations. Device or special files are used for device
Input/Output (I/O) on UNIX and Linux systems. They appear in a file system just like an
ordinary file or a directory. On UNIX systems there are two flavours of special files for each
device, character special files and block special files:
When a character special file is used for device Input/output (I/O), data is transferred one
character at a time. This type of access is called raw device access.
When a block special file is used for device Input/Output
(I/O), data is transferred in large fixed-size blocks. This type of access is called block
device access.
For terminal devices, it’s one character at a time. For disk devices though, raw access means
reading or writing in whole chunks of data – blocks, which are native to your disk.
In long-format output of ls -l, character special files are marked by the “c” symbol.
In long-format output of ls -l, block special files are marked by the “b” symbol.
6. Sockets –
A UNIX socket (or Inter-process communication socket) is a special file which allows
for advanced inter-process communication.
A Unix Socket is used in a client-server application framework. In essence, it is a
stream of data, very similar to network stream (and network sockets), but all the
transactions are local to the filesystem.
In long-format output of ls -l, UNIX sockets are marked by “s” symbol.
7. Symbolic Link
Symbolic link is used for referencing some other file of the file system.
Symbolic link is also known as Soft link. It contains a text form of the path to the file
it references.
To an end user, symbolic link will appear to have its own name, but when you try
reading or writing data to this file, it will instead reference these operations to the
file it points to. If we delete the soft link itself, the data file would still be there. If
we delete the source file or move it to a different location, symbolic file will not
function properly. In long-format output of ls –l, Symbolic link are marked by the
“l” symbol (that’s a lower case L).
Advantages or Disadvantages:
RAID:
The Full Form of RAID is Redundant Array of Inexpensive/Independent Disks.
It is a process of data visualisation, which utilises numerous solid-state drive (SSD) or
hard drives, thus enabling redundancy of data and improvement of performance.
In the instance of any unexpected incidents, the data also encounters threat resilience
because of redundancy. This enables benefits for the traditional storage process, such as
the SLED (Single Large Expensive Disk). Hence, instead of retaining the entire data in
one SLED, RAID enables the use of more than one small sized disks that permits quicker
I/O functions while providing a kind of strength to the whole system.
If ever any one of the system’s disk crash, others remain protected and safe, thus saving
the complete system from collapsing.
History of RAID
1988- “A Case for Redundant Arrays of Inexpensive Disks (RAID)” a technical report
by David Patterson, Randy Katz, and Garth A. Gibson published.
Discussions and mentions of RAID in the report
It further establishes the resilience and reliability of RAID in comparison to just one
disk.
Characteristics of RAID
Redundant Data – Multiple disks are used to store information, and this data is copied
and are stored in various locations or disks to enable better disaster management.
Stored in various disks– As an alternative to the single large disk, RAID keeps the
data stored in numerous small disks that are less expensive.
Enable multiple storage process– Different RAID uses various storing methods such
as striping, mirroring the parity to enable multiple characteristics for the users.
Numerous or Multiple Systems– RAID is provided in six different levels, thus
allowing for some unique characteristics, while also compromising on some features.
Finally, the user has the authority to confirm their requirement and then decide what
the applicable features are in their system.
Benefits of RAID
Speedy data access- The speed of data access RAID systems is definitely more than
that of SLED systems. RAID 0, RAID 4 and RAID 5 are especially designed, thus
enabling quick data access at reasonable rates.
Data Redundancy– Because of the data redundancy feature of the RAID systems, the
data storage is reliable. RAID 1 makes use of a data mirroring method to store the copies
of data to ensure reliability.
Correction of Errors– RAID 2, RAID 3, RAID 4 and RAID 5 apply hamming code
parity to correct the error in data.
I/O requests simultaneously: RAID 0, RAID 4 and RAID 5 apply the striping storage
methods and so it supports multiple I/O operations, simultaneously.
Data transfer in bulk– RAID 3 allows for speedy transfer of bulk data.
Security of Data- Striping and ongoing parity checks enable high data security.
Limitations of RAID
Expensive-RAID systems cost more than SLED systems.
Loss of data-Vulnerability of RAID systems, which do not utilise mirroring, will
enable data loss.
Selection of RAID level- Since there are multiple RAID levels, with each having
specific drawbacks and features, it is difficult to select what system should be used.
Not used properly- The complete system performance may decrease, if the RAID is
not utilised properly.
Complex technology- RAID is a complex data storage architecture and needs an adept
and competent person to unlock the total potential of RAID.