0% found this document useful (0 votes)
11 views

Project File

The document summarizes the RAID database submitted by Mr. Aakash Surana to Prof. Vishvesh Kumar of Sanghvi Institute of Management & Science. It defines RAID and the 7 levels of RAID (RAID 0 to RAID 6), describing the characteristics and pros/cons of RAID levels 0, 1, 2, 3, and 4. The key points are increased performance, data redundancy, and the ability to recover data after drive failures depending on the RAID level used.

Uploaded by

aakashsurana31
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views

Project File

The document summarizes the RAID database submitted by Mr. Aakash Surana to Prof. Vishvesh Kumar of Sanghvi Institute of Management & Science. It defines RAID and the 7 levels of RAID (RAID 0 to RAID 6), describing the characteristics and pros/cons of RAID levels 0, 1, 2, 3, and 4. The key points are increased performance, data redundancy, and the ability to recover data after drive failures depending on the RAID level used.

Uploaded by

aakashsurana31
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 20

Sanghvi Institute of Management & Science

(Approved By AICTE, Affiliated to DAVV, RGPV Recognized By)

RAID Database
Submitted for the fulfillment of the requirement for
The practical examination for the Master of Computer Application

(Session 2021-2023)
Prof. Vishvesh Kumar
Head of Department, Computer Application

Guided By Submitted By:


Prof. Vishvesh Kumar Mr. Aakash Surana

Affiliated to
Rajiv Gandhi Proudyogiki Vishwavidyalaya, Indore
(M.P.)
What is Database
The database is a collection of inter-related
data which is used to retrieve, insert and
delete the data efficiently. It is also used to
organize the data in the form of a table,
schema, views, and reports, etc.
For example: The college Database
organizes the data about the admin, staff,
students and faculty etc.
Using the database, you can easily retrieve,
insert, and delete the information.

Database Management
System
o Database management system is a software
which is used to manage the database. For
example: MySQL, Oracle, etc are a very
popular commercial database which is used
in different applications.
o DBMS provides an interface to perform
various operations like database creation,
storing data in it, updating data, creating a
table in the database and a lot more.
o It provides protection and security to the
database. In the case of multiple users, it
also maintains data consistency.

RAID
RAID refers to redundancy array of the
independent disk. It is a technology which is
used to connect multiple secondary storage
devices for increased performance, data
redundancy or both. It gives you the ability to
survive one or more drive failure depending
upon the RAID level used.
It consists of an array of disks in which multiple
disks are connected to achieve different goals.
RAID, or “Redundant Arrays of Independent
Disks” is a technique which makes use of a
combination of multiple disks instead of using a
single disk for increased performance, data
redundancy or both. The term was coined by
David Patterson, Garth A. Gibson, and Randy
Katz at the University of California, Berkeley
in 1987.

Why data redundancy?


Data redundancy, although taking up extra
space, adds to disk reliability. This means, in
case of disk failure, if the same data is also
backed up onto another disk, we can retrieve
the data and go on with the operation. On the
other hand, if the data is spread across just
multiple disks without the RAID technique, the
loss of a single disk can affect the entire data .

Key evaluation points for a


RAID System
 Reliability: How many disk faults can the
system tolerate?
 Availability: What fraction of the total
session time is a system in uptime mode,
i.e. how available is the system for actual
use?
 Performance: How good is the response
time? How high is the throughput (rate of
processing work)? Note that performance
contains a lot of parameters and not just
the two.
 Capacity: Given a set of N disks each

with B blocks, how much useful capacity


is available to the user?
RAID is very transparent to the underlying
system. This means, to the host system, it
appears as a single big disk presenting itself as
a linear array of blocks. This allows older
technologies to be replaced by RAID without
making too many changes in the existing code .

RAID technology
There are 7 levels of RAID schemes. These
schemas are as RAID 0, RAID 1, ...., RAID 6.
These levels contain the following
characteristics:
56.4M
1.2K
OOPs Concepts in Java

o It contains a set of physical disk drives.


o In this technology, the operating system
views these separate disks as a single logical
disk.
o In this technology, data is distributed across
the physical drives of the array.
o Redundancy disk capacity is used to store
parity information.
o In case of disk failure, the parity information
can be helped to recover the data.
Standard RAID levels
RAID 0
o RAID level 0 provides data stripping, i.e., a
data can place across multiple disks. It is
based on stripping that means if one disk
fails then all data in the array is lost.
o This level doesn't provide fault tolerance but
increases the system performance.
Example:
Disk 0 Disk 1 Disk 2 Disk 3

20 21 22 23

24 25 26 27

28 29 30 31

32 33 34 35

In this figure, block 0, 1, 2, 3 form a stripe.


In this level, instead of placing just one block
into a disk at a time, we can work with two or
more blocks placed it into a disk before moving
on to the next one.
Disk 0 Disk 1 Disk 2 Disk 3

20 22 24 26

21 23 25 27

28 30 32 34

29 31 33 35

In this above figure, there is no duplication of


data. Hence, a block once lost cannot be
recovered.

Pros of RAID 0:
o In this level, throughput is increased because
multiple data requests probably not on the
same disk.
o This level full utilizes the disk space and
provides high performance.
o It requires minimum 2 drives.

Cons of RAID 0:
o It doesn't contain any error detection
mechanism.
o The RAID 0 is not a true RAID because it is
not fault-tolerance.
o In this level, failure of either disk results in
complete data loss in respective array.

RAID 1
This level is called mirroring of data as it copies
the data from drive 1 to drive 2. It provides
100% redundancy in case of a failure.
Example:
Disk 0 Disk 1 Disk 2 Disk 3

A A B B

C C D D

E E F F

G G H H

Only half space of the drive is used to store the


data. The other half of drive is just a mirror to
the already stored data.

Pros of RAID 1:
o The main advantage of RAID 1 is fault
tolerance. In this level, if one disk fails, then
the other automatically takes over.
o In this level, the array will function even if
any one of the drives fails.

Cons of RAID 1:
o In this level, one extra drive is required per
drive for mirroring, so the expense is higher.
RAID 2
o RAID 2 consists of bit-level striping using
hamming code parity. In this level, each data
bit in a word is recorded on a separate disk
and ECC code of data words is stored on
different set disks.
o Due to its high cost and complex structure,
this level is not commercially used. This
same performance can be achieved by RAID
3 at a lower cost.

Pros of RAID 2:
o This level uses one designated drive to store
parity.
o It uses the hamming code for error detection.

Cons of RAID 2:
o It requires an additional drive for error
detection.

RAID 3
o RAID 3 consists of byte-level striping with
dedicated parity. In this level, the parity
information is stored for each disk section
and written to a dedicated parity drive.
o In case of drive failure, the parity drive is
accessed, and data is reconstructed from the
remaining devices. Once the failed drive is
replaced, the missing data can be restored on
the new drive.
o In this level, data can be transferred in bulk.
Thus high-speed data transmission is
possible.
Disk 0 Disk 1 Disk 2 Disk 3

A B C P(A, B, C)

D E F P(D, E, F)

G H I P(G, H, I)

J K L P(J, K, L)

Pros of RAID 3:
o In this level, data is regenerated using parity
drive.
o It contains high data transfer rates.
o In this level, data is accessed in parallel.
Cons of RAID 3:
o It required an additional drive for parity.
o It gives a slow performance for operating on
small sized files.

RAID 4
o RAID 4 consists of block-level stripping
with a parity disk. Instead of duplicating
data, the RAID 4 adopts a parity-based
approach.
o This level allows recovery of at most 1 disk
failure due to the way parity works. In this
level, if more than one disk fails, then there
is no way to recover the data.
o Level 3 and level 4 both are required at least
three disks to implement RAID.
Disk 0 Disk 1 Disk 2 Disk 3

A B C P0

D E F P1

G H I P2

J K L P3

In this figure, we can observe one disk


dedicated to parity.
In this level, parity can be calculated using an
XOR function. If the data bits are 0,0,0,1 then
the parity bits is XOR(0,1,0,0) = 1. If the parity
bits are 0,0,1,1 then the parity bit is
XOR(0,0,1,1)= 0. That means, even number of
one results in parity 0 and an odd number of one
results in parity 1.
C1 C2 C3 C4 Parity

0 1 0 0 1
0 0 1 1 0

Suppose that in the above figure, C2 is lost due


to some disk failure. Then using the values of all
the other columns and the parity bit, we can
recompute the data bit stored in C2. This level
allows us to recover lost data.

RAID 5
o RAID 5 is a slight modification of the RAID
4 system. The only difference is that in
RAID 5, the parity rotates among the drives.
o It consists of block-level striping with
DISTRIBUTED parity.
o Same as RAID 4, this level allows recovery
of at most 1 disk failure. If more than one
disk fails, then there is no way for data
recovery.
Disk 0 Disk 1 Disk 2 Disk 3 Disk 4

0 1 2 3 P0

5 6 7 P1 4

10 11 P2 8 9
15 P3 12 13 14

P4 16 17 18 19

This figure shows that how parity bit rotates.


This level was introduced to make the random
write performance better.

Pros of RAID 5:
o This level is cost effective and provides high
performance.
o In this level, parity is distributed across the
disks in an array.
o It is used to make the random write
performance better.

Cons of RAID 5:
o In this level, disk failure recovery takes
longer time as parity has to be calculated
from all available drives.
o This level cannot survive in concurrent drive
failure.
RAID 6
o This level is an extension of RAID 5. It
contains block-level stripping with 2 parity
bits.
o In RAID 6, you can survive 2 concurrent
disk failures. Suppose you are using RAID
5, and RAID 1. When your disks fail, you
need to replace the failed disk because if
simultaneously another disk fails then you
won't be able to recover any of the data, so
in this case RAID 6 plays its part where you
can survive two concurrent disk failures
before you run out of options.
Disk 1 Disk 2 Disk 3 Disk 4

A0 B0 Q0 P0

A1 Q1 P1 D1

Q2 P2 C2 D2

P3 B3 C3 Q3
Pros of RAID 6:
o This level performs RAID 0 to strip data and
RAID 1 to mirror. In this level, stripping is
performed before mirroring.
o In this level, drives required should be
multiple of 2.

Cons of RAID 6:
o It is not utilized 100% disk capability as half
is used for mirroring.
o It contains very limited scalability.

You might also like