Normalisasi
Normalisasi
rikip ginanjar
What is Normalization?
1. Normalization is a database design technique that reduces
data redundancy and eliminates undesirable characteristics
like Insertion, Update and Deletion Anomalies
2. Normalization rules divides larger tables into smaller
tables and links them using relationships
3. The purpose of Normalisation in SQL is to eliminate
redundant (repetitive) data and ensure data is stored
logically.
4. The inventor of the relational model Edgar Codd proposed
the theory of normalization of data with the introduction of
the First Normal Form and he continued to extend theory
with Second and Third Normal Form
5. Later he joined Raymond F. Boyce to develop the theory
of Boyce-Codd Normal Form.
Normalization
Normalization
We discuss four normal forms: first, second, third, and
Boyce-Codd normal forms
1NF, 2NF, 3NF, and BCNF
91.2914 3
Normalization
There is a sequence to normal forms:
1NF is considered the weakest,
2NF is stronger than 1NF,
3NF is stronger than 2NF, and
BCNF is considered the strongest
Also,
any relation that is in BCNF, is in 3NF;
any relation in 3NF is in 2NF; and
any relation in 2NF is in 1NF.
91.2914 4
Database Normal Forms
1. The Theory of Data Normalization in MySQL server is
still being developed further.
2. For example, there are discussions even on 6th Normal
Form.
3. However, in most practical applications, normalization
achieves its best in 3rd Normal Form.
4. The evolution of Normalization in SQL theories is
illustrated below-
Normalization
91.2914 6
Normalization
We consider a relation in BCNF to be fully normalized.
A design that has a lower normal form than another design has
more redundancy. Uncontrolled redundancy can lead to data
integrity problems.
91.2914 9
Functional Dependencies
EmpNum → EmpEmail
EmpNum → EmpFname 3 different ways
EmpNum → EmpLname you might see FDs
depicted
EmpEmail
EmpNum EmpFname
EmpLname
91.2914 10
Determinant
Functional Dependency
EmpNum → EmpEmail
91.2914 11
Transitive dependency
Transitive dependency
Consider attributes A, B, and C, and where
A → B and B → C.
Functional dependencies are transitive, which
means that we also have the functional dependency
A→C
We say that C is transitively dependent on A
through B.
91.2914 12
Transitive dependency
EmpNum → DeptNum
EmpNum EmpEmail DeptNum DeptNname
DeptNum → DeptName
91.2914 13
Partial dependency
A partial dependency exists when an attribute B is
functionally dependent on an attribute A, and A is a
component of a multipart candidate key.
91.2914 15
First Normal Form
The following in not in 1NF
91.2914 16
First Normal Form
EmpNum EmpPhone EmpDegrees
123 233-9876
333 233-1231 BA, BSc, PhD
679 233-1231 BSc, MSc
91.2914 17
First Normal Form
EmployeeDegree
Employee
EmpNum EmpDegree
EmpNum EmpPhone
333 BA
123 233-9876
333 BSc
333 233-1231
333 PhD
679 233-1231
679 BSc
679 MSc
91.2914 18
exercise
StudentID StudentName Phone# EmailAddress StudyProg
StudentID EmailAddress
1 [email protected]
1 [email protected]
2 [email protected]
2 [email protected]
3 [email protected]
Second Normal Form
Second Normal Form
A relation is in 2NF if it is in 1NF, and every non-key attribute
is fully dependent on each candidate key. (That is, we don’t
have any partial functional dependency.)
91.2914 21
Second Normal Form
Consider this InvLine table (in 1NF):
InvNum LineNum ProdNum Qty InvDate
InvNum InvDate
91.2914 24
Examples/exercise
StID SubjID SubjName Semester
1 1 Database 2
1 2 InternetProg 2
SubjID SubjName
1 Database
2 InternetProg
exercise
StID SubjID StName SubName Score
2NF, but not in 3NF, nor in BCNF:
EmployeeDept
ename ssn bdate address dnumber dname
dnumber dname.
91.2914 27
Third Normal Form
Third Normal Form
• A relation is in 3NF if the relation is in 1NF and all
determinants of non-key attributes are candidate keys
That is, for any functional dependency: X Y, where Y is
a non-key attribute (or a set of non-key attributes), X is a
candidate key.
• This definition of 3NF differs from BCNF only in the
specification of non-key attributes - 3NF is weaker than
BCNF. (BCNF requires all determinants to be candidate
keys.)
• A relation in 3NF will not have any transitive dependencies
of non-key attribute on a candidate key through another
non-key attribute.
91.2914 28
Third Normal Form
Consider this Employee relation Candidate keys
are? …
EmpNum EmpName DeptNum DeptName
91.2914 29
Third Normal Form
EmpNum EmpName DeptNum DeptName
91.2914 30
Example
StID StName MajID MajName BDate Address
Majd MajName
exercise
StID StName MajID FacID MajName FacName Bdate
FacID FacName
In 3NF, but not in BCNF:
91.2914 33
student_no course_no instr_no
student_no instr_no
course_no instr_no
91.2914 35
Exercises
Give the 1NF, 2NF, 3NF for the following
1st NF
2nd NF
3rd NF