M0 DBS Perspective 2025
M0 DBS Perspective 2025
CS3700
Introduction to Database Systems
Perspective - Big Picture
Prof P Sreenivasa Kumar
Department of CSE, I I T Madras
Kinds of Data
• Various kinds of Data we encounter everyday
– Enterprise (business) data
– Structured data
– Documents / webpages – Text data
– Objects – JSON; XML – (label, value) pairs data
– Semi-structured data
– Voice / music / image / video data
– Unstructured data
• This course focuses on structured data
1
1/24/25
Data Warehouses
• DB Querying
– Ask for details present in data or can be derived
– Use logic-based expression languages – SQL/OQL etc
– Ask for aggregated information
– How many ‘fiction’ books got issued to ‘CS’ students in the
last quarter ?
Prof P Sreenivasa Kumar, 6
Department of CS&E, IITM.
2
1/24/25
OLAP - OLTP
• OLAP (Online analytical processing) queries:
– Ask for aggregated information from data
– Needs summarization of large amount of data
– How many ‘cosmetics’ products were sold in all the stores located
in the ‘southern’ region of the country in the last 2 quarters ( Big
Basket or Amazon )
– Specialized architecting of data is needed
– Done on warehouse data using “Data Cubes”
3
1/24/25
Knowledge Bases
• Knowledge Bases
– Founded on mathematical logic based languages
– Inferences / reasoning incorporated
– Every surgeon is a doctor; John is a surgeon and hence a doctor
– Data : modeled as collection of ‘statements’
– Metadata: Terms/Vocabulary of the domain
– Names of entity types of interest, properties of interest
for these entities; binary relationships among entities
– Modeled as Ontologies to capture domain knowledge
– Can be viewed as graphs – ‘Knowledge Graphs’
– More in the TAO course….
Prof P Sreenivasa Kumar, 10
Department of CS&E, IITM.
10
NoSQL Databases
• Non-relational databases / Not only SQL
– Offer flexibility – no rigid row/col structure
– Semi-structured + other simpler data models
– Need: faster response to certain queries, quick data
model updates, quick app development, horizontal
scale-out ..
– System ‘availability’ over ‘consistency’ of reads
– (-ve) No standard query language to access data
– Multiple types of DBs
– JSON / XML, key-value, columnar, graph etc
11
12
4
1/24/25
Course Structure
Course has 8 modules (Total #slides 451)
Module Name #Slides
Introduction 24
Entity/Relationship Models 38
Relational Model 64
SQL Standard 66
Normal Forms 64
Transaction Processing 74
13
Course Organization
• Problem Sessions – 5 or 6 ( tutorials )
– Problem sheet – start of the class or a day before
– Students to solve – discussion and questions – OK
– No marks – no judging; answers posted later
– Course TAs would be present - aid discussions
• Design and Dev assignment – group/individual
– Running assignment – with 3 or 4 stages
• Exams
– Mid-sem Exam / Two Quizzes – TBD
– End-sem Exam as per schedule.
Prof P Sreenivasa Kumar, 14
Department of CS&E, IITM.
14