Introduction To Data Science
Introduction To Data Science
Science
AGENDA
• Defining Data Science
• What Does a Data Science Professional Do?
• Data Science in Business
• Use Cases for Data Science
• Installation of R and R studio
DEFINING DATA SCIENCE
• Data Science deals with the science and algorithms related to
data.
• Data generated from various sort of sources.
• Report says, “Every day, approximately 2 quintillion bytes of
data is generated. If it grows at this pace, then by the next 3
years, it is expected that 2MB of data will be created every
second for every individual on this planet.”
• Last 2 years witnessing the creation of 90% of data over the
globe.
• Data has two sources:
• Structured
• Unstructured
• Structured sources include information that iscompatible
with the
relational database.
• E.g. ATM transactions, Flight Tickets which enable SQL
to
make changes in them.
• Unstructured data is generated from tweets and comments
on social media, audio and video files which the SQL cannot
process.
DEFINITION
“ Data Science is a broad field which is an assembly of scientific
techniques, methods, processes used to clean the data and then
extract some useful patterns and insights in form of
visualizations.”
• Many more…….
DATA SCIENTISTS ANDTHEIR ROLE
• Data Scientist is a Rockstar!!!
• A Data Scientist is an individual who has the power and freedom to
experiment with tons of different kinds of data.
• Based on knowledge of:
– Mathematics
– Problem solving
– Critical thinking
– Careful analysis
WHAT DOES DATA SCIENCE PROFESSIONAL
DO?
DATA ANALYST V/S DATA SCIENTIST
• Data Analyst has a lot to do with converting the data into a
structured format in order to process it further.
• Focus more on Data Mining and Data Auditing
• Data mining involves retrieving information from large databases with
the help of SQL to extract new data/information.
• Data auditing involves checking the essence of data and trying to figure out
if the data is
capable enough for gaining useful insights or not.
DATA ANALYST V/S DATA SCIENTIST
• Data Scientist take the clean data and trying to gain
some meaningful insights.