SlideShare a Scribd company logo
Introduction to Data Science
● Eduction
○ 2012 Pass out, M.Sc. Information system - Bits, Pilani Rajasthan.
○ Trained in RHEL 6, AIX, Business Communications
○ Certified Data Modelling Engineer.
● Software Engineer
○ 4.5 Years in Data Engineering & Data Analytic.
○ 1 Year in Data Sciences and Data Modelling.
○ Python, Oracle DB, Oracle Apex.
● Personal Life
○ Teaching(blog), Music, Anime, lazy.
○ Health Conscious, Gym/Yoga/lots of Sleep.
○ Technology & Personal communication skills.
● Motivation:
○ Bridge the gap between Technology and People. Lead a R&D Team.
About Me
0:05 Nobody's born smart
1:08 Because the most beautiful, complex concepts in the whole universe are built on basic ideas
1:13 that anyone can learn, anywhere can understand. Whoever you are, whereever you are
1:18 You only have to know one thing: You can learn anything
Introduction to data science
2011 Watson - Jeopardy
Data Science
1952 - Tic Tac Toe ⇒ Human vs Computer
1997 - Deep Blue - Chess ⇒ Exploring Solution Space
2011 - Watson - Jeopardy ⇒ Constructive Reasoning
2017 - AlphaGo - Go ⇒ Developing Intuition
In AlphaGo, no. of possibilities > total no. atoms in this universe.
Plan
Introduction
● Definitions [ Data Science ]
● What, Why and How
● Examples
Data Science - In Action
● Stages [DG, DC, DM, ME]
● Regression & Clustering Models
● Basics [ LR, GD ]
Real Life Application
● Examples
Data Science Tools
● Examples
Suggestions
● Tips
What is Data Science?
What is Data Science?
da•ta
Factual information, especially information organized for
analysis or used to reason or make decisions.
Computer Science Numerical or other information
represented in a form suitable for processing by computer.
Values derived from scientific experiments.
sci·ence (sī′əns)
The observation, identification, description,
experimental investigation, and theoretical explanation
of phenomena. Ex. New advances in science and
technology.
Such activities restricted to a class of natural
phenomena. Ex. The science of astronomy.
A systematic method or body of knowledge in a given
area. Ex. The science of marketing.
Archaic Knowledge, especially that gained through
experience.
Data Science Examples
Why Data Science?
● Technological Advancements
● Cheaper Storage
● Faster Computations
● IOT
● RAD Tools
● Bigger Questions?
Growing Devices
Information Explosion & Doubling Processing Power
Metcalfe's law states that the value of a telecommunications network is
proportional to the square of the number of connected users of the system (n2).
Moore's law is the observation that the number of transistors in a dense integrated
circuit doubles approximately every two years.
(Population - Thanks to Advanced Medical Sciences & Improving Health Care.)
Sources: Wikipedia
How to do Data Science?
How to Data Science? - AI, ML
Rosey, Spacely, Jetson MIT Cheetah Robot
How to do Data Science
You can use lots of sophisticated analytical & Business Intelligent tools and come to
a simple understandable explanations.
(or)
You can also use, simple tools like calculators or excel sheet to generate simple
and simple results.
Plan
Introduction
● Definitions [ Data Science ]
● What, Why and How
● Examples
Data Science - In Action
● Stages [DG, DC, DM, ME]
● Regression & Clustering Models
● Basics [ LR, GD ]
Real Life Application
● Examples
Data Science Tools
● Examples
Suggestions
● Tips
Data Science - In Action
Battles behind the scenes
Stages of Data Science
● Purpose
● Relevant Data Collection
● Wrangling(cleansing)*
● Data Analytics
● Feature Engg.*
● Data Modelling*
● Data Prediction*
● Evaluation*
(*) ⇒ Repetitive stages
● Reportings
● Finalising Report
● Data Product Building (software
development)
○ Architecture
○ Development
○ Testing
○ Deployment
Data Model
● Random Forest Model
○ Bagging
● SVM
○ Linear Equation
Iris Dataset - Goal
<< Ipython Notebook >>
Plan
Introduction
● Definitions [ Data Science ]
● What, Why and How
● Examples
Data Science - In Action
● Stages [DG, DC, DM, ME]
● Regression & Clustering Models
● Basics [ LR, GD ]
Real Life Application
● Examples
Data Science Tools
● Examples
Suggestions
● Tips
Data Science - Real Life App
Few applications that inspired me
Passive Designs + AI
Maurice Cont
Director of Applied Research & Innovation
Autodesk, San Francisco Bay Area.
TED Talk: The incredible inventions of intuitive AI
Generative Designs > Passive Designs
AI Designed Lightweight Cabin Partition
Airbus - A320
AI Designed Lightweight Drone Chassis
Generative Designs
Generative Designs
AI Designed Car Chassis
Music XRay
● Jimmy Lloyd Songwriter Showcase
● Popular songs share Melody & Rhythm
● Genere - 70
● Cluster 60
● Singer & Song Writer NY
● http://www.heidimerrill.com/epk/index.html
Pred Pole
● 2011 Santa Cruz Pred Pole
● Crime, Location & Date-Time
● https://www.predpol.com/
Results:
● 50% Crime Rate control
● 20% reduction in Crime Rate
Generative Designs Project - Interlace
Plan
Introduction
● Definitions [ Data Science ]
● What, Why and How
● Examples
Data Science - In Action
● Stages [DG, DC, DM, ME]
● Regression & Clustering Models
● Basics [ LR, GD ]
Real Life Application
● Examples
Data Science Tools
● Examples
Suggestions
● Tips
Data Science - Tools
Too many to name, but none of them are close perfection.
Data Science Tools
● Languages: Scala, R, Python, Java, C#
● Lib: Scikit, DeepNet, Tensor flow, Theano, H20
● Frameworks: Apache Spark
These are some used by used us (Imaginea Labs - Data Sciences - 4th Floor, Hyd).
Suggestions
Challenges in DS & Tips to who want to start.
Suggestions?
● Data Preparation
○ “Give me six hours to chop down a tree and I will spend the first four sharpening the axe”.
Abraham Lincoln
○ Python, Scala, Excel, Databases(regex).
● Data Analytics
○ “Seeing is believing”
○ Python(Matplotlib, Seaborn), D3.Js, Excel.
● Data Models
○ “There are no perfect solutions, but some work better”
○ Learn 2-3 types of Clustering, Regression Models(LR,RF,SVM,KNN,XGB)
● Evaluation
○ “A product not tested is broken by default”
○ Accuracy, RMSE, Precision-Recall, F1 Score
Questions?
Sampath - Desk 4F 072. Imaginea Labs - Data Sciences.
Sachin, Keerat, Bipul, Kavi, Mageshwaran.
Thank you
Ad

More Related Content

What's hot (20)

Data science
Data scienceData science
Data science
Ranjit Nambisan
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Srishti44
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
SadhanaParameswaran
 
Data science
Data scienceData science
Data science
Mohamed Loey
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
Data Science Club
 
Data science
Data scienceData science
Data science
SwapnilDahake2
 
Data Science
Data ScienceData Science
Data Science
Amit Singh
 
Data science presentation
Data science presentationData science presentation
Data science presentation
MSDEVMTL
 
data science
data sciencedata science
data science
skhraletta
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
Edureka!
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
VijayMohan Vasu
 
Data Science
Data ScienceData Science
Data Science
Prakhyath Rai
 
Data science
Data scienceData science
Data science
Sreejith c
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Edureka!
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
Jason Geng
 
Data analytics
Data analyticsData analytics
Data analytics
BindhuBhargaviTalasi
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Edureka!
 
Data analytics
Data analyticsData analytics
Data analytics
Tilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL
 
Data analytics
Data analyticsData analytics
Data analytics
davidfergarcia
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science Introduction
Gang Tao
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Srishti44
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
SadhanaParameswaran
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
Data Science Club
 
Data science presentation
Data science presentationData science presentation
Data science presentation
MSDEVMTL
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
Edureka!
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
VijayMohan Vasu
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Edureka!
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
Jason Geng
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Edureka!
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science Introduction
Gang Tao
 

Similar to Introduction to data science (20)

Big Data & Social Analytics presentation
Big Data & Social Analytics presentationBig Data & Social Analytics presentation
Big Data & Social Analytics presentation
gustavosouto
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DS
Roopesh Kohad
 
L15.pptx
L15.pptxL15.pptx
L15.pptx
ImonBennett
 
How to become a data scientist
How to become a data scientist How to become a data scientist
How to become a data scientist
Manjunath Sindagi
 
Data Science as Scale
Data Science as ScaleData Science as Scale
Data Science as Scale
Conor B. Murphy
 
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dan Lynn
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
sumitkumar600840
 
Data science a practitioner's perspective
Data science  a practitioner's perspectiveData science  a practitioner's perspective
Data science a practitioner's perspective
Amir Ziai
 
Dirty data? Clean it up! - Datapalooza Denver 2016
Dirty data? Clean it up! - Datapalooza Denver 2016Dirty data? Clean it up! - Datapalooza Denver 2016
Dirty data? Clean it up! - Datapalooza Denver 2016
Dan Lynn
 
First steps in Data Mining Kindergarten
First steps in Data Mining KindergartenFirst steps in Data Mining Kindergarten
First steps in Data Mining Kindergarten
Alexey Zinoviev
 
DATA SCIENCE-1. Enginnering course .pdf
DATA SCIENCE-1. Enginnering course  .pdfDATA SCIENCE-1. Enginnering course  .pdf
DATA SCIENCE-1. Enginnering course .pdf
fekiy64690
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data Scientist
Rohit Dubey
 
FDS_dept_ppt.pptx
FDS_dept_ppt.pptxFDS_dept_ppt.pptx
FDS_dept_ppt.pptx
SatyajitPatil42
 
Artificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdfArtificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdf
Jayanti Prasad Ph.D.
 
Data science as career
Data science as careerData science as career
Data science as career
Manjunath Sindagi
 
Career in Python and data science
Career in Python and data science Career in Python and data science
Career in Python and data science
Sagar Hedau
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
Data Con LA
 
Welcome to CS310!
Welcome to CS310!Welcome to CS310!
Welcome to CS310!
Dmitry Zinoviev
 
Data science
Data scienceData science
Data science
Purna Chander
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
DIGITALSAI1
 
Big Data & Social Analytics presentation
Big Data & Social Analytics presentationBig Data & Social Analytics presentation
Big Data & Social Analytics presentation
gustavosouto
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DS
Roopesh Kohad
 
How to become a data scientist
How to become a data scientist How to become a data scientist
How to become a data scientist
Manjunath Sindagi
 
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dirty Data? Clean it up! - Rocky Mountain DataCon 2016
Dan Lynn
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
sumitkumar600840
 
Data science a practitioner's perspective
Data science  a practitioner's perspectiveData science  a practitioner's perspective
Data science a practitioner's perspective
Amir Ziai
 
Dirty data? Clean it up! - Datapalooza Denver 2016
Dirty data? Clean it up! - Datapalooza Denver 2016Dirty data? Clean it up! - Datapalooza Denver 2016
Dirty data? Clean it up! - Datapalooza Denver 2016
Dan Lynn
 
First steps in Data Mining Kindergarten
First steps in Data Mining KindergartenFirst steps in Data Mining Kindergarten
First steps in Data Mining Kindergarten
Alexey Zinoviev
 
DATA SCIENCE-1. Enginnering course .pdf
DATA SCIENCE-1. Enginnering course  .pdfDATA SCIENCE-1. Enginnering course  .pdf
DATA SCIENCE-1. Enginnering course .pdf
fekiy64690
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data Scientist
Rohit Dubey
 
Artificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdfArtificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdf
Jayanti Prasad Ph.D.
 
Career in Python and data science
Career in Python and data science Career in Python and data science
Career in Python and data science
Sagar Hedau
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
Data Con LA
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
DIGITALSAI1
 
Ad

Recently uploaded (20)

i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...
i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...
i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...
ggg032019
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
Shotgun detailed overview my this ppt formate
Shotgun detailed overview my this ppt formateShotgun detailed overview my this ppt formate
Shotgun detailed overview my this ppt formate
freefreefire0998
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
AllContacts Vs AllSubscribers - SFMC.pptx
AllContacts Vs AllSubscribers - SFMC.pptxAllContacts Vs AllSubscribers - SFMC.pptx
AllContacts Vs AllSubscribers - SFMC.pptx
bpkr84
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Call illuminati Agent in uganda+256776963507/0741506136
Call illuminati Agent in uganda+256776963507/0741506136Call illuminati Agent in uganda+256776963507/0741506136
Call illuminati Agent in uganda+256776963507/0741506136
illuminati Agent uganda call+256776963507/0741506136
 
03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia
Alexander Romero Arosquipa
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docxMASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
santosh162
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Introcomputerscienceand datascience.pptx
Introcomputerscienceand datascience.pptxIntrocomputerscienceand datascience.pptx
Introcomputerscienceand datascience.pptx
abdulrehmanbscsf22
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...
i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...
i_o updated.pptx 6=₹cnjxifj,lsbd ধ and vjcjcdbgjfu n smn u cut the lb, it ও o...
ggg032019
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
Shotgun detailed overview my this ppt formate
Shotgun detailed overview my this ppt formateShotgun detailed overview my this ppt formate
Shotgun detailed overview my this ppt formate
freefreefire0998
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
AllContacts Vs AllSubscribers - SFMC.pptx
AllContacts Vs AllSubscribers - SFMC.pptxAllContacts Vs AllSubscribers - SFMC.pptx
AllContacts Vs AllSubscribers - SFMC.pptx
bpkr84
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docxMASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
santosh162
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Introcomputerscienceand datascience.pptx
Introcomputerscienceand datascience.pptxIntrocomputerscienceand datascience.pptx
Introcomputerscienceand datascience.pptx
abdulrehmanbscsf22
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Ad

Introduction to data science

  • 2. ● Eduction ○ 2012 Pass out, M.Sc. Information system - Bits, Pilani Rajasthan. ○ Trained in RHEL 6, AIX, Business Communications ○ Certified Data Modelling Engineer. ● Software Engineer ○ 4.5 Years in Data Engineering & Data Analytic. ○ 1 Year in Data Sciences and Data Modelling. ○ Python, Oracle DB, Oracle Apex. ● Personal Life ○ Teaching(blog), Music, Anime, lazy. ○ Health Conscious, Gym/Yoga/lots of Sleep. ○ Technology & Personal communication skills. ● Motivation: ○ Bridge the gap between Technology and People. Lead a R&D Team. About Me
  • 3. 0:05 Nobody's born smart 1:08 Because the most beautiful, complex concepts in the whole universe are built on basic ideas 1:13 that anyone can learn, anywhere can understand. Whoever you are, whereever you are 1:18 You only have to know one thing: You can learn anything
  • 5. 2011 Watson - Jeopardy Data Science 1952 - Tic Tac Toe ⇒ Human vs Computer 1997 - Deep Blue - Chess ⇒ Exploring Solution Space 2011 - Watson - Jeopardy ⇒ Constructive Reasoning 2017 - AlphaGo - Go ⇒ Developing Intuition In AlphaGo, no. of possibilities > total no. atoms in this universe.
  • 6. Plan Introduction ● Definitions [ Data Science ] ● What, Why and How ● Examples Data Science - In Action ● Stages [DG, DC, DM, ME] ● Regression & Clustering Models ● Basics [ LR, GD ] Real Life Application ● Examples Data Science Tools ● Examples Suggestions ● Tips
  • 7. What is Data Science?
  • 8. What is Data Science? da•ta Factual information, especially information organized for analysis or used to reason or make decisions. Computer Science Numerical or other information represented in a form suitable for processing by computer. Values derived from scientific experiments. sci·ence (sī′əns) The observation, identification, description, experimental investigation, and theoretical explanation of phenomena. Ex. New advances in science and technology. Such activities restricted to a class of natural phenomena. Ex. The science of astronomy. A systematic method or body of knowledge in a given area. Ex. The science of marketing. Archaic Knowledge, especially that gained through experience.
  • 11. ● Technological Advancements ● Cheaper Storage ● Faster Computations ● IOT ● RAD Tools ● Bigger Questions? Growing Devices
  • 12. Information Explosion & Doubling Processing Power Metcalfe's law states that the value of a telecommunications network is proportional to the square of the number of connected users of the system (n2). Moore's law is the observation that the number of transistors in a dense integrated circuit doubles approximately every two years. (Population - Thanks to Advanced Medical Sciences & Improving Health Care.) Sources: Wikipedia
  • 13. How to do Data Science?
  • 14. How to Data Science? - AI, ML Rosey, Spacely, Jetson MIT Cheetah Robot
  • 15. How to do Data Science You can use lots of sophisticated analytical & Business Intelligent tools and come to a simple understandable explanations. (or) You can also use, simple tools like calculators or excel sheet to generate simple and simple results.
  • 16. Plan Introduction ● Definitions [ Data Science ] ● What, Why and How ● Examples Data Science - In Action ● Stages [DG, DC, DM, ME] ● Regression & Clustering Models ● Basics [ LR, GD ] Real Life Application ● Examples Data Science Tools ● Examples Suggestions ● Tips
  • 17. Data Science - In Action Battles behind the scenes
  • 18. Stages of Data Science ● Purpose ● Relevant Data Collection ● Wrangling(cleansing)* ● Data Analytics ● Feature Engg.* ● Data Modelling* ● Data Prediction* ● Evaluation* (*) ⇒ Repetitive stages ● Reportings ● Finalising Report ● Data Product Building (software development) ○ Architecture ○ Development ○ Testing ○ Deployment
  • 19. Data Model ● Random Forest Model ○ Bagging ● SVM ○ Linear Equation
  • 20. Iris Dataset - Goal << Ipython Notebook >>
  • 21. Plan Introduction ● Definitions [ Data Science ] ● What, Why and How ● Examples Data Science - In Action ● Stages [DG, DC, DM, ME] ● Regression & Clustering Models ● Basics [ LR, GD ] Real Life Application ● Examples Data Science Tools ● Examples Suggestions ● Tips
  • 22. Data Science - Real Life App Few applications that inspired me
  • 23. Passive Designs + AI Maurice Cont Director of Applied Research & Innovation Autodesk, San Francisco Bay Area. TED Talk: The incredible inventions of intuitive AI
  • 24. Generative Designs > Passive Designs AI Designed Lightweight Cabin Partition Airbus - A320 AI Designed Lightweight Drone Chassis
  • 27. Music XRay ● Jimmy Lloyd Songwriter Showcase ● Popular songs share Melody & Rhythm ● Genere - 70 ● Cluster 60 ● Singer & Song Writer NY ● http://www.heidimerrill.com/epk/index.html
  • 28. Pred Pole ● 2011 Santa Cruz Pred Pole ● Crime, Location & Date-Time ● https://www.predpol.com/ Results: ● 50% Crime Rate control ● 20% reduction in Crime Rate
  • 30. Plan Introduction ● Definitions [ Data Science ] ● What, Why and How ● Examples Data Science - In Action ● Stages [DG, DC, DM, ME] ● Regression & Clustering Models ● Basics [ LR, GD ] Real Life Application ● Examples Data Science Tools ● Examples Suggestions ● Tips
  • 31. Data Science - Tools Too many to name, but none of them are close perfection.
  • 32. Data Science Tools ● Languages: Scala, R, Python, Java, C# ● Lib: Scikit, DeepNet, Tensor flow, Theano, H20 ● Frameworks: Apache Spark These are some used by used us (Imaginea Labs - Data Sciences - 4th Floor, Hyd).
  • 33. Suggestions Challenges in DS & Tips to who want to start.
  • 34. Suggestions? ● Data Preparation ○ “Give me six hours to chop down a tree and I will spend the first four sharpening the axe”. Abraham Lincoln ○ Python, Scala, Excel, Databases(regex). ● Data Analytics ○ “Seeing is believing” ○ Python(Matplotlib, Seaborn), D3.Js, Excel. ● Data Models ○ “There are no perfect solutions, but some work better” ○ Learn 2-3 types of Clustering, Regression Models(LR,RF,SVM,KNN,XGB) ● Evaluation ○ “A product not tested is broken by default” ○ Accuracy, RMSE, Precision-Recall, F1 Score
  • 35. Questions? Sampath - Desk 4F 072. Imaginea Labs - Data Sciences. Sachin, Keerat, Bipul, Kavi, Mageshwaran.

Editor's Notes

  • #3: Big Data - Blue whale.
  • #4: Lets start
  • #5: 1952 - Tic Tac Toe # Picture Above. First Human vs Computer race started. 1997 - Deep Blue - Chess ==> Exploring Solution Space 2011 - Watson - Jeopardy ==> Constructive Reasoning 2017 - Alpha Go - Go - [Possibilities > total no. atoms in this universe] ==> Developing Intuition
  • #6: 1952 - Tic Tac Toe 1997 - Deep Blue - Chess ==> Exploring Solution Space 2011 - Watson - Jeopardy ==> Constructive Reasoning 2017 - Alpha Go - Go - [Possibilities > total no. atoms in this universe] ==> Developing Intuition
  • #8: AQ - System Admins/Developers/ QA/ HR/ AQ - How many of you heard of Data Science? Can you explain me, what is data science to you?
  • #9: Learn to draw - Newton’s observation of Apple falling from a Tree. Trojan Horse. Galileo - Watching ships moving, Kepler’s Law - Planetary System. Edision - bulb.
  • #10: > Newton’s Laws of Motions > Laws of Diminishing Returns > Kepler’s Laws of Planetary Motions > U-235 Chain Reaction > Arts - Music, Painting, Linguistics,..
  • #12: Usual Method: Data ⇒ Analysis ⇒ Rules/ Principles. Data ⇒ Principles/Laws/Observation ⇒ Evaluation Experiments ⇒ Real Life Applications. # Landing on Moon # Talking to a person at the other End of the world # Flying to other end of worlds
  • #13: Basic Fundamental
  • #15: Artificial Intelligence. Actual Goal of - simulate a human being. 1 understand 2 (action) interact 3 expressive # they know table manners Like a child, first achievement is talking first step. 1 understand situations 2 acting(judge height/speed/time)
  • #19: Wrangling - Structuring Data. Preparations -> Numbers.
  • #23: Experience is the best mentor.
  • #24: Passive Designs > Generative > Intuitive
  • #25: Director of Applied Research & Innovation, Autodesk 3D Printed AI Design - Cabin Partition for Airbus - A320 Cars - Manufactured to Farmed Buildings - Constructions to Growns Cities - Isolated to Connected
  • #26: Traditions Race Car Chassis - Gave Nervous System - 4 Billions Data Points
  • #27: 4 Billions Data Points
  • #28: AI - Predicting if a Song will be HIT Songs - Optimal Mathematical Patterns 25 Million Views
  • #29: #### Minority Report is a 2002 American Sci-Fi #### Director:Steven Spielberg #### Starring:Tom Cruise, Colin Farrell, Samantha Morton, Max von Sydow
  • #30: Project Interlace - Singapore DayLights Problems + Energy Consumption + Water Bodies(micro Climates)
  • #32: Experience is the best mentor.
  • #33: Open Sourced Tools used us, if you are planning to use these - you can take some help.
  • #35: Add pyramid Model.
  • #37: Big Data - Blue whale.