What is data science and why is it important now?
Author – Bohitesh Misra (bohitesh.misra@gmail.com), September 2017
Data Science!
In layman's terms, data scientists collect data from various sources, clean
it, organize it and shape it so that it can be analyzed. The data can be split
into training and testing sets: a model or algorithm built with statistical
methods is fitted on the training set and assessed on the testing set, and the
approach can then be applied to any area or sector where it is suitable. Data
mining helps end users extract useful business information from large databases.
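The training/testing split mentioned above can be sketched in a few lines of Python. This is only a minimal illustration using the standard library; the function name `train_test_split`, the 25% test fraction and the toy `records` list are assumptions made for the example, not part of the original article.

```python
import random


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of `rows` and return (train, test) lists."""
    rng = random.Random(seed)      # fixed seed so the split is repeatable
    shuffled = list(rows)          # copy, so the caller's data is untouched
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical dataset: 20 numbered records.
records = list(range(20))
train, test = train_test_split(records)
print(len(train), "training rows;", len(test), "testing rows")
```

The model is then fitted only on `train`, and `test` is held back to check how well it generalizes to data it has not seen.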
Asking the right questions
Asking the right questions is extremely important, and hence good
communication skills are essential for data scientists. With the advent of
technology and the internet, we now have instant access to data and to the
tools needed to test our interpretations and make decisions quickly.
Data scientist
Data scientists use their data and analytical skills to find and interpret rich
data sources; manage large volumes of data; merge data sources; ensure
consistency of datasets; create visualizations that help in understanding data;
build mathematical models using the data; and present and communicate the
insights and findings to business decision makers.
"Data scientist" has become a popular buzzword, with Harvard Business
Review dubbing it "The Sexiest Job of the 21st Century" and McKinsey &
Company projecting that global demand will exceed supply by 1.5 million data
scientists.
Statistical models
How does data mining work? It works much the same way a human being does:
it uses historical information to learn for the future. Mathematical
foundations such as linear algebra, probability, statistics and calculus, and
techniques such as regression, clustering and predictive analysis, are
indispensable in data science. Python and R are popular programming languages
with packages and libraries built specifically for data science, which make it
possible to learn programming and start applying it quickly. I began with R and
use its basic libraries for text and data mining.
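As a small illustration of "using historical information to learn for the future", the sketch below fits a straight line to past observations by ordinary least squares and uses it to forecast a new value. The data (`ad_spend`, `sales`) and the helper `fit_line` are hypothetical and chosen only for the example; real projects would normally rely on a statistics or machine-learning library.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of co-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical history: advertising spend and the sales that followed.
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [3.1, 4.9, 7.2, 8.8, 11.0]

slope, intercept = fit_line(ad_spend, sales)
forecast = slope * 6.0 + intercept   # prediction for a spend not yet observed
print(f"slope={slope:.2f} intercept={intercept:.2f} forecast={forecast:.2f}")
```

The same idea, learning parameters from past data and applying them to new inputs, underlies the regression and predictive techniques listed above.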
Data Cleaning
By a commonly cited estimate, 80% of a data scientist's work is data cleaning.
Data is sometimes available in convenient formats such as CSV and XLS, but very
little data is directly usable by a program. APIs, web scraping and SQL come to
the rescue of data scientists. Spark and MapReduce are used to clean and analyze
large, distributed datasets.
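To give a feel for what cleaning involves, here is a minimal sketch that reads a small CSV sample, trims stray whitespace, normalizes capitalization, converts ages to integers (marking unusable values as missing) and drops duplicate records. The sample data, the column names and the `clean_rows` helper are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical raw export with typical problems: padding, inconsistent
# capitalization, a missing value, a non-numeric value and a duplicate.
RAW = """name,age,city
 Alice ,34,Delhi
Bob,,Mumbai
alice,34,delhi
Carol,not available,Pune
"""


def clean_rows(text):
    """Return a list of cleaned, de-duplicated records from CSV text."""
    cleaned, seen = [], set()
    for row in csv.DictReader(io.StringIO(text)):
        name = row["name"].strip().title()
        city = row["city"].strip().title()
        age_text = row["age"].strip()
        # Keep the age only if it is a plain number; otherwise mark as missing.
        age = int(age_text) if age_text.isdigit() else None
        key = (name, city)
        if key in seen:            # same person and city already recorded
            continue
        seen.add(key)
        cleaned.append({"name": name, "age": age, "city": city})
    return cleaned


rows = clean_rows(RAW)
for record in rows:
    print(record)
```

Each rule here (what counts as a duplicate, how to treat a missing age) is a judgment call that depends on the dataset, which is part of why cleaning takes so much of the effort.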
It’s everywhere!
Data-driven solutions are used everywhere, from e-commerce websites and
social networking sites to financial visualization and interpretation.
Companies have increasingly adopted data-driven practices over the
last few years. In fact, it would be difficult to find a sector in which data
science cannot be used to make better decisions, and companies are gradually
realizing this and adopting it.
Want to learn it?
I came across data science, decided it was the right fit for me, and recently
completed the Executive Management Programme at the Indian Institute of
Technology Delhi in the same subject. Learning data science is easy and
convenient, with the large number of MOOCs and eBooks available for free
online.
I urge you to think about how it may apply to you. In your business, you can
gather data in the form of customer reviews and opinions to make better
data-driven decisions. In everyday life, you can use the data from movie review
sites to choose your next movie.
Data science for Startups
Startups critically need a data strategy for the collection, storage and
usage of large volumes of data, so that the data serves the startup's core
selling point and can also open up additional monetisation avenues in the
future.
A common case is a recommendation engine, which can benefit from
all kinds of information about users: age, gender, purchases, offers and
discounts. Designing the platform in a way that improves information
collection from its users results in a large database that can be used to
manage discount deals better and to improve advertising or even the user
experience on the platform.
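One very simple way such an engine could use purchase data is to recommend items that are often bought together with what a user already owns. The sketch below is only an illustration of that idea; the `purchases` data and the helpers `build_co_counts` and `recommend` are hypothetical, and production systems typically use richer signals and models.

```python
from collections import Counter
from itertools import combinations

# Hypothetical purchase history: user id -> set of items bought.
purchases = {
    "u1": {"laptop", "mouse", "keyboard"},
    "u2": {"laptop", "mouse"},
    "u3": {"mouse", "keyboard"},
    "u4": {"laptop", "monitor"},
}


def build_co_counts(baskets):
    """Count how often each pair of items appears in the same basket."""
    counts = {}
    for items in baskets.values():
        for a, b in combinations(sorted(items), 2):
            counts.setdefault(a, Counter())[b] += 1
            counts.setdefault(b, Counter())[a] += 1
    return counts


def recommend(user, baskets, co_counts, top_n=2):
    """Suggest items the user does not own, ranked by co-purchase count."""
    owned = baskets[user]
    scores = Counter()
    for item in owned:
        for other, n in co_counts.get(item, {}).items():
            if other not in owned:
                scores[other] += n
    # Highest score first; ties broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [item for item, _ in ranked[:top_n]]


co_counts = build_co_counts(purchases)
print(recommend("u2", purchases, co_counts))
```

The more purchase data the platform collects, the more reliable these co-purchase counts become, which is the practical payoff of the data strategy described above.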
A clear data strategy can give startups additional revenue opportunities
and a competitive advantage.