SlideShare a Scribd company logo
Data Science Tutorial | What is Data Science? | Data Science For Beginners | Edureka
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Agenda
1. Need for Data Science
2. Walmart Use Case
3. What is Data Science?
4. Who is a Data Scientist?
5. Data Science – Skill Set
6. Data Science Job Roles
7. Data Life Cycle
8. Introduction to Machine Learning
9. K – Means Use Case
10. K – Means Algorithm
11. Hands - On
12. Data Science Certification
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Need For Data Science
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Sources
Mobile Cloud Smart Car
Evolution of
Technology
IOT
Social Media
Other factors
Telephone Desktop Car
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Sources
Evolution of
Technology
IOT
Social Media
Other factors
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Sources
Evolution of
Technology
IOT
Social Media
Other factors
347,222 tweets1,736,111 pictures 204,000,000 emails
300 hours of video
uploaded
4,166,667 likes &
200,000 photos
4,166,667 likes &
200,000 photos
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Sources
Evolution of
Technology
IOT
Social Media
Other factors
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Walmart Use Case
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Analysis At Walmart
Halloween and cookie sales
Data scientist at Walmart found a connection between Halloween and the sales of cookies.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Analysis At Walmart
Hurricane and strawberry pop tarts
Data scientist at Walmart found that sales of Strawberry pop-tarts increased by 7 times before a Hurricane.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Analysis At Walmart
Social media and cake pops
Walmart is leveraging social media data to find about the trending products so that they can be introduced to
the Walmart stores across the world
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
What Is Data Science?
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
What is Data Science?
Data Science is the process of extracting knowledge and insights
from data by using scientific methods.
Scientific methods:
Programming + Statistics + Business
“Torture the data, and it will confess to anything.”
~ Ronald Coase, Economics, Nobel Prize
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Who Is A Data Scientist?
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Who Is A Data Scientist?
Mathematics
Business Technology
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Science – Skill Set
Programming
languagesStatistics
Machine Learning
Big Data processing
frameworks
Data wrangling &
exploration
Data visualisation
Data extraction &
processing
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Science Job Roles
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Science Job Roles
Data Scientist Data Analyst Data Architect Data Engineer
Statistician
Database
Administrator
Business Analyst
Data & Analytics
Manager
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Science Life Cycle
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Life Cycle
Data Science
Business
requirements
Data
acquisition
Data
processing
Data
exploration
Modelling
Deployment
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Life Cycle
Understand the problem
Identify central objectives
Identify variables that need
to be predicted
Business requirements
Data acquisition
Data Processing
Data exploration
Modelling
Deployment
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Life Cycle
Business requirements
Data acquisition
Data Processing
Data exploration
Modelling
Deployment
What data do I need for my project?
What are the data sources?
How can I obtain the data?
What is the most efficient way to
store and access all of it?
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Life Cycle
Business requirements
Data acquisition
Data Processing
Data exploration
Modelling
Deployment
Transform data into desired format
Data cleaning
• Missing values
• Corrupted data
• Remove unnecessary
data
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Life Cycle
Business requirements
Data acquisition
Data Processing
Data exploration
Modelling
Deployment
understand the patterns in the data
Retrieve useful insight
form hypotheses
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Life Cycle
Business requirements
Data acquisition
Data Processing
Data exploration
Modelling
Deployment
Determine optimal data features
for the machine-learning model
Create a model that predicts the
target most accurately
Evaluate & test the efficiency of
the model
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Life Cycle
Business requirements
Data acquisition
Data Processing
Data exploration
Modelling
Deployment
Check the deployment environment
for dependency issues
Deploy the model in a pre-
production/ test environment
Monitor the performance
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Introduction To Machine Learning
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
What Is Machine Learning?
Machine learning is a subset of artificial intelligence (AI) which provides machines the ability to learn automatically &
improve from experience without being explicitly programmed.
They look the same!
Cherry
Apple
Orange
Data
Algorithm
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Types Of Machine Learning
Reinforcement LearningSupervised Learning Unsupervised Learning
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Use Case
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Brain Tumour Detection Using K - means
Brain tumour segmentation deals with the implementation of the k-means
algorithm for detection of range and shape of tumour in brain MR images.
K-Means clustering is an unsupervised learning algorithm used to partition a dataset
into k clusters in which each data point belongs to the cluster with the nearest mean.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence
➢Randomly initialize k points called the cluster centroids.
Here, k = 2
➢Value of k(number of clusters) can be determined by the elbow
curve.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence
➢Compute the distance between the data points and the
cluster centroid initialized.
➢Depending upon the minimum distance, data points are
divided into two groups.
1
2
Euclidean distance
Cluster
centroid
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence
➢Compute mean of red dots & reposition red cluster
centroid to this mean
➢Compute mean of green dots & reposition green
cluster centroid to this mean.
1
2
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence
1
2
➢Repeat previous two steps iteratively till the cluster
centroids stop changing their positions.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence 1
2
➢Repeat previous two steps iteratively till the cluster
centroids stop changing their positions.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence 1
2
➢Repeat previous two steps iteratively till the cluster
centroids stop changing their positions.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence 1
2
➢Repeat previous two steps iteratively till the cluster
centroids stop changing their positions.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
Initialization
Cluster assignment
Move centroid
Optimization
Convergence 1
2
➢Finally, k-means clustering algorithm converges.
➢Divides the data points into two clusters clearly visible in
red and green.
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
K – Means Algorithm
➢ Data Matrix
➢ Distance/ dissimilarity Matrix
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Hands - On
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Data Science Certification
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Edureka’s Data Science Certification
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
Edureka’s Data Science Certification
Introduction to
Data Science
Statistical
Inference
Data extraction,
wrangling &
exploration
Introduction to
Machine Learning
Classification
techniques
Unsupervised
Learning
Recommender
engine Text Mining Time seriesDeep Learning
DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science
WebDriver vs. IDE vs. RC
➢ Data Warehouse is like a relational database designed for analytical needs.
➢ It functions on the basis of OLAP (Online Analytical Processing).
➢ It is a central location where consolidated data from multiple locations (databases) are stored.
Ad

More Related Content

What's hot (20)

Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Edureka!
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
bhavesh lande
 
Data Science
Data ScienceData Science
Data Science
Rabin BK
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
Sampath Kumar
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
Jason Geng
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Edureka!
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
Spotle.ai
 
Data Science Full Course | Edureka
Data Science Full Course | EdurekaData Science Full Course | Edureka
Data Science Full Course | Edureka
Edureka!
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
Edureka!
 
Data science presentation
Data science presentationData science presentation
Data science presentation
MSDEVMTL
 
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Edureka!
 
Data science
Data scienceData science
Data science
Mohamed Loey
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
Tharushi Ruwandika
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
SadhanaParameswaran
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
Data Science Club
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
ANOOP V S
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Niko Vuokko
 
Data science
Data scienceData science
Data science
Ranjit Nambisan
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
VijayMohan Vasu
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
Edureka!
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Edureka!
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
bhavesh lande
 
Data Science
Data ScienceData Science
Data Science
Rabin BK
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
Sampath Kumar
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
Jason Geng
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Edureka!
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
Spotle.ai
 
Data Science Full Course | Edureka
Data Science Full Course | EdurekaData Science Full Course | Edureka
Data Science Full Course | Edureka
Edureka!
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
Edureka!
 
Data science presentation
Data science presentationData science presentation
Data science presentation
MSDEVMTL
 
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Edureka!
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
SadhanaParameswaran
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
Data Science Club
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
ANOOP V S
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Niko Vuokko
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
VijayMohan Vasu
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
Edureka!
 

Similar to Data Science Tutorial | What is Data Science? | Data Science For Beginners | Edureka (20)

Predictive Analytics Using R | Edureka
Predictive Analytics Using R | EdurekaPredictive Analytics Using R | Edureka
Predictive Analytics Using R | Edureka
Edureka!
 
Next Generation Manufacturing
Next Generation ManufacturingNext Generation Manufacturing
Next Generation Manufacturing
Elliot Duff
 
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
Edureka!
 
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
Edureka!
 
Data Science : Make Smarter Business Decisions
Data Science : Make Smarter Business DecisionsData Science : Make Smarter Business Decisions
Data Science : Make Smarter Business Decisions
Edureka!
 
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Edureka!
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
Inside Analysis
 
OA centre of excellence
OA centre of excellenceOA centre of excellence
OA centre of excellence
Object Automation Software Solutions (P) Ltd.
 
Sentiment Analysis In Retail Domain
Sentiment Analysis In Retail DomainSentiment Analysis In Retail Domain
Sentiment Analysis In Retail Domain
Edureka!
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearning
Hoa Le
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
Paolo Missier
 
Maciej Marek (Philip Morris International) - The Tools of The Trade
Maciej Marek (Philip Morris International) - The Tools of The TradeMaciej Marek (Philip Morris International) - The Tools of The Trade
Maciej Marek (Philip Morris International) - The Tools of The Trade
Codiax
 
2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf
2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf
2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf
AlexandreMacedo50
 
The Future of Data Science Course In Kochi
The Future of Data Science Course In KochiThe Future of Data Science Course In Kochi
The Future of Data Science Course In Kochi
navnith990
 
Future in Data Science course in kerala.pdf
Future in Data Science course in kerala.pdfFuture in Data Science course in kerala.pdf
Future in Data Science course in kerala.pdf
jeffiiii007
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
Denodo
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosas
Merce Crosas
 
Data science course in Kerala .(PPT)AKSHYA VALSAN.pdf
Data science course in Kerala .(PPT)AKSHYA VALSAN.pdfData science course in Kerala .(PPT)AKSHYA VALSAN.pdf
Data science course in Kerala .(PPT)AKSHYA VALSAN.pdf
akshayaabhishekabhis
 
Data Science
Data ScienceData Science
Data Science
VictorFreemanAdekunl
 
How to crack down big data?
How to crack down big data? How to crack down big data?
How to crack down big data?
Ta-Wei (David) Huang
 
Predictive Analytics Using R | Edureka
Predictive Analytics Using R | EdurekaPredictive Analytics Using R | Edureka
Predictive Analytics Using R | Edureka
Edureka!
 
Next Generation Manufacturing
Next Generation ManufacturingNext Generation Manufacturing
Next Generation Manufacturing
Elliot Duff
 
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
Edureka!
 
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
Edureka!
 
Data Science : Make Smarter Business Decisions
Data Science : Make Smarter Business DecisionsData Science : Make Smarter Business Decisions
Data Science : Make Smarter Business Decisions
Edureka!
 
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Edureka!
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
Inside Analysis
 
Sentiment Analysis In Retail Domain
Sentiment Analysis In Retail DomainSentiment Analysis In Retail Domain
Sentiment Analysis In Retail Domain
Edureka!
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearning
Hoa Le
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
Paolo Missier
 
Maciej Marek (Philip Morris International) - The Tools of The Trade
Maciej Marek (Philip Morris International) - The Tools of The TradeMaciej Marek (Philip Morris International) - The Tools of The Trade
Maciej Marek (Philip Morris International) - The Tools of The Trade
Codiax
 
2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf
2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf
2024-07-eb-big-book-of-data-engineering-3rd-edition.pdf
AlexandreMacedo50
 
The Future of Data Science Course In Kochi
The Future of Data Science Course In KochiThe Future of Data Science Course In Kochi
The Future of Data Science Course In Kochi
navnith990
 
Future in Data Science course in kerala.pdf
Future in Data Science course in kerala.pdfFuture in Data Science course in kerala.pdf
Future in Data Science course in kerala.pdf
jeffiiii007
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
Denodo
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosas
Merce Crosas
 
Data science course in Kerala .(PPT)AKSHYA VALSAN.pdf
Data science course in Kerala .(PPT)AKSHYA VALSAN.pdfData science course in Kerala .(PPT)AKSHYA VALSAN.pdf
Data science course in Kerala .(PPT)AKSHYA VALSAN.pdf
akshayaabhishekabhis
 
Ad

More from Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
Edureka!
 
What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
Edureka!
 
Ad

Recently uploaded (20)

Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Rusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond SparkRusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond Spark
carlyakerly1
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath MaestroDev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
UiPathCommunity
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Rusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond SparkRusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond Spark
carlyakerly1
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath MaestroDev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
UiPathCommunity
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 

Data Science Tutorial | What is Data Science? | Data Science For Beginners | Edureka

  • 2. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Agenda 1. Need for Data Science 2. Walmart Use Case 3. What is Data Science? 4. Who is a Data Scientist? 5. Data Science – Skill Set 6. Data Science Job Roles 7. Data Life Cycle 8. Introduction to Machine Learning 9. K – Means Use Case 10. K – Means Algorithm 11. Hands - On 12. Data Science Certification
  • 3. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Need For Data Science
  • 4. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Sources Mobile Cloud Smart Car Evolution of Technology IOT Social Media Other factors Telephone Desktop Car
  • 5. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Sources Evolution of Technology IOT Social Media Other factors
  • 6. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Sources Evolution of Technology IOT Social Media Other factors 347,222 tweets1,736,111 pictures 204,000,000 emails 300 hours of video uploaded 4,166,667 likes & 200,000 photos 4,166,667 likes & 200,000 photos
  • 7. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Sources Evolution of Technology IOT Social Media Other factors
  • 8. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Walmart Use Case
  • 9. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Analysis At Walmart Halloween and cookie sales Data scientist at Walmart found a connection between Halloween and the sales of cookies.
  • 10. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Analysis At Walmart Hurricane and strawberry pop tarts Data scientist at Walmart found that sales of Strawberry pop-tarts increased by 7 times before a Hurricane.
  • 11. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Analysis At Walmart Social media and cake pops Walmart is leveraging social media data to find about the trending products so that they can be introduced to the Walmart stores across the world
  • 12. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science What Is Data Science?
  • 13. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science What is Data Science? Data Science is the process of extracting knowledge and insights from data by using scientific methods. Scientific methods: Programming + Statistics + Business “Torture the data, and it will confess to anything.” ~ Ronald Coase, Economics, Nobel Prize
  • 14. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Who Is A Data Scientist?
  • 15. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Who Is A Data Scientist? Mathematics Business Technology
  • 16. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Science – Skill Set Programming languagesStatistics Machine Learning Big Data processing frameworks Data wrangling & exploration Data visualisation Data extraction & processing
  • 17. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Science Job Roles
  • 18. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Science Job Roles Data Scientist Data Analyst Data Architect Data Engineer Statistician Database Administrator Business Analyst Data & Analytics Manager
  • 19. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Science Life Cycle
  • 20. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Life Cycle Data Science Business requirements Data acquisition Data processing Data exploration Modelling Deployment
  • 21. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Life Cycle Understand the problem Identify central objectives Identify variables that need to be predicted Business requirements Data acquisition Data Processing Data exploration Modelling Deployment
  • 22. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Life Cycle Business requirements Data acquisition Data Processing Data exploration Modelling Deployment What data do I need for my project? What are the data sources? How can I obtain the data? What is the most efficient way to store and access all of it?
  • 23. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Life Cycle Business requirements Data acquisition Data Processing Data exploration Modelling Deployment Transform data into desired format Data cleaning • Missing values • Corrupted data • Remove unnecessary data
  • 24. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Life Cycle Business requirements Data acquisition Data Processing Data exploration Modelling Deployment understand the patterns in the data Retrieve useful insight form hypotheses
  • 25. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Life Cycle Business requirements Data acquisition Data Processing Data exploration Modelling Deployment Determine optimal data features for the machine-learning model Create a model that predicts the target most accurately Evaluate & test the efficiency of the model
  • 26. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Life Cycle Business requirements Data acquisition Data Processing Data exploration Modelling Deployment Check the deployment environment for dependency issues Deploy the model in a pre- production/ test environment Monitor the performance
  • 27. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Introduction To Machine Learning
  • 28. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science What Is Machine Learning? Machine learning is a subset of artificial intelligence (AI) which provides machines the ability to learn automatically & improve from experience without being explicitly programmed. They look the same! Cherry Apple Orange Data Algorithm
  • 29. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Types Of Machine Learning Reinforcement LearningSupervised Learning Unsupervised Learning
  • 30. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Use Case
  • 31. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Brain Tumour Detection Using K - means Brain tumour segmentation deals with the implementation of the k-means algorithm for detection of range and shape of tumour in brain MR images. K-Means clustering is an unsupervised learning algorithm used to partition a dataset into k clusters in which each data point belongs to the cluster with the nearest mean.
  • 32. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm
  • 33. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence ➢Randomly initialize k points called the cluster centroids. Here, k = 2 ➢Value of k(number of clusters) can be determined by the elbow curve.
  • 34. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence ➢Compute the distance between the data points and the cluster centroid initialized. ➢Depending upon the minimum distance, data points are divided into two groups. 1 2 Euclidean distance Cluster centroid
  • 35. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence ➢Compute mean of red dots & reposition red cluster centroid to this mean ➢Compute mean of green dots & reposition green cluster centroid to this mean. 1 2
  • 36. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence 1 2 ➢Repeat previous two steps iteratively till the cluster centroids stop changing their positions.
  • 37. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence 1 2 ➢Repeat previous two steps iteratively till the cluster centroids stop changing their positions.
  • 38. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence 1 2 ➢Repeat previous two steps iteratively till the cluster centroids stop changing their positions.
  • 39. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence 1 2 ➢Repeat previous two steps iteratively till the cluster centroids stop changing their positions.
  • 40. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm Initialization Cluster assignment Move centroid Optimization Convergence 1 2 ➢Finally, k-means clustering algorithm converges. ➢Divides the data points into two clusters clearly visible in red and green.
  • 41. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science K – Means Algorithm ➢ Data Matrix ➢ Distance/ dissimilarity Matrix
  • 42. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Hands - On
  • 43. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Data Science Certification
  • 44. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Edureka’s Data Science Certification
  • 45. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science Edureka’s Data Science Certification Introduction to Data Science Statistical Inference Data extraction, wrangling & exploration Introduction to Machine Learning Classification techniques Unsupervised Learning Recommender engine Text Mining Time seriesDeep Learning
  • 46. DATA SCIENCE CERTIFICATION TRAINING www.edureka.co/data-science WebDriver vs. IDE vs. RC ➢ Data Warehouse is like a relational database designed for analytical needs. ➢ It functions on the basis of OLAP (Online Analytical Processing). ➢ It is a central location where consolidated data from multiple locations (databases) are stored.