0% found this document useful (0 votes)
43 views

Data Science Chapitre 0

The document provides an overview of machine learning and data science topics. It begins with an introduction to machine learning, discussing why it is important and what machine learning is. It then outlines several machine learning algorithms and techniques across supervised learning methods like regression and classification, unsupervised learning methods like clustering, and other approaches such as deep learning, natural language processing, reinforcement learning, and dimensionality reduction. The document provides a high-level tour of common machine learning concepts and applications.

Uploaded by

Leonel Ska
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views

Data Science Chapitre 0

The document provides an overview of machine learning and data science topics. It begins with an introduction to machine learning, discussing why it is important and what machine learning is. It then outlines several machine learning algorithms and techniques across supervised learning methods like regression and classification, unsupervised learning methods like clustering, and other approaches such as deep learning, natural language processing, reinforcement learning, and dimensionality reduction. The document provides a high-level tour of common machine learning concepts and applications.

Uploaded by

Leonel Ska
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 25

Data Science

Mahdi Louati
3 GLID
September, 19th 2022
Content
s
0. Welcome to Machine
Learning
1. Data
Preprocessing
2. Regression Models

0.1 Why Machine Learning is the Future 1.1. Importing the Librairies 2.1. Simple Linear Regression (SLR)
0.2. What is machine Learning 1.2. Importing the Dataset 2.2. Multiple Linear Regression (MLR)
0.3. Installing Python and Anaconda 1.3. Missing Data 2.3. Polynomial Regression
1.4. Categorical Data 2.4. Support Vector Regression (SVR)
1.5. Training Set and Test Set 2.5. Decision Tree Regression
1.6. Feature Scaling 2.6. Random Forest Regression
2.7. Evaluation Regression Models

3. Classification 4. Clustering 5. Dimensionality Reduction


Models
3.1. Logistic Regression 4.1. K-Means Clustering 5.1. Principal Component Analysis (PCA)
3.2. K-Nearest Neighbors 4.2. Hierarchical Clustering 5.2. Linear Discriminant Analysis (LDA)
3.3. Support Vector Machine (SVM) 5.3. Kernel PCA
3.4. Kernel SVM
3.5. Naïve Bayes
3.6. Decision Tree classification

6. Reinforcement 7. Natural langage Processing 8. Deep


Learning
6.1. Upper confidence Boundary (UCB) (NLP) Learning
8.1. Artificial Neural Networks
6.2. Thompson Sampling 8.2 Convolution Neural Networks
Intelligence Knowledge Reasoning

Knowledge “The Problem-Solving skill.”

Reasoning “A generation of new knowledge.”


AI research uses techniques from many fields, such as linguistics, philosophy, logic, computer science,
psychology,…

These techniques are used in several applications such as control system, natural language processing, facial
recognition, voice recognition, business analytics, pattern matchnig and data mining.

Psycol Philos
ogy ophy
AI Lingui
Logic Comp stics
uter
Scienc
e
Artificial Intelligence, Machine Learning, Deep Learning and Data Science are popular terms in this era and knowing what it
is and the difference between them is crucial. Although these terms might be closely related there are differences between
them see the image below to visualize it.

Artificial intelligence (AI) is the set of theories and techniques


implemented to create machines able to simulate the human intelligence.

Machine Learning (ML) is that field of computer science with the help
of which computer systems can provide sense to data in much the same
way as human beings do.

Machine Learning is a subset of Artificial Intelligence that uses statistical learning algorithms to build systems that have the
ability to automatically learn and improve from experiences without being explicitly programmed or human intervention. .

Deep learning is a machine learning technique that is inspired by the way a human brain filters information, it is basically
learning from examples. It helps a computer model to filter the input data through layers to predict and classify information. 
Conférence Claude Shannon (Father of Information Theory) 1916-2001
Dartmouth 1956 John McCarthy (Creator of the « Lisp » Programming Language)
1927-2011.

Marvin Minsky (Cognitive Science and Artificial Intelligence)


1927-2016

Allen Newell (Computer Scientist) 1927-1992


Herbert A. Simon (Nobel Prize in Economics 1978) 1917-2001
Conférence
« How to make machines able of simulating human Dartmouth 2006
learning and any other characteristic of intelligence.»
The workshop lasted two months.
No consensus was reached.
They chose the name Artificial Intelligence for the discussed
field
1997 DEEPER BLUE (IBM) defeats Kasparov (World Chess Champion)

WATSON (DeepQA : prog. Inf. in IANLP of IBM) defeats Brad RUTTER and
2011
Kenn JENNINGS in Jeopardy! and wins 1 million $

AlphaGo (prog. Inf. in IA of Google DeepMind) defeats Lee SEDOL (best professional player
2016
in Go game)

This reshaped people's view of AI and unveiled its overwhelming development


AI in the Eyes of Researchers
"I propose to consider the question, 'Can machines think?.” (A. Turing 1950)

The branch of computer science concerned with making computers behave like humans. (J. McCarthy 1956)

The science of making machines do things that would require intelligence if done by men. (M. Minsky)

Identification of concepts related to AI and machine learning AI


Development Report 2020
Data Scientist vs Data
Analyst
Conten
u
0. Welcome to Machine
Learning
1. Data
Preprocessing
2. Regression Models

0.1 Why Machine Learning is the Future 1.1. Importing the Librairies 2.1. Simple Linear Regression (SLR)
0.2. What is machine Learning 1.2. Importing the Dataset 2.2. Multiple Linear Regression (MLR)
0.3. Installing Python and Anaconda 1.3. Missing Data 2.3. Polynomial Regression
1.4. Categorical Data 2.4. Support Vector Regression (SVR)
1.5. Training Set and Test Set 2.5. Decision Tree Regression
1.6. Feature Scaling 2.6. Random Forest Regression
2.7. Evaluation Regression Models

3. Classification 4. Clustering 5. Dimensionality Reduction


Models
3.1. Logistic Regression 4.1. K-Means Clustering 5.1. Principal Component Analysis (PCA)
3.2. K-Nearest Neighbors 4.2. Hierarchical Clustering 5.2. Linear Discriminant Analysis (LDA)
3.3. Support Vector Machine (SVM) 5.3. Kernel PCA
3.4. Kernel SVM
3.5. Naïve Bayes
3.6. Decision Tree classification

6. Reinforcement 7. Natural langage Processing 8. Deep


Learning
6.1. Upper confidence Boundary (UCB) (NLP) Learning
8.1. Artificial Neural Networks
6.2. Thompson Sampling 8.2 Convolution Neural Networks
Section 0

00 Welcome to Machine Learning


0.1 Why Machine Learning is the Future
0.2 What is machine Learning
0.3 Installing Python and Anaconda
SINCE THE DAWN OF TIME…
UP UNTIL 2005…
HUMANS HAD CREATED…130 EXABYTES OF DATA

EXABYTES?

200 to 500 letters * * *

1 Byte 1 1 Megabyte 1 Gigabyte 1 Terabyte


Kilobyte

A
500*10⁶ Hectars of trees

70 to 80 years 500*10⁹ trees *

Some 1 to 2 Betabyte 1 Exabyte =Bytes


Terabytes
Books, Songs, Speeches and all the data produced by humans
2020
40900
EXABYTE
2015 S
7900
EXABYTE
S
2010
1200
EXABYTES
2005
130
EXABYTES

Data grows exponentially, we create more data than before.


0. Welcome to Machine
Learning 0.1. What is Machine
Learning
Machine Learning (ML) is one of the biggest fields of Data Science. It is an application of
Artificial Intelligence (AI) that provides systems the ability to automatically learn and improve
from experience without being explicitly programmed.

It is grown so quickly over the past decade that now you are almost expected to know some level
of Machine Learning to call yourself a Data Scientist.

The Machine learning is so pervasive today that you probably use it dozens of times a day
without knowing it. It focuses on the development of computer programs that can access data
and use it learn for themselves. It has several domains of applications
Topics
include:

Supervised Learning is a Machine Learning task consisting of learning a prediction function


from annotated examples, as opposed to Unsupervised Learning.

Regression problems are distinguished from Classification problems. Thus, it is considered that
the problems of predicting a quantitative variable are Regression problems whereas the problems
of predicting a qualitative variable are Classification problems.

Although both types of learning are Artificial Intelligence, in the Supervised Learning a
researcher is there to "guide" the algorithm on the path of learning by providing him with
examples that he considers convincing after having previously labeled expected results. Artificial
Intelligence then learns from each example, with the aim of being able to generalize its learning
to new cases.
In the case of Unsupervised Learning, Machine Learning is completely autonomous. Data is
then communicated to the machine without providing the examples of output results expected .
You will learn about the most effective machine learning techniques.

You gain practice implementing them and getting them to work for yourself.

Youwill learn about not only the theoretical underpinnings of learning but also gain the practical
know-how needed to quickly and powerfully apply these techniques to new problems.
0. Welcome to Machine
Learning 0.2. Installing Python and
Anaconda
Python is an interpreted programming language, multi-paradigm and multiplatform. It promotes
structured, functional and object-oriented imperative programming. It was developed by the
Dutch Guido van Rossum in 1989.

Spider is a development environment. It is a kind of studio with very practical tools and it is
integrated in the best platform Anaconda.

Anaconda is the best open source Data Science platform since it contains most of the Machine
Learning Libraries, the most practical to use for building models is the Scikit-learn Library.

Python Spider Anaconda

You might also like