AI Data Acquisition
DATA ACQUISITION
• Problem Scoping
• Data Acquisition
• Data Exploration
• Modelling
• Evaluation
• The first step in the lifecycle of an AI
project is Problem Scoping
• Scoping helps in setting the goal of the
project
• Identifies the stakeholders
• Defines the scope of the project
• Also sets the measure for evaluating
the system
Read the following articles. Let us come
up with a problem statement to develop
an AI system for this purpose.
• Threat to Elephants
• Impact of poaching
• Importance of elephants
• Conservation by WWF
WHO? :
Who are the stakeholders?
Elephants, WWF
What do you know about the stakeholders?
Elephants live in the forest. They need food and water
for living. They move around in search of food. They
cross over to human habitats.
WWF: An organisation that protects wildlife and
ensures its safety. It works with the governments of
different countries for this purpose.
WHAT? :
What is the problem?
The problem is that elephants are in danger of
extinction due to poaching and conflict with humans.
How do we know that it is a problem?
The problem has been brought to our notice
through articles in newspapers and on websites.
WHERE? :
What is the context / situation in which
the stakeholders experience the problem?
The problem arises from the high demand for ivory
in China and from conflict with humans over habitat.
WHY? :
What would be of key value to the stakeholders?
Ensuring the safety of elephants, which play a
major role in the ecological cycle, thereby helping
WWF achieve its goal of protecting elephants.
How would it improve the situation?
The solution would ensure habitat for elephants
and allow dependent organisms to thrive as well.
The process of collecting accurate and
reliable data, which forms the base of an AI
system, is called Data Acquisition.
Data is information, such as facts and
statistics, collected together for reference
or analysis.
For example, if you want to predict next year's
salary of an employee, what would you
need to know? The previous salaries of
employees.
An intelligent system uses previous salary data to predict
the future salary. The AI system is trained using the
previous salary data. We call this data set the training data.
Once training is over, we use data the system did not see
during training to evaluate the model.
The data used for evaluation is called testing data.
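
The salary example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the salary figures are made up, the helper names (split_data, fit_line, predict, mean_absolute_error) are hypothetical, and a straight-line fit stands in for whatever model a real AI system might use. It shows the earlier records being used as training data and the later records being held back as testing data.

```python
# Hypothetical salary history as (year, salary) pairs.
# These figures are invented purely for illustration.
salary_history = [
    (2016, 30000), (2017, 32000), (2018, 34000), (2019, 36000),
    (2020, 38000), (2021, 40000), (2022, 42000), (2023, 44000),
]


def split_data(records, train_fraction=0.75):
    """Split records, in time order, into training data and testing data."""
    cut = int(len(records) * train_fraction)
    return records[:cut], records[cut:]


def fit_line(records):
    """Fit salary = slope * year + intercept using least squares."""
    n = len(records)
    mean_x = sum(x for x, _ in records) / n
    mean_y = sum(y for _, y in records) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in records)
    sxx = sum((x - mean_x) ** 2 for x, _ in records)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, year):
    """Predict the salary for a given year from the fitted line."""
    slope, intercept = model
    return slope * year + intercept


def mean_absolute_error(model, records):
    """Average absolute gap between predicted and actual salaries."""
    return sum(abs(predict(model, x) - y) for x, y in records) / len(records)


# Train on the earlier years, then evaluate on the years held back.
training_data, testing_data = split_data(salary_history)
model = fit_line(training_data)
test_error = mean_absolute_error(model, testing_data)
next_year_salary = predict(model, 2024)

print("Training data:", training_data)
print("Testing data:", testing_data)
print("Error on testing data:", test_error)
print("Predicted salary for 2024:", next_year_salary)
```

The testing data is kept apart from the training data so that the evaluation shows how the model behaves on records it has not already learned from.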