Unit 2 Data Literacy

The document discusses various types of data used in AI, including textual and numeric data, and their applications in Natural Language Processing and statistical analysis. It outlines the process of data acquisition, emphasizing the importance of gathering suitable datasets for training AI models, and introduces concepts like data augmentation and sources of primary and secondary data. Additionally, it highlights the significance of cleaning and organizing data to make it usable for AI purposes.

Uploaded by yuvankrishnamp58
Session: Acquiring Data, Processing, and Interpreting Data


Types of Data in AI
1. Textual Data (Qualitative Data)
o Composed of words and phrases.
o Used in Natural Language Processing (NLP) tasks like sentiment analysis or text summarization.
o Example: Search queries ("Which is a good park nearby?").
2. Numeric Data (Quantitative Data)
o Consists of numbers.
o Used for statistical analysis and modelling.
o Examples: Cricket score, restaurant bill amount.
Further Classification of Numeric Data
· Continuous Data: Numeric data that can take any value within a continuous range. Examples: height, weight, temperature, voltage.
· Discrete Data: Numeric data that consists only of whole numbers and cannot be fractional. Example: number of students in a class.
Types of Data used in three domains of AI
· Computer Vision (CV): Visual Data, e.g. images, videos.
· Natural Language Processing (NLP): Textual Data, e.g. documents, PDF files.
· Statistical Data (SD): Numeric Data, e.g. tables, Excel sheets.
Questions to consider:
o Which dataset seems most relevant for training our AI weather predictor?
o Are there any limitations or restrictions mentioned for using the data?
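
The textual, discrete, and continuous categories described in this section can be illustrated with a small sketch. It assumes a deliberately simple rule that is not part of the original notes: strings count as textual data, whole numbers (Python int) as discrete data, and decimal numbers (Python float) as continuous data. The function name classify_value and the sample values are hypothetical.

```python
def classify_value(value):
    """Label one value as textual, discrete numeric, or continuous numeric.

    Assumption for illustration: the Python type decides the category.
    Real datasets often need a closer look (a count may be stored as 3.0).
    """
    if isinstance(value, str):
        return "textual"
    if isinstance(value, int):
        return "discrete numeric"
    if isinstance(value, float):
        return "continuous numeric"
    raise TypeError(f"unsupported value: {value!r}")


# Hypothetical sample values modelled on the examples in the notes.
samples = ["Which is a good park nearby?", 32, 36.6]
labels = [classify_value(v) for v in samples]

for value, label in zip(samples, labels):
    print(f"{value!r}: {label}")
```

Here 32 stands for the number of students in a class and 36.6 for a temperature reading.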

Data Acquisition / Acquiring Data


Data Acquisition, also known as acquiring data, refers to the procedure of gathering data. This involves searching for datasets suitable for training AI models. The process typically comprises three key steps: searching for suitable datasets, data augmentation, and data generation.

Data is the fuel that powers Artificial Intelligence (AI). But what exactly is data, and how does it get used to create intelligent machines? This interactive guide will take you on a journey to explore:
· Where Data Comes From: Dive into different types of data, how we find it, and even how to create new data!
· Speaking Data's Language: Learn about features, the unique characteristics that define each piece of data.
· Seeing the Bigger Picture: Explore how data is interpreted and visualized to reveal hidden patterns and insights.
· Making Data Usable: Discover the secrets behind cleaning and organizing data to make it ready for AI.

Imagine you're building a self-driving car. To teach it to navigate the road, you'll need data! What kind of data would be useful?
o Images: Pictures of roads, traffic signs, pedestrians, and other objects.
o Sensor Data: Information from LiDAR, radar, and cameras to measure distances and surroundings.

Sample Data Augmentation


• Data augmentation means increasing the amount of data by adding copies of existing data
with small changes.
• The subject of the image does not change, but we get new data from the same image by changing parameters such as color and brightness.
• New data is added by slightly changing the existing data.
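
A minimal sketch of the idea, assuming a tiny grayscale image stored as a nested list of pixel values from 0 to 255. The function names adjust_brightness and augment, the pixel values, and the brightness shifts are all hypothetical choices for illustration; real projects normally use an image library instead.

```python
def adjust_brightness(image, delta):
    """Return a copy of the image with every pixel shifted by delta.

    Values are clamped so they stay in the valid 0..255 range.
    """
    return [[max(0, min(255, pixel + delta)) for pixel in row] for row in image]


def augment(image, deltas):
    """Return the original image plus one brightness-shifted copy per delta."""
    return [image] + [adjust_brightness(image, d) for d in deltas]


# A hypothetical 2 x 3 grayscale image.
original = [
    [10, 120, 250],
    [0, 60, 200],
]

# One darker copy and one brighter copy: one image becomes three.
augmented = augment(original, deltas=[-30, 30])
print(len(augmented))
```

The picture content stays the same in every copy; only the brightness parameter changes, which is what turns one sample into several.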

Acquiring Data – Sample Data Generation


• Data generation refers to generating or recording data using sensors.
• Recording temperature readings of a building is an example of data generation.
• Recorded data is stored in a computer in a suitable form.
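
The temperature example can be sketched as follows. No real sensor is used: read_temperature is a hypothetical stand-in that returns simulated readings, and CSV text is only one possible "suitable form" for storage. All names, the value range, and the number of readings are assumptions.

```python
import csv
import io
import random


def read_temperature(rng):
    """Stand-in for a real sensor: return a simulated reading in degrees Celsius."""
    return round(rng.uniform(20.0, 30.0), 1)


def record_readings(count, rng):
    """Collect `count` readings as (reading_number, temperature) rows."""
    return [(i + 1, read_temperature(rng)) for i in range(count)]


def to_csv(rows):
    """Store the recorded rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["reading", "temperature_c"])
    writer.writerows(rows)
    return buffer.getvalue()


rng = random.Random(0)  # fixed seed so the simulated readings repeat
rows = record_readings(5, rng)
csv_text = to_csv(rows)
print(csv_text)
```

With a real sensor, only read_temperature would change; the recording and storage steps stay the same.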

Sources of Data: Various Sources for Acquiring Data

· Primary Data Sources: Primary data is collected first-hand. Sources include surveys, interviews, experiments, etc. Data generated from an experiment is an example of primary data.

· Secondary Data Sources: Secondary data collection obtains information from external sources, rather than generating it personally. Some sources for secondary data collection include:

Checklist of factors that make data good or bad