
Data Processing 1

Data processing involves 6 key stages: 1) data collection from various sources, 2) data preparation to clean and organize the raw data, 3) data input into systems using a compatible language, 4) processing the data using algorithms, 5) outputting and interpreting the data into usable formats like graphs and text, and 6) storage of the processed data for future use. The overall goal is to take raw data and convert it into meaningful and accessible information through these stages.

Uploaded by

Aryan Xworld007
Copyright
© All Rights Reserved

What is data processing?

Data processing occurs when data is collected and translated into usable information.
It is usually performed by a data scientist or a team of data scientists, and it must be
done correctly so as not to negatively affect the end product, or data output.

Data processing starts with data in its raw form and converts it into a more readable
format (graphs, documents, etc.), giving it the form and context necessary to be
interpreted by computers and utilized by employees throughout an organization.

Six stages of data processing


1. Data collection

Collecting data is the first step in data processing. Data is pulled from available sources. It
is important that the data sources available are trustworthy and well-built so the data
collected (and later used as information) is of the highest possible quality.
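As a minimal sketch of collection, the snippet below pulls records from two sources in different formats and tags each record with where it came from, so quality problems can later be traced back to a source. The source names (`billing_export`, `web_api`), the field names, and the inline sample data are assumptions made up for illustration; they are not part of the original text.

```python
import csv
import io
import json

# Hypothetical raw inputs standing in for two different data sources.
CSV_SOURCE = "id,name,amount\n1,Ada,10.5\n2,Grace,7\n"
JSON_SOURCE = '[{"id": 3, "name": "Alan", "amount": 12.0}]'


def collect_from_csv(text, source_name):
    """Read CSV text into a list of dicts, tagging each row with its source."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        record = dict(row)
        record["source"] = source_name
        rows.append(record)
    return rows


def collect_from_json(text, source_name):
    """Read a JSON array of objects, tagging each object with its source."""
    rows = []
    for item in json.loads(text):
        record = dict(item)
        record["source"] = source_name
        rows.append(record)
    return rows


# Raw, unvalidated records: CSV values are still strings at this point.
raw_records = (
    collect_from_csv(CSV_SOURCE, "billing_export")
    + collect_from_json(JSON_SOURCE, "web_api")
)
```

Note that nothing is cleaned here: the CSV values remain strings, and no record is checked or dropped. That work belongs to the preparation stage.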

2. Data preparation

Once the data is collected, it enters the data preparation stage. Data preparation,
often referred to as “pre-processing,” is the stage at which raw data is cleaned up and
organized for the following stage of data processing. During preparation, raw data is
checked carefully for errors. The purpose of this step is to eliminate bad data
(redundant, incomplete, or incorrect data) and begin to create high-quality data for the
best business intelligence.
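The three kinds of bad data named above can be illustrated with a small sketch. It assumes each record is a dictionary with hypothetical `id`, `name`, and `amount` fields, and it assumes that a negative amount counts as incorrect; real validation rules depend on the data set.

```python
def prepare(records):
    """Split raw records into (clean, rejected) lists.

    A record is rejected if it is incomplete (missing id, name, or a
    parseable amount), incorrect (negative amount, an assumed rule), or
    redundant (its id has already been seen).
    """
    clean, rejected, seen_ids = [], [], set()
    for record in records:
        record_id = record.get("id")
        name = (record.get("name") or "").strip()
        try:
            amount = float(record.get("amount"))
        except (TypeError, ValueError):
            amount = None

        if record_id is None or not name or amount is None:
            rejected.append((record, "incomplete or unparseable"))
        elif amount < 0:
            rejected.append((record, "incorrect: negative amount"))
        elif record_id in seen_ids:
            rejected.append((record, "redundant: duplicate id"))
        else:
            seen_ids.add(record_id)
            clean.append({"id": record_id, "name": name, "amount": amount})
    return clean, rejected


raw = [
    {"id": 1, "name": " Ada ", "amount": "10.5"},
    {"id": 1, "name": "Ada", "amount": "10.5"},  # duplicate id
    {"id": 2, "name": "", "amount": "3"},         # missing name
    {"id": 3, "name": "Alan", "amount": "-4"},    # negative amount
    {"id": 4, "name": "Grace", "amount": "7"},
]
clean, rejected = prepare(raw)
```

Keeping the rejected records together with a reason, rather than silently dropping them, makes it easier to judge whether a source is trustworthy.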

3. Data input

The clean data is then entered into its destination (perhaps a CRM like Salesforce or a data
warehouse like Redshift) and translated into a format that the destination system can
understand. Data input is the first stage in which raw data begins to take the form of
usable information.
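A rough sketch of this stage is shown below. An in-memory SQLite database from Python's standard library stands in for the destination; it is only a placeholder for a real CRM or warehouse, and the `payments` table and its columns are hypothetical. The "translation" here is the mapping of Python dictionaries onto typed table columns.

```python
import sqlite3

# Clean records as they might leave the preparation stage (sample data).
clean_records = [
    {"id": 1, "name": "Ada", "amount": 10.5},
    {"id": 4, "name": "Grace", "amount": 7.0},
]

# An in-memory database stands in for the real destination system.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE payments ("
    "id INTEGER PRIMARY KEY, name TEXT NOT NULL, amount REAL NOT NULL)"
)

# Named placeholders map each dictionary key to a column and avoid
# building SQL strings by hand.
conn.executemany(
    "INSERT INTO payments (id, name, amount) VALUES (:id, :name, :amount)",
    clean_records,
)
conn.commit()

# The destination can now answer questions in its own query language.
row_count = conn.execute("SELECT COUNT(*) FROM payments").fetchone()[0]
total = conn.execute("SELECT SUM(amount) FROM payments").fetchone()[0]
```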

4. Processing

During this stage, the data entered into the computer in the previous stage is actually
processed for interpretation. Processing is often done using machine learning algorithms,
though the process itself may vary slightly depending on the source of the data being
processed (data lakes, social networks, connected devices, etc.) and its intended use
(examining advertising patterns, medical diagnosis from connected devices, determining
customer needs, etc.).
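To make the processing step concrete, here is a small sketch that uses one of the simplest learning algorithms, a least-squares line fit, written in plain Python. It is an illustrative stand-in, not the method the text prescribes, and the advertising-spend and sales figures are invented sample values.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of co-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply a fitted (slope, intercept) model to a new x value."""
    slope, intercept = model
    return slope * x + intercept


# Invented sample data: advertising spend versus resulting sales.
ad_spend = [1.0, 2.0, 3.0, 4.0]
sales = [3.0, 5.0, 7.0, 9.0]

model = fit_line(ad_spend, sales)
forecast = predict(model, 5.0)
```

The fitted model is the processed result: it summarizes the pattern in the input data and can be applied to values that were never collected.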

5. Data output/interpretation

The output/interpretation stage is the stage at which data is finally usable to non-data
scientists. It is translated, readable, and often in the form of graphs, videos, images, plain
text, etc. Members of the company or institution can now begin to self-serve the data for
their own data analytics projects.
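As one possible illustration of readable output, the sketch below turns processed totals into a plain-text bar chart that needs no special tooling to read. The region names and values are hypothetical, and the function name `text_bar_chart` is introduced here only for the example.

```python
def text_bar_chart(totals, width=20):
    """Render a {label: value} mapping as a plain-text bar chart.

    The largest value gets a bar of `width` characters; other bars are
    scaled in proportion to it.
    """
    if not totals:
        return ""
    largest = max(totals.values())
    label_width = max(len(label) for label in totals)
    lines = []
    for label, value in totals.items():
        bar_length = round(width * value / largest) if largest > 0 else 0
        bar = "#" * bar_length
        lines.append(f"{label.ljust(label_width)} | {bar} {value:g}")
    return "\n".join(lines)


# Hypothetical processed results: sales totals per region.
totals = {"North": 40.0, "South": 20.0, "East": 10.0}
chart = text_bar_chart(totals)
print(chart)
```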

6. Data storage

The final stage of data processing is storage. After all of the data is processed, it is
stored for future use. While some information may be put to use immediately, much of it
will serve a purpose later on. In addition, properly stored data is a necessity for
compliance with data protection legislation like GDPR. When data is properly stored, it can
be quickly and easily accessed by members of the organization when needed.
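A minimal sketch of storing processed results and reading them back is shown below. It writes JSON to a temporary directory purely for demonstration; the file layout and the `results` structure are assumptions, and the example does not by itself address access control, retention, or any other requirement of data protection legislation.

```python
import json
import tempfile
from pathlib import Path


def store(results, path):
    """Write processed results to a JSON file, creating parent folders."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w", encoding="utf-8") as handle:
        # Sorted keys and indentation keep the file stable and readable.
        json.dump(results, handle, indent=2, sort_keys=True)


def load(path):
    """Read previously stored results back into Python objects."""
    with Path(path).open("r", encoding="utf-8") as handle:
        return json.load(handle)


# Hypothetical processed results to keep for later use.
results = {"totals": {"North": 40.0, "South": 20.0}, "record_count": 2}

# A temporary directory is used here only so the example cleans up
# after itself; real storage would use a durable location.
with tempfile.TemporaryDirectory() as tmp:
    target = Path(tmp) / "processed" / "results.json"
    store(results, target)
    restored = load(target)
```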
