10 - (Module-6) Data Generation, Data Gathering-07-03-2023
10 - (Module-6) Data Generation, Data Gathering-07-03-2023
✓ Data generation,
✓ Data gathering,
✓ Data Pre-processing,
✓ Data analyzation,
✓ Application of analytics,
✓ Vertical-specific algorithms,
✓ Exploratory Data Analysis.
Data Generation
✓ Data generation is the beginning of big data.
✓ Some current sources of big data, such as trading data, mobile data, user
behavior, sensing data, Internet data, and other sources that are usually ignored.
✓ For example, nowadays Internet data has become a major source of big data
where huge amounts of data in terms of searching entries, chatting records, and
microblog messages are produced every day.
✓ Such data are closely related to people's daily lives, and may contain users’
behavior.
✓ For individuals, the data seems valueless; however, useful information
including user habits and hobbies can be determined and collected through the
exploitation of such accumulated big data.
✓ Big data even makes it possible to predict users’ behaviors and emotional
moods.
✓ Internet data is one of the most successful data sources utilized by
many Internet companies to generate user portraits and provide
personalized recommendation services.
✓ Other main sources of big data include the operation and trading
information in enterprises, logistic and sensing information in the
Internet of Things (IoT) networks, human interaction information,
position information in the Internet world, etc.
✓ In addition, digital telescopes also generate massive data ranging
from hundreds of GB to tens of TB or even larger, which is a
rising source of big data. (Astrophotography)
Some of the data collecting/gathering sources:
✓ Collecting new data from internet and other sources
✓ Using the previously collected and stored data
✓ Reusing someone else’s data
✓ Purchasing data