0% found this document useful (0 votes)
9 views

Data Categories-3 _ Lesson 1 _ Data Analytics TEVTA 2024 _ certiportlearning

Big data refers to large, complex datasets that traditional data management systems struggle to handle, characterized by the 6Vs: volume, variety, velocity, veracity, variability, and value. Various tools and frameworks, such as Apache Hadoop, NoSQL databases, and Apache Spark, have been developed to store and process big data effectively. Analyzing big data can provide valuable insights for businesses to improve decision-making and address challenges.

Uploaded by

sccit.jng
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

Data Categories-3 _ Lesson 1 _ Data Analytics TEVTA 2024 _ certiportlearning

Big data refers to large, complex datasets that traditional data management systems struggle to handle, characterized by the 6Vs: volume, variety, velocity, veracity, variability, and value. Various tools and frameworks, such as Apache Hadoop, NoSQL databases, and Apache Spark, have been developed to store and process big data effectively. Analyzing big data can provide valuable insights for businesses to improve decision-making and address challenges.

Uploaded by

sccit.jng
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Previous Next

Big Data
Bookmark this page

Big data refers to a collection of data having a huge volume and many varieties, and it grows exponentially with time. It is
so large in size and complexity that it becomes difficult for traditional data management systems like RDBMSs to store and
process efficiently. But these huge volumes of data can be analyzed to get insights that help companies address many
business problems and make better business decisions.

Big data has three basic characteristics - volume, variety, and velocity, usually referred to as 3Vs because they all begin with
the letter V. But there have been additional Vs added over time to define the characteristics of big data and more are being
discovered.

The following are the 6Vs commonly used to describe big data:

1. Volume refers to the huge volume of data that is difficult to store and manage using traditional data systems. Terabytes
of data are generated every day from social media applications and sensor-enabled devices.
2. Variety refers to the different varieties of data. The data can be structured, unstructured, or semi-structured data. It is
easy to store and process structured data in traditional data management systems like RDBMSs. But it is very difficult to
store and process the unstructured or semi-structured data in them.
3. Velocity refers to the speed at which big data is generated. Companies want to manage and analyze new data quickly to
get the required information.
4. Veracity refers to the degree to which big data can be trusted. It simply means the data should be good in quality. The
data should be clean and accurate and should not have missing values.
5. Variability refers to data that is extremely variable and keeps on changing always.
6. Value refers to the business value of big data. There is no advantage of storing huge volumes and different varieties of
data if some business value cannot be derived from it.

With the rise of big data, many tools and frameworks have been created to store and process big data.

The following are examples of tools and frameworks for storing and processing big data:

Apache Hadoop: Apache Hadoop is an open-source framework used to store and analyze big data. It has two main
components: HDFS (Hadoop Distributed File System) for storage and MapReduce for processing.
NoSQL databases: NoSQL stands for “Not only SQL”. NoSQL databases are non-tabular databases that have flexible
schemas and provide a different mechanism for the storage and retrieval of data than RDBMS tables.
Apache Spark: Apache Spark is an open-source unified analytics engine for processing large-scale data.

Watch this video by Dursun Delen to learn about big data and its sources.

What Is Big Data and Where Does It Come From?


Start of transcript. Skip to the end.

You might also like