0% found this document useful (0 votes)
8 views

SAS - Assignment-01-2001

The document outlines key sources of big data, including social media, IoT devices, and healthcare systems. It highlights challenges such as volume, velocity, security, and cost in managing this data. The goals of processing big data include generating insights, making data-driven decisions, and enhancing personalization.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views

SAS - Assignment-01-2001

The document outlines key sources of big data, including social media, IoT devices, and healthcare systems. It highlights challenges such as volume, velocity, security, and cost in managing this data. The goals of processing big data include generating insights, making data-driven decisions, and enhancing personalization.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

1.

Sources that produce Huge Data : Data is being produced at an


unprecedented scale from a variety of sources. The key sources are:
Social Media Platforms: Billions of users generate text, images, videos, and
interactions Example : Facebook, Instagram, Twitter.
IoT Devices: Sensors, wearables, smart homes, and industrial IoT devices
produce continuous streams of data.
E-commerce Platforms: User transactions, browsing patterns, reviews, and
recommendation engines Example : Amazon
Healthcare Systems: MRI/CT scans, and wearable health monitors.
Financial Systems: Stock markets, banking transactions, credit card usage, and
blockchain networks.

2. Challenges in Maintaining the Data

Volume: Storing petabytes/exabytes of data requires scalable infrastructure.


Velocity: Real-time data streams Example : social media, feeds demand high-
speed processing frameworks example : Apache Kafka.
Variety: Structured (databases), semi-structured and unstructured (images,
videos) data need flexible processing.
Veracity: Ensuring data quality, accuracy, and reliability Example : handling
noise, missing values, or biases.
Security & Privacy: Ensuring sensitive information is not breached and
anonymizing personal data.
Cost: High storage, compute, and personnel expenses (data engineers,
scientists).
Integration: Integrate data from different sources, such as legacy systems, APIs,
and cloud platforms.
Scalability: Ensure performance scales with data (horizontal scaling vs. vertical
scaling).
3. Goals of Processing Big Data

Generation of insights. Discover patterns, trends, or correlations (customer


behavior, fraud detection).
Making decisions. It will help make data-driven decisions for businesses,
governments, and research (predictive analytics).
Automation. Develop machine learning models for image recognition, chatbots,
or self-driving systems.
Optimization. Enhance operations (supply chain, energy consumption) and
resource utilization.
Personalization. Delivers tailored experiences, such as recommendations on
what to watch Example : Netflix, where to go on vacation, and which
advertisements to show.

Summary
Sources: IoT, social media, healthcare, and financial systems are major
contributors.
Challenges: Volume, velocity, security, and cost are critical hurdles.
Objectives: Focus on insights, automation, optimization, and compliance.

You might also like