data_analytics
Unit 1:
Syllabus:
ITECH WORLD AKTU
Key Points:
• Outcome: Generates insights for strategic decisions in various domains like business, healthcare, and technology.
• Tools: Includes Python, R, Excel, and specialized tools like Tableau and Power BI.
Example: A retail store uses data analytics to identify customer buying patterns
and optimize inventory management, ensuring popular products are always in stock.
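The retail example above can be sketched in a few lines of Python. This is a minimal illustration, not a production method: the `sales` records, the `stock` levels, and the rule "restock when recent sales exceed stock on hand" are all hypothetical assumptions chosen to keep the example small.

```python
from collections import Counter

# Hypothetical purchase records: (product, quantity sold). Illustrative only.
sales = [
    ("milk", 3), ("bread", 2), ("milk", 4),
    ("eggs", 1), ("bread", 5), ("milk", 2),
]

# Hypothetical stock currently on the shelves.
stock = {"milk": 5, "bread": 10, "eggs": 12}


def units_sold(records):
    """Return the total units sold per product."""
    totals = Counter()
    for product, quantity in records:
        totals[product] += quantity
    return totals


def restock_candidates(records, stock_levels):
    """Return products whose recent sales exceed the stock on hand."""
    totals = units_sold(records)
    return sorted(
        product
        for product, sold in totals.items()
        if sold > stock_levels.get(product, 0)
    )


top_sellers = units_sold(sales).most_common(2)
to_restock = restock_candidates(sales, stock)
print("Top sellers:", top_sellers)
print("Restock:", to_restock)
```

In practice the same idea is usually applied to far larger datasets with tools such as Python data libraries, R, or a BI dashboard.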
1. Social Data:
2. Machine-Generated Data:
• Sensors and IoT Devices: Data from devices like thermostats, smartwatches, and industrial sensors.
• Log Data: Records of system activities, such as server logs and application
usage.
• GPS Data: Location information generated by devices like smartphones and
vehicles.
• Telemetry Data: Remote data transmitted from devices, such as satellites
and drones.
3. Transactional Data:
Example:
• A social media platform like Twitter generates vast amounts of social data from
tweets, hashtags, and mentions.
• Machine-generated data from GPS in delivery trucks helps optimize routes and
reduce costs.
• A retail store’s transactional data tracks customer purchases and identifies high-demand products.
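To make machine-generated log data more concrete, the sketch below parses a few log lines and counts entries by severity. The log format, the field names, and the sample lines are hypothetical; real server logs follow whatever format the system is configured to write.

```python
import re
from collections import Counter

# Hypothetical log lines in a made-up "timestamp LEVEL user=.. action=.." format.
LOG_LINES = [
    "2024-01-05T10:00:01 INFO user=42 action=login",
    "2024-01-05T10:00:07 ERROR user=17 action=checkout",
    "2024-01-05T10:01:13 INFO user=42 action=view_item",
    "not a valid log line",
]

LINE_PATTERN = re.compile(
    r"^(?P<timestamp>\S+) (?P<level>[A-Z]+) "
    r"user=(?P<user>\d+) action=(?P<action>\w+)$"
)


def parse_line(line):
    """Turn one log line into a dict, or return None if it does not match."""
    match = LINE_PATTERN.match(line)
    if match is None:
        return None
    record = match.groupdict()
    record["user"] = int(record["user"])
    return record


# Keep only the lines that match the assumed format.
records = [r for r in map(parse_line, LOG_LINES) if r is not None]
level_counts = Counter(r["level"] for r in records)
print(level_counts)
```

Lines that do not match the pattern are skipped rather than guessed at, which keeps malformed entries out of later analysis.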
• Structured Data: Data that is organized in a tabular format with rows and
columns. It follows a fixed schema, making it easy to query and analyze.
• Semi-Structured Data: Data that does not have a rigid structure but contains
tags or markers to separate elements. It lies between structured and unstructured
data.
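The difference between the two forms can be shown with Python's standard `csv` and `json` modules. The sample records are invented for illustration: the CSV text has the same columns in every row, while the JSON records carry their own keys and may differ from one record to the next.

```python
import csv
import io
import json

# Structured: every row has the same columns (fixed schema). Sample data only.
CSV_TEXT = "id,name,city\n1,Asha,Lucknow\n2,Ravi,Kanpur\n"

# Semi-structured: keys act as markers, but records need not share them.
JSON_TEXT = """
[
  {"id": 1, "name": "Asha", "tags": ["new"]},
  {"id": 2, "name": "Ravi", "address": {"city": "Kanpur"}}
]
"""

rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))
records = json.loads(JSON_TEXT)

# Compare the set of fields present in each row or record.
csv_fields = [set(row) for row in rows]
json_fields = [set(record) for record in records]

same_schema_csv = all(fields == csv_fields[0] for fields in csv_fields)
same_schema_json = all(fields == json_fields[0] for fields in json_fields)

print("CSV rows share one schema:", same_schema_csv)
print("JSON records share one schema:", same_schema_json)
```

The nested `address` object in the second JSON record is typical of semi-structured data: it is easy to read by key but does not fit a flat table without extra transformation.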
Comparison Table:
• Volume: Refers to the sheer amount of data generated. Modern data systems
must handle terabytes or even petabytes of data.
• Velocity: Refers to the speed at which data is generated and processed. Real-time
data processing is crucial for timely insights.
– Example: Stock market systems process millions of trades per second to provide real-time updates.
• Variety: Refers to the different types and formats of data, including structured,
semi-structured, and unstructured data.
• Veracity: Refers to the quality and reliability of the data. High veracity ensures
data accuracy, consistency, and trustworthiness.
– Example: Data from unreliable sources or with missing values can lead to
incorrect insights.
Real-Life Scenario: Social media platforms like Twitter deal with high Volume
(millions of tweets daily), high Velocity (real-time updates), high Variety (text, images,
videos), and mixed Veracity (authentic and fake information).
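Veracity is the characteristic that is easiest to check in code. The following sketch counts incomplete and duplicate records in a small dataset. The records, the `id` key, and the definition of "missing" (a value of `None` or an empty string) are assumptions made for this example.

```python
# Hypothetical records; None and "" are treated as missing values.
records = [
    {"id": 1, "age": 34, "city": "Delhi"},
    {"id": 2, "age": None, "city": "Agra"},
    {"id": 3, "age": 29, "city": ""},
    {"id": 1, "age": 34, "city": "Delhi"},  # repeated id
]


def is_missing(value):
    """Return True for values this sketch treats as missing."""
    return value is None or value == ""


def quality_report(rows, key="id"):
    """Count incomplete rows and rows whose key was already seen."""
    total = len(rows)
    incomplete = sum(
        1 for row in rows if any(is_missing(v) for v in row.values())
    )
    seen = set()
    duplicates = 0
    for row in rows:
        if row[key] in seen:
            duplicates += 1
        else:
            seen.add(row[key])
    complete_ratio = (total - incomplete) / total if total else 0.0
    return {
        "total": total,
        "incomplete": incomplete,
        "duplicates": duplicates,
        "complete_ratio": complete_ratio,
    }


report = quality_report(records)
print(report)
```

A low complete ratio or a high duplicate count is a signal to clean the data before drawing conclusions from it.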
ciently manage. These platforms enable businesses and organizations to derive meaningful
insights from large-scale and diverse data.
Key Features of Big Data Platforms:
• Hadoop:
• Spark:
• NoSQL Databases:
• Data Preparation: Collecting, cleaning, and transforming data into usable formats.
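A data preparation step might look like the sketch below. It is only one possible approach: the raw records, the two accepted date formats, and the decision to drop rows without an amount are hypothetical choices made for illustration.

```python
from datetime import datetime

# Hypothetical raw records with typical problems: stray spaces,
# thousands separators, a missing amount, and mixed date formats.
RAW = [
    {"name": "  Asha ", "amount": "1,200", "date": "2024-01-05"},
    {"name": "ravi", "amount": "", "date": "2024-01-06"},
    {"name": "Meena", "amount": "350", "date": "06/01/2024"},
]

# Date formats this sketch accepts (an assumption, not a standard).
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y")


def parse_date(text):
    """Try each accepted format; return a date, or None if none match."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    return None


def clean(record):
    """Return a cleaned record, or None if the amount is missing."""
    amount_text = record["amount"].replace(",", "").strip()
    if not amount_text:
        return None
    return {
        "name": record["name"].strip().title(),
        "amount": float(amount_text),
        "date": parse_date(record["date"]),
    }


prepared = [c for c in map(clean, RAW) if c is not None]
for row in prepared:
    print(row)
```

Whether to drop, flag, or fill incomplete rows depends on the project; dropping them is simply the shortest option to show here.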
[Figure: 1681499276138.png]
• Optimizes Resource Usage: The lifecycle ensures efficient use of resources, such
as time, tools, and personnel. By organizing tasks in a structured way, projects are
completed more efficiently, avoiding wasted effort and resources.
• Data Scientist:
– A data scientist is responsible for analyzing and interpreting complex data to
extract meaningful insights.
– They design and build models to forecast trends, make predictions, and identify
patterns within data.
– Data scientists use machine learning algorithms, statistical models, and advanced analytics techniques to solve business problems.
– Example: A data scientist develops a predictive model to forecast customer
churn based on historical data and trends.
• Data Engineer:
– A data engineer is responsible for designing, constructing, and maintaining the
systems and infrastructure that collect, store, and process data.
– They ensure that data pipelines are efficient, scalable, and capable of handling
large volumes of data.
– Data engineers work closely with data scientists to ensure the availability of
clean and well-structured data for analysis.
– Example: A data engineer designs and implements a data pipeline that extracts real-time transactional data from an e-commerce platform and stores it in a data warehouse.
• Business Analyst:
– A business analyst bridges the gap between the technical team (data scientists
and engineers) and business stakeholders.
– They are responsible for understanding the business problem and translating
it into actionable data-driven solutions.
– Business analysts also interpret the results of data analysis and communicate
them in a way that is understandable for non-technical stakeholders.
– Example: A business analyst analyzes customer feedback data and interprets
the results to help the marketing team refine their targeting strategy.
• Project Manager:
Example: A retail company builds a model to predict customer churn and integrates it
into their CRM system.
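As a rough illustration of the churn example, the sketch below trains a very small logistic regression model with plain gradient descent. Logistic regression is used here only as a simple stand-in; the notes do not say which model the retail company would choose. The training data, the two features, the scaling constants, and the function name `churn_probability` are all hypothetical.

```python
import math

# Hypothetical training data:
# ((months since last purchase, support complaints), churned?)
CUSTOMERS = [
    ((1, 0), 0), ((2, 0), 0), ((1, 1), 0),
    ((8, 3), 1), ((9, 2), 1), ((10, 4), 1),
]

# Assumed scaling constants that keep the sample features within [0, 1].
MAX_MONTHS = 12.0
MAX_COMPLAINTS = 5.0


def scale(months, complaints):
    """Scale raw features by the assumed maximum values."""
    return (months / MAX_MONTHS, complaints / MAX_COMPLAINTS)


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train(data, learning_rate=0.5, epochs=2000):
    """Fit weights and bias with batch gradient descent on log loss."""
    weights = [0.0, 0.0]
    bias = 0.0
    n = len(data)
    for _ in range(epochs):
        grad_w = [0.0, 0.0]
        grad_b = 0.0
        for (months, complaints), label in data:
            x = scale(months, complaints)
            predicted = sigmoid(weights[0] * x[0] + weights[1] * x[1] + bias)
            error = predicted - label
            grad_w[0] += error * x[0]
            grad_w[1] += error * x[1]
            grad_b += error
        weights[0] -= learning_rate * grad_w[0] / n
        weights[1] -= learning_rate * grad_w[1] / n
        bias -= learning_rate * grad_b / n
    return weights, bias


WEIGHTS, BIAS = train(CUSTOMERS)


def churn_probability(months, complaints):
    """Estimated probability that a customer churns."""
    x = scale(months, complaints)
    return sigmoid(WEIGHTS[0] * x[0] + WEIGHTS[1] * x[1] + BIAS)


print(round(churn_probability(9, 3), 2))
print(round(churn_probability(1, 0), 2))
```

Integration with a CRM system would typically mean calling a function like `churn_probability` for each customer record and storing the score, so that staff can follow up with customers who are most at risk.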
[Figure: 1681499276138.png]