[ISP610] Lesson 1 - Introduction to data analytics _Mdm Ezza2024
Chapter 1:
Introduction to Data Analytics
“AI is the new electricity”
“Data is the new oil” (Clive Humby)
1.1 Definition of data analytics
October 24 11
How Big is Big Data
“From the megabyte (10^6 bytes) of data to the brontobyte (10^27 bytes) and
geopbyte (10^30 bytes), these measurements will be used to
describe the tremendous digital pool
formed by the IoT platform.
Cisco-IBSG predicts more than 50 billion
devices connected to the internet by 2020, and 75 billion
IoT devices by 2025.
By 2025, it’s estimated that 463 exabytes of data will
be created each day globally – that’s the equivalent of
212,765,957 DVDs per day.”
- Almiani, 2020
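The unit sizes above can be checked with a few lines of arithmetic. This is a minimal sketch, not part of the quoted source: the names `UNIT_EXPONENTS`, `to_bytes` and `dvd_equivalent` are hypothetical, and the 4.7 GB single-layer DVD capacity is an assumption, since the quote does not state which capacity it uses. Under that assumption, 212,765,957 DVDs works out to about one exabyte, so the quoted DVD count may describe a single exabyte rather than all 463.

```python
# Decimal (power-of-ten) exponents for the units named in the quote.
# "brontobyte" and "geopbyte" are informal names, not official SI prefixes.
UNIT_EXPONENTS = {
    "megabyte": 6,
    "gigabyte": 9,
    "terabyte": 12,
    "petabyte": 15,
    "exabyte": 18,
    "zettabyte": 21,
    "yottabyte": 24,
    "brontobyte": 27,
    "geopbyte": 30,
}

# Assumption: a single-layer DVD holds 4.7 GB (not stated in the source).
DVD_CAPACITY_BYTES = 4_700_000_000


def to_bytes(amount: int, unit: str) -> int:
    """Convert a whole number of the given unit into bytes."""
    return amount * 10 ** UNIT_EXPONENTS[unit]


def dvd_equivalent(amount: int, unit: str) -> int:
    """Number of DVDs the given volume would fill completely (rounded down)."""
    return to_bytes(amount, unit) // DVD_CAPACITY_BYTES


if __name__ == "__main__":
    # How many megabytes fit in one brontobyte.
    print(to_bytes(1, "brontobyte") // to_bytes(1, "megabyte"))
    # DVD equivalents of one exabyte and of 463 exabytes.
    print(dvd_equivalent(1, "exabyte"))
    print(dvd_equivalent(463, "exabyte"))
```

Integer arithmetic is used throughout so that very large values such as 10^30 stay exact.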
Characteristics of Big Data (the V’s): Volume, Variety, Value
• 9 V’s (Owais, 2016)
• 10 V’s (Data Science Central)
• 17 V’s (Panimalar, 2017)
• Types of Data
– Text Data (Web)
– Semi-structured Data (XML)
– Graph Data
– Streaming Data
https://www.zoomdata.com/solutions/modern-bi/streaming-analytics/
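The data types listed above can be illustrated with one small value each. This is only a sketch using the Python standard library: every sample value (the sentence, the student record, the graph, the sensor readings) is hypothetical and invented for illustration.

```python
import xml.etree.ElementTree as ET
from typing import Iterator

# Text data: free-form content, e.g. the visible text of a web page.
text_data = "Data analytics turns raw data into insight."
word_count = len(text_data.split())

# Semi-structured data: XML carries its own tags but no fixed table schema.
# The record below is a made-up example.
xml_data = "<student><name>Aina</name><course>ISP610</course></student>"
root = ET.fromstring(xml_data)
course = root.find("course").text

# Graph data: nodes and edges, stored here as an adjacency list.
graph_data = {"A": ["B", "C"], "B": ["C"], "C": []}
edge_count = sum(len(neighbours) for neighbours in graph_data.values())


# Streaming data: records arrive one at a time and are processed on the fly.
def sensor_stream() -> Iterator[float]:
    """Yield hypothetical sensor readings one by one."""
    for reading in (21.5, 22.0, 22.4):
        yield reading


running_total = 0.0
running_average = 0.0
for count, reading in enumerate(sensor_stream(), start=1):
    running_total += reading
    # The average is updated after every record, without storing the stream.
    running_average = running_total / count
```

The streaming part keeps only a running total and a count, which is the usual reason streaming data is handled differently from data at rest.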
Data Sources
• Data come from multiple sources, including:
– Medical Information, such as genomic sequencing and MRIs
– Increased use of broadband on the Web – including the 2 billion photos each
month that Facebook users currently upload as well as the innumerable videos
uploaded to YouTube and other multimedia sites.
– Video surveillance.
– Increased global use of mobile devices – the torrent of texting is not likely to
cease.
– Smart devices – sensor-based collection of information from smart electric
grids, smart buildings and many other public and industry infrastructure.
– Non-traditional IT devices – including the use of RFID readers, GPS navigation
systems, and seismic processing.
Data
VALUE!
Public Services Data
Life Sciences Data
ChatGPT / Large Language Model
• TASK:
Find the value of data in the context of Large Language
Models
– Situation
– Use of Data
– Key Outcomes
Examples of applications. More…
• Netflix
• https://www.edureka.co/blog/data-science-applications/
Competitive Advantage
Competitive Advantage: Airlines
• Call centres, for instance, can be made more effective and efficient by
capitalizing on what the company can know about the caller ahead of
time. And airlines have for years been able to route premium-status fliers
to higher-level customer service representatives by recognizing their caller
IDs. Now they can do even more: By making a quick correlation between
your ID, your booked flights and the status of those flights, they may be
able to determine why you’re calling, even before the second ring. If your
next flight has just been delayed, the representative could answer the
phone with a pretty good idea of why you’re calling. More in-depth
analysis could correlate your ID with your social media presence. If you’ve
just tweeted an irate message about being booted from a flight, the rep
answering your call may have already read it.
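The call-routing idea above can be sketched as a simple lookup that joins caller ID, bookings and flight status. This is a minimal illustration under stated assumptions: the tables, phone number, flight number and function names are all hypothetical, and a real airline would query its own reservation and operations systems instead.

```python
from typing import Optional

# Hypothetical lookup tables, invented for illustration only.
BOOKINGS = {"+60100000001": ["ZZ101"]}   # caller ID -> booked flight numbers
FLIGHT_STATUS = {"ZZ101": "delayed"}     # flight number -> current status
PREMIUM_CALLERS = {"+60100000001"}       # caller IDs with premium status


def likely_call_reason(caller_id: str) -> Optional[str]:
    """Guess why a caller is phoning by correlating ID, bookings and status."""
    for flight in BOOKINGS.get(caller_id, []):
        status = FLIGHT_STATUS.get(flight)
        if status in ("delayed", "cancelled"):
            return f"flight {flight} is {status}"
    return None  # nothing unusual found for this caller


def route_call(caller_id: str) -> dict:
    """Pick a service queue and attach the likely reason for the call."""
    queue = "premium" if caller_id in PREMIUM_CALLERS else "standard"
    return {"queue": queue, "likely_reason": likely_call_reason(caller_id)}
```

For example, `route_call("+60100000001")` would place the caller in the premium queue with the delayed flight attached, while an unknown number falls back to the standard queue with no reason.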
What’s Driving Analytics
in Organisations?
Analytics
• More than just OLTP-based MIS reporting.
• Rather than producing only standard reports on routine business areas,
organisations can apply advanced analytical techniques to optimize processes
and derive more value from these typical tasks.
• Analytics examines large amounts of data to uncover hidden patterns,
correlations and other insights.
• Analytics helps organisations make more accurate decisions when faced
with problems.
• Analytics helps organisations harness their data and use it to identify new
opportunities. That, in turn, leads to smarter business moves, more
efficient operations, higher profits and happier customers.
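The difference between standard reporting and analytics can be shown on a tiny data set. This is a sketch only: the monthly figures and the names `ad_spend`, `sales` and `pearson` are hypothetical, and Pearson correlation is used here as one simple example of "uncovering correlations", not as the only analytical technique.

```python
from math import sqrt

# Hypothetical monthly figures, invented purely for illustration.
ad_spend = [10, 12, 15, 18, 20, 25]
sales = [110, 118, 135, 150, 162, 190]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Standard reporting: summarise what happened.
total_sales = sum(sales)

# Analytics: look for a relationship between two variables.
r = pearson(ad_spend, sales)

if __name__ == "__main__":
    print("Total sales:", total_sales)
    print("Correlation between ad spend and sales:", round(r, 3))
```

A value of `r` close to 1 suggests the two series move together. Correlation alone does not show that one causes the other.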
WHO ARE THE PEOPLE INVOLVED IN
BUSINESS DATA ANALYTICS
AND
WHAT IS YOUR ROLE?
YOU
SKILL SET
YOU
• Quantitative skills, such as mathematics or statistics.
• Technical aptitude, such as software engineering, machine
learning and programming skills.
• Sceptical. This may seem a counterintuitive trait, but it is
important that data scientists can examine their work critically
rather than in a one-sided way.
• Curious & Creative. Must be passionate about data and finding
creative ways to solve problems and portray information.
• Communicative & Collaborative: It is not enough to have strong
quantitative skills or engineering skills. To make a project
resonate, you must be able to articulate the business value in a clear
way, and work collaboratively with project sponsors and key
stakeholders.
1.5 Data Science
What is Data Science?
• Data Understanding: In CRISP-DM, data understanding involves exploring and familiarizing oneself with the available data sources. Similarly, data
analytics and data science projects require an understanding of the data landscape, including data quality, structure, and relationships, to inform the
analysis and modeling processes.
• Data Preparation: Data preparation in CRISP-DM involves cleaning, transforming, and integrating data to make it suitable for analysis. This aligns with
the data preprocessing phase in data analytics and data science, where raw data is refined and prepared for analysis using techniques such as data
cleaning, normalization, and feature engineering.
• Modeling: The modeling phase in CRISP-DM involves building and evaluating predictive models to address the project objectives. This corresponds to
the modeling phase in data science, where various algorithms and techniques are applied to develop models that can uncover patterns, make
predictions, or generate insights from the data.
• Evaluation: CRISP-DM emphasizes the importance of evaluating model performance and validating results to ensure they meet the project
objectives. Similarly, data analytics and data science projects involve assessing the accuracy, reliability, and relevance of the analysis results to ensure
they provide actionable insights and value to the business.
• Deployment: The deployment phase in CRISP-DM focuses on implementing the data analytics/data science solution and integrating it into business
processes. In data analytics and data science, this translates to deploying analytical models, dashboards, or reports to stakeholders and
decision-makers, enabling them to use the insights generated to inform their actions and decisions.
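The five phases listed above can be walked through on a toy data set. This is a minimal sketch under stated assumptions, not a prescribed CRISP-DM implementation: the records (hours studied, exam score) are hypothetical, the model is a plain least-squares line chosen for simplicity, and names such as `fit_line` and `predict_score` are invented for illustration.

```python
from statistics import mean

# Hypothetical raw records: (hours studied, exam score). None = missing value.
raw = [(1, 52), (2, 55), (None, 60), (4, 65), (5, 71), (6, 74), (7, 80), (8, 83)]

# Data understanding: check how much data there is and how clean it is.
missing = sum(1 for hours, score in raw if hours is None or score is None)

# Data preparation: drop incomplete rows, then split into train and test sets.
clean = [(h, s) for h, s in raw if h is not None and s is not None]
train, test = clean[:5], clean[5:]


# Modeling: fit a least-squares line, score = slope * hours + intercept.
def fit_line(points):
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    mean_x, mean_y = mean(xs), mean(ys)
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in points)
             / sum((x - mean_x) ** 2 for x in xs))
    return slope, mean_y - slope * mean_x


slope, intercept = fit_line(train)

# Evaluation: mean absolute error on rows the model has not seen.
mae = mean(abs(slope * x + intercept - y) for x, y in test)


# Deployment: expose the model as a function that others can call.
def predict_score(hours: float) -> float:
    return slope * hours + intercept


if __name__ == "__main__":
    print("Rows with missing values:", missing)
    print("Mean absolute error on test rows:", round(mae, 2))
    print("Predicted score for 9 hours:", round(predict_score(9), 1))
```

In a real project each phase would be far larger, and the results of evaluation often send the team back to data preparation or modeling before anything is deployed.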