1 Introduction to DA Course

The document outlines a course on Data Analytics offered by the Department of Data Science and Engineering, focusing on fundamental skills, data analysis, and visualization techniques. It discusses the importance of data analytics in various sectors, including business, finance, healthcare, and IoT, highlighting its role in optimizing performance and decision-making. The document also emphasizes the growing demand for data analysts and data scientists in the job market.

Uploaded by

Anant Barjatya

DS2101: INTRODUCTION TO DATA ANALYTICS [3 0 0 3]

DR. GINIKA MAHAJAN


Introduction to Course
• This course is offered by the Department of Data Science and Engineering for third-semester students. The core objective of this course is to use data extensively for learning and analysis. The applied aim of this subject is to understand the basic underlying concepts of data curation, transformation, pre-processing, and algorithms.
Course Outcomes
1. To develop the fundamental skills of data analytics by covering the basic data analytics life cycle.
2. To learn how to analyze massive datasets for use in applications.
3. To prepare the data in a meaningful and concise manner for better understanding.
4. To accomplish basic algorithms and methods for data analytics.
5. To use a variety of methods to visualize and convey different formats of data.
Assessment Plan:
Criteria                          Description                                   Maximum Marks
Internal Assessment (Summative)   Sessional Exam (Closed Book)                  30
Internal Assessment (Summative)   Research Project (Accumulated and Averaged)   30
End Term Exam (Summative)         End Term Exam (Closed Book)                   40
Total                                                                           100
DS2101: INTRODUCTION TO DATA ANALYTICS [3 0 0 3]

Syllabus:
Steps in Data Analytics Projects, Data Analytics tasks, and methods, Data Gathering and Preparation: Data Formats, Parsing and Transformation, Scalability and Real-time Issues; Data Cleaning: Consistency Checking, Heterogeneous and Missing Data, Data Transformation and Segmentation; Exploratory Analysis: Descriptive and comparative statistics, Hypothesis testing, Statistical Inference. Association rule mining, Clustering. Visualization: Visual Representation of Data, Gestalt Principles, Information Overloads; Creating Visual Representations: Visualization Reference Model, Visual Mapping, Visual Analytics, Design of Visualization Applications; Classification of Visualization Systems: Interaction and Visualization Techniques, Visualization of One, Two and Multi-Dimensional Data, Text and Text Documents; Visualization of Groups: Trees, Graphs, Clusters, Networks, Software, Metaphorical Visualization; Visualization of Volumetric Data: Vector Fields, Processes and Simulations, Visualization of Maps, Geographic Information, GIS systems, Collaborative Visualizations, Evaluating Visualizations; Recent Trends in Various Perception Techniques: Various Visualization Techniques, Data Structures used in Data Visualization.
References
1. Glenn J. Myatt, Wayne P. Johnson, Making Sense of Data I: A Practical Guide to Exploratory Data Analysis and Data Mining, 2nd Edition, John Wiley & Sons Publication, 2014.
2. Glenn J. Myatt, Wayne P. Johnson, Making Sense of Data II: A Practical Guide to Data Visualization, Advanced Data Mining Methods, and Applications, John Wiley & Sons Publication, 2009.
3. E. Tufte, The Visual Display of Quantitative Information, (2e), Graphics Press, 2007.
4. Jules J. Berman, Principles of Big Data: Preparing, Sharing, and Analyzing Complex Information, (2e), 2013.
Data & Data & Data…
• The upsurge of Big Data has brought along two other buzzwords in the industry, Data Science and Data Analytics. Today, the whole world contributes to massive data growth in colossal volumes, hence the name, Big Data.
• According to estimates cited by the World Economic Forum, the total volume of data in the digital universe was expected to reach 44 zettabytes by 2020, and by 2025 about 463 exabytes of data are expected to be created each day. (One zettabyte equals 1,000 exabytes.)
• Big Data includes everything: texts, emails, tweets, user searches (on search engines), social media chatter, data generated from IoT and connected devices. Basically, everything we do online.
• The data generated every day via the digital world is so vast and complex that traditional data processing and analysis systems cannot handle it.
• Hence the need for Data Science and Data Analytics.
What is Data Science
• Data Science is a field that deals with extracting meaningful information and insights by applying various algorithms, preprocessing, and scientific methods to structured and unstructured data.
• This field is related to Artificial Intelligence and is currently one of the most in-demand skills.
• Data science combines mathematics, computation, statistics, programming, etc. to gain meaningful insights from large amounts of data provided in various formats.
• While both fields involve working
with data to gain insights, data
science often involves using data
to build models that can predict
future outcomes, while data
analytics tends to focus more on
analyzing past data to inform
decisions in the present.
What is Data Analytics?
• Data analytics is the collection,
transformation, and organization
of data in order to draw
conclusions, make predictions,
and drive informed decision
making.
• Data Analytics is used to draw conclusions by processing raw data.
• It is helpful in various businesses as it helps the company to make decisions based on the conclusions from the data.
• Basically, data analytics helps to convert a large number of figures in the form of data into plain English, i.e., conclusions which are further helpful in making in-depth decisions.
What does a data analyst actually do?
• A data analyst reviews data to identify key insights into a business's customers and ways the data can be used to solve problems. They also communicate this information to company leadership and other stakeholders.
What is an example of data analytics?
• Descriptive analysis is one example. This type of analysis helps describe or summarize quantitative data by presenting statistics. For example, descriptive statistical analysis could show the distribution of sales across a group of employees and the average sales figure per employee.
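The sales example above can be sketched in a few lines of Python. This is only an illustration: the employee names and sales figures are hypothetical, and `describe_sales` is a helper name introduced here, not something from the course material.

```python
from statistics import mean, median, stdev

# Hypothetical monthly sales figures per employee (illustrative values only).
sales_by_employee = {
    "Asha": 120,
    "Bilal": 95,
    "Chen": 143,
    "Divya": 110,
    "Esha": 132,
}


def describe_sales(sales):
    """Return basic descriptive statistics for a mapping of employee -> sales."""
    values = list(sales.values())
    return {
        "count": len(values),
        "total": sum(values),
        "mean": mean(values),      # average sales figure per employee
        "median": median(values),  # middle value of the distribution
        "min": min(values),
        "max": max(values),
        "stdev": stdev(values),    # sample standard deviation (spread)
    }


summary = describe_sales(sales_by_employee)
for name, value in summary.items():
    print(f"{name}: {value}")
```

The mean answers "what is the average sales figure per employee", while the minimum, maximum, and standard deviation give a rough picture of how sales are distributed across the group.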
Understanding Data Analytics

• Data analytics is a broad term that encompasses many diverse types of data analysis.
• Any type of information can be subjected to
data analytics techniques to get insight that can
be used to improve things.
• Data analytics techniques can reveal trends and
metrics that would otherwise be lost in the mass
of information.
• This information can then be used to optimize
processes to increase the overall efficiency of a
business or system.
Job Demand
• Data analysts and data
scientists represent two of the
most in-demand, high-paying
jobs, alongside AI and machine
learning specialists and digital
transformation specialists,
according to the World
Economic Forum Future of Jobs
Report 2023.
What is the Role of Data Analytics?
• Data analysts exist at the intersection of information technology, statistics and
business. They combine these fields in
order to help businesses and organizations
succeed.
• The primary goal of a data analyst is to
increase efficiency and improve
performance by discovering patterns in
data.
Techniques to Data Science
Applications of Data Analytics
• For example, manufacturing companies often record the runtime, downtime, and work queue for various machines and then analyze the data to better plan the workloads so the machines operate closer to peak capacity.
• Data analytics can do much more than point out bottlenecks in production. Gaming companies use data analytics to set reward schedules for players that keep the majority of players active in the game. Content companies use many of the same data analytics to keep you clicking, watching, or re-organizing content to get another view or another click.
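As a rough sketch of the manufacturing example, the snippet below computes a utilization ratio from runtime and downtime and ranks machines from least to most utilized. The machine names, the hours, and the `utilization` helper are assumptions made for illustration; real plants would use richer measures.

```python
# Hypothetical log: hours of runtime and downtime per machine over one week.
machine_log = [
    {"machine": "press_1", "runtime_h": 120.0, "downtime_h": 48.0},
    {"machine": "lathe_2", "runtime_h": 150.0, "downtime_h": 18.0},
    {"machine": "mill_3", "runtime_h": 84.0, "downtime_h": 84.0},
]


def utilization(record):
    """Fraction of recorded time the machine was actually running."""
    total = record["runtime_h"] + record["downtime_h"]
    return record["runtime_h"] / total if total else 0.0


# Machines with the lowest utilization come first: these are the
# candidates for taking on more of the work queue.
ranked = sorted(machine_log, key=utilization)

for record in ranked:
    print(f'{record["machine"]}: {utilization(record):.0%} utilized')
```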
Data Analytics in
Business
• Data analytics is important because it helps
businesses optimize their performances.
• Implementing it into the business model
means companies can help reduce costs by
identifying more efficient ways of doing
business and by storing large amounts of
data.
• A company can also use data analytics to
make better business decisions and help
analyze customer trends and satisfaction,
which can lead to new—and better—
products and services.
• The use of data analytics goes beyond
maximizing profits and ROI, however.
• Data analytics can provide critical
information for healthcare (health
informatics), crime prevention, and
environmental protection.
• These applications of data analytics use
these techniques to improve our world.
• Though statistics and data analysis have
always been used in scientific research,
advanced analytic techniques and big data
allow for many new insights.
• These techniques can find trends in
complex systems.
• Researchers are currently using machine
learning to protect wildlife.
Data Analytics in
Financial Sector
• One of the earliest adopters is
the financial sector.
• Data analytics has an important role in the
banking and finance industries, used to
predict market trends and assess risk.
• Credit scores are an example of data
analytics that affects everyone. These
scores use many data points to determine
lending risk.
• Data analytics is also used to detect and
prevent fraud to improve efficiency and
reduce risk for financial institutions.
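To give a flavour of how unusual transactions might be flagged, here is a toy outlier check based on the modified z-score (median and median absolute deviation). It is a simplified sketch, not how any particular financial institution detects fraud; the transaction amounts and the `flag_outliers` helper are hypothetical.

```python
from statistics import median


def flag_outliers(amounts, threshold=3.5):
    """Return amounts whose modified z-score exceeds the threshold.

    The modified z-score uses the median and the median absolute
    deviation (MAD), so one extreme value does not distort the baseline.
    """
    center = median(amounts)
    mad = median([abs(a - center) for a in amounts])
    if mad == 0:
        return []  # no spread to compare against in this simple sketch
    return [a for a in amounts if 0.6745 * abs(a - center) / mad > threshold]


# Hypothetical card transactions for one customer (illustrative values only).
transactions = [42, 38, 51, 45, 40, 47, 39, 44, 950, 43]
flagged = flag_outliers(transactions)
print("Transactions to review:", flagged)
```

A flagged amount is only a candidate for review; in practice many more data points (location, merchant, timing) would feed into the decision.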
Data Analytics in Healthcare
• The use of data analytics in healthcare is already widespread.
• Predicting patient outcomes, efficiently allocating funding and improving
diagnostic techniques are just a few examples of how data analytics is
revolutionizing healthcare.
• The pharmaceutical industry is also being revolutionized by machine
learning.
• Drug discovery is a complex task with many variables. Machine learning
can greatly improve drug discovery.
• Pharmaceutical companies also use data analytics to understand the
market for drugs and predict their sales.
Data Analytics in IoT
• The internet of things (IoT) is a field that is often used alongside machine learning. IoT devices provide a great opportunity for data analytics.
• IoT devices often contain many sensors that
collect meaningful data points for their operation.
• Devices like the Nest thermostat track movement
and temperature to regulate heating and cooling.
• Smart devices like this can use data to learn from
and predict your behavior. This will provide
advanced home automation that can adapt to the
way you live.
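The idea of a device using sensor data to regulate heating and cooling can be sketched as follows. This is a deliberately simple, rule-based illustration and makes no claim about how the Nest thermostat works; the class name `ThermostatSketch`, its parameters, and the sample readings are all assumptions.

```python
from collections import deque


class ThermostatSketch:
    """Toy controller: smooths temperature readings and picks an action."""

    def __init__(self, target_c, tolerance_c=0.5, window=3):
        self.target_c = target_c
        self.tolerance_c = tolerance_c
        # Keep only the most recent `window` readings for a moving average.
        self.readings = deque(maxlen=window)

    def update(self, temperature_c, occupied):
        """Record a sensor reading and return 'heat', 'cool', or 'idle'."""
        self.readings.append(temperature_c)
        smoothed = sum(self.readings) / len(self.readings)
        if not occupied:
            return "idle"  # nobody detected by the motion sensor: save energy
        if smoothed < self.target_c - self.tolerance_c:
            return "heat"
        if smoothed > self.target_c + self.tolerance_c:
            return "cool"
        return "idle"


thermostat = ThermostatSketch(target_c=21.0)
# Hypothetical (temperature, movement detected) sensor readings.
for temp, occupied in [(19.0, True), (20.0, True), (23.5, True), (24.0, False)]:
    print(temp, occupied, thermostat.update(temp, occupied))
```

A learning device would go further and adjust the target temperature from the patterns it observes; this sketch only shows the data-to-decision step.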
What are the analytical tools used in data analytics?
• There are various tools used in data analysis.
• Some data analysts use business
intelligence software, such
as Tableau.
• Others may use programming
languages such as SQL or Python,
which have various statistical and
visualization libraries.
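A small example of SQL and Python working together, using Python's built-in `sqlite3` module with an in-memory database. The `sales` table, its columns, and the values are hypothetical and exist only for this illustration.

```python
import sqlite3

# Create a throwaway in-memory database with a hypothetical sales table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales (region, amount) VALUES (?, ?)",
    [
        ("north", 100.0),
        ("north", 140.0),
        ("south", 90.0),
        ("south", 110.0),
        ("south", 130.0),
    ],
)

# SQL does the aggregation: number of sales and average amount per region.
rows = conn.execute(
    "SELECT region, COUNT(*), AVG(amount) "
    "FROM sales GROUP BY region ORDER BY region"
).fetchall()
conn.close()

# Python takes over for presentation or further analysis.
for region, n_sales, avg_amount in rows:
    print(f"{region}: {n_sales} sales, average {avg_amount:.1f}")
```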
KEY TAKEAWAYS

• Data analytics is the science of analyzing raw data to draw conclusions from that information.
• Data analytics helps a business optimize its performance, operate more efficiently, maximize profit, or make more strategically guided decisions.
• The techniques and processes of data analytics have been automated into mechanical processes and algorithms that work over raw data for human consumption.
• Various approaches to data analytics include looking at what happened (descriptive analytics), why something happened (diagnostic analytics), what is going to happen (predictive analytics), or what should be done next (prescriptive analytics).
• Data analytics relies on a variety of software tools, ranging from spreadsheets, data visualization and reporting tools, and data mining programs to open-source languages that offer the greatest flexibility in data manipulation.
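The four approaches listed above can be illustrated on one tiny dataset. The sketch below is a simplification under stated assumptions: the advertising and sales figures, the planned spend, the stock level, and the helper `fit_line` are all hypothetical, and a correlation only hints at a cause rather than proving one.

```python
import math
from statistics import mean

# Hypothetical monthly data: advertising spend and units sold.
ad_spend = [10, 20, 30, 40, 50]
units_sold = [25, 44, 66, 85, 105]


def fit_line(xs, ys):
    """Least-squares slope, intercept, and Pearson correlation for ys ~ xs."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx, sxy / math.sqrt(sxx * syy)


# Descriptive: what happened?
average_sales = mean(units_sold)

# Diagnostic: why might it have happened? (correlation is a hint, not proof)
slope, intercept, correlation = fit_line(ad_spend, units_sold)

# Predictive: what is likely to happen at a planned spend of 60?
planned_spend = 60
forecast = slope * planned_spend + intercept

# Prescriptive: what should be done next? Order enough stock to cover it.
stock_on_hand = 110
units_to_order = max(0, math.ceil(forecast - stock_on_hand))

print(f"Average sales: {average_sales}")
print(f"Correlation with ad spend: {correlation:.3f}")
print(f"Forecast at spend {planned_spend}: {forecast:.1f} units")
print(f"Recommended order: {units_to_order} units")
```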
Research in Data Science/Data Analytics
