
University of Sadat City

Faculty of Computers and Artificial Intelligence (FCAI)

IS Department

Business Intelligence (IS401)
Lecture 3

Prepared By:
Dr. Heba Askr
First Term 2023-2024
Business Intelligence, Analytics, Data
Science, and AI
Fifth Edition

Chapter 3
Descriptive Analytics I: Nature of Data, Big Data and
Statistical Modeling

Copyright © 2024, 2018, 2014, 2011 Pearson Education, Inc. All Rights Reserved
Learning Objectives (1 of 2)
3.1 Understand the nature of data as it relates to business
intelligence (BI) and analytics
3.2 Learn the methods used to make real-world data
analytics ready
3.3 Learn what Big Data is and how it is changing the world
of analytics
3.4 Understand the motivation for and business drivers of Big
Data analytics
3.5 Become familiar with the wide range of enabling
technologies for Big Data analytics

Learning Objectives (2 of 2)
3.6 Learn about Hadoop, Spark, MapReduce, and NoSQL
as they relate to Big Data analytics
3.7 Become familiar with the Data for Good concept
3.8 Understand the need for and appreciate the capabilities
of stream analytics
3.9 Learn about the applications of stream analytics
3.10 Describe statistical modeling and its relationship to
business analytics
3.11 Learn about descriptive and inferential statistics

The Nature of Data (1 of 2)
• Data: a collection of facts
– usually obtained as the result of experiences,
observations, or experiments
• Data may consist of numbers, words, images, …
• Data is the lowest level of abstraction (from which
information and knowledge are derived)
• Data is the source for information and knowledge
• Data quality and data integrity → critical to analytics

The Nature of Data (2 of 2)
Figure 3.1 A Data to Knowledge Continuum

Metrics for Analytics Ready Data
• Data source reliability
• Data content accuracy
• Data accessibility
• Data security and data privacy
• Data richness
• Data consistency
• Data currency/data timeliness
• Data granularity (data precision)
• Data validity and data relevancy

A Simple Taxonomy of Data (1 of 2)
• Data (datum—singular form of data) = facts
• Structured data
– Targeted for computers to process
– Numeric versus nominal
• Unstructured/textual data
– Targeted for humans to process/digest
• Semi-structured data?
– XML, HTML, log files, etc.
• Data taxonomy …

A Simple Taxonomy of Data (2 of 2)
Figure 3.2 A Simple Taxonomy of Data

The Art and Science of Data
Preprocessing (1 of 2)
• Real-world data is dirty, misaligned, overly complex,
and inaccurate
– Not ready for analytics!
• The data must be readied for analytics
– Data preprocessing
▪ Data consolidation (unifying the data)
▪ Data cleaning
▪ Data transformation
▪ Data reduction
• Art – it develops and improves with experience

The Art and Science of Data
Preprocessing (2 of 2)
Figure 3.3 Data Preprocessing Steps
• The process is usually iterative, with
many feedback loops and redos
• Data reduction
1. Variables
▪ Dimensional reduction
▪ Variable selection
2. Cases/samples
▪ Sampling
▪ Balancing / stratification
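The case-reduction idea above (sampling with stratification) can be sketched in a few lines. This is a minimal illustration using only the Python standard library; the function name `stratified_sample`, the 20% fraction, and the churn data are hypothetical and not taken from the lecture.

```python
import random
from collections import defaultdict

def stratified_sample(records, label_of, fraction, seed=0):
    """Sample the same fraction from every class so class proportions are preserved."""
    rng = random.Random(seed)          # fixed seed so the sketch is repeatable
    by_class = defaultdict(list)
    for rec in records:
        by_class[label_of(rec)].append(rec)
    sample = []
    for group in by_class.values():
        k = max(1, round(len(group) * fraction))  # keep at least one record per class
        sample.extend(rng.sample(group, k))
    return sample

# Hypothetical data set: 90 "stay" customers and 10 "churn" customers.
records = [("stay", i) for i in range(90)] + [("churn", i) for i in range(10)]
sample = stratified_sample(records, label_of=lambda r: r[0], fraction=0.2)

counts = defaultdict(int)
for label, _ in sample:
    counts[label] += 1
print(dict(counts))  # the 9:1 class ratio of the full data is kept in the sample
```

A plain random sample of 20 records could easily contain no "churn" cases at all; sampling within each class avoids that.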

Data Preprocessing Tasks and Methods
Table 3.1 A Summary of Data Preprocessing Tasks and Potential Methods
(Each main task lists its subtasks, followed by the popular methods for that subtask.)
• Data consolidation
– Access and collect the data: SQL queries, software agents, Web services.
– Select and filter the data: Domain expertise, SQL queries, statistical tests.
– Integrate and unify the data: SQL queries, domain expertise, ontology-driven data mapping.
• Data cleaning
– Handle missing values in the data: Fill in missing values (imputations) with most appropriate values (mean, median, min/max, mode, etc.); recode the missing values with a constant such as “ML”; remove the record of the missing value; do nothing.
– Identify and reduce noise in the data: Identify the outliers in data with simple statistical techniques (such as averages and standard deviations) or with cluster analysis; once identified, either remove the outliers or smooth them by using binning, regression, or simple averages.
– Find and eliminate erroneous data: Identify the erroneous values in data (other than outliers), such as odd values, inconsistent class labels, odd distributions; once identified, use domain expertise to correct the values or remove the records holding the erroneous values.
• Data transformation
– Normalize the data: Reduce the range of values in each numerically valued variable to a standard range (e.g., 0 to 1 or -1 to +1) by using a variety of normalization or scaling techniques.
– Discretize or aggregate the data: If needed, convert the numeric variables into discrete representations using range- or frequency-based binning techniques; for categorical variables, reduce the number of values by applying proper concept hierarchies.
– Construct new attributes: Derive new and more informative variables from the existing ones using a wide range of mathematical functions (as simple as addition and multiplication or as complex as a hybrid combination of log transformations).
• Data reduction
– Reduce number of attributes: Principal component analysis, independent component analysis, chi-square testing, correlation analysis, and decision tree induction.
– Reduce number of records: Random sampling, stratified sampling, expert-knowledge-driven purposeful sampling.
– Balance skewed data: Oversample the less represented or undersample the more represented classes.

Big Data - Definition and Concepts (1 of 2)
• Big Data means different things to people with different
backgrounds and interests
• Traditionally, “Big Data” = massive volumes of data
– For example, the volume of data at CERN, NASA, Google, …
• Where does the Big Data come from?
– Everywhere! Web logs, RFID, GPS systems, sensor
networks, social networks, Internet-based text
documents, Internet search indexes, call detail
records, astronomy, atmospheric science, biology,
genomics, nuclear physics, biochemical experiments,
medical records, scientific research, military
surveillance, multimedia archives, …

Big Data - Definition and Concepts (2 of 2)
• Big Data is a misnomer!
• Big Data is more than just “big”
• The Vs that define Big Data
– Volume
– Variety
– Velocity
– Veracity (trustworthiness)
– Variability
– Value
– …

Fundamentals of Big Data Analytics
• Big Data by itself, regardless of the size, type, or speed, is
worthless
• Big Data + “big” analytics = value
• With the value proposition, Big Data also brought about
big challenges
– Effectively and efficiently capturing, storing, and
analyzing Big Data
– New breed of technologies needed (developed or
purchased or hired or outsourced …)

Critical Success Factors for Big Data
Analytics (1 of 2)
• A clear business need (alignment with the vision and the
strategy)
• Strong, committed sponsorship (executive champion)
• Alignment between the business and IT strategy
• A fact-based decision-making culture
• A strong data infrastructure
• The right analytics tools
• Right people with right skills

Critical Success Factors for Big Data
Analytics (2 of 2)

Enablers of Big Data Analytics
• In-memory analytics
– Storing and processing the complete data set in RAM
• In-database analytics
– Placing analytic procedures close to where data is stored
• Grid computing & MPP
– Use of many machines and processors in parallel (MPP -
massively parallel processing)
• Appliances (devices)
– Combining hardware, software, and storage in a single unit
for performance and scalability

Challenges of Big Data Analytics
• Data volume
– The ability to capture, store, and process the huge
volume of data in a timely manner
• Data integration
– The ability to combine data quickly and at reasonable
cost
• Processing capabilities
– The ability to process the data quickly, as it is captured
(i.e., stream analytics)
• Data governance (… security, privacy, access)
• Skill availability (… data scientist)
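The "process the data as it is captured" idea behind stream analytics can be illustrated with a sliding-window average that is updated on every arriving reading. This is only a toy sketch: the window size and the sensor readings are hypothetical, and a real stream engine would add concerns such as late data and fault tolerance.

```python
from collections import deque

def sliding_mean(stream, window):
    """Yield the mean of the most recent `window` readings after each arrival."""
    recent = deque(maxlen=window)     # older readings fall out automatically
    for reading in stream:
        recent.append(reading)
        yield sum(recent) / len(recent)

readings = [10, 20, 30, 40]           # hypothetical sensor readings, in arrival order
means = list(sliding_mean(readings, window=2))
print(means)  # [10.0, 15.0, 25.0, 35.0]
```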

Business Problems Addressed by Big
Data Analytics
• Process efficiency and cost reduction
• Brand management
• Revenue maximization, cross-selling/up-selling
• Enhanced customer experience
• Churn identification, customer recruiting
• Improved customer service
• Identifying new products and market opportunities
• Risk management
• …

Big Data Technologies
• MapReduce …
• Hadoop …
• Hive
• Pig
• HBase
• Flume
• Oozie
• Ambari
• …

Big Data Technologies--Hadoop (1 of 3)
• Hadoop is an open-source framework for storing and
analyzing massive amounts of distributed, unstructured
data
– Originally created by Doug Cutting at Yahoo!
• Hadoop clusters run on inexpensive commodity hardware,
so projects can scale out inexpensively
– Hadoop is now part of the Apache Software Foundation
– Open source - hundreds of contributors continuously
improve the core technology

Big Data Technologies--Hadoop (2 of 3)
• How Does Hadoop Work?
– Access unstructured and semi-structured data (for example,
log files, social media feeds, other data sources)
– Break the data up into “parts,” which are then loaded into a
file system made up of multiple nodes running on
commodity hardware using HDFS
– Each “part” is replicated multiple times and loaded into the
file system for replication and failsafe processing
– A node acts as the Facilitator and another as Job Tracker
– Jobs are distributed to the clients, and once completed the
results are collected and aggregated using MapReduce
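The "break into parts, then replicate each part across nodes" steps above can be sketched as a toy model. This is not how HDFS is implemented: the tiny block size, the node names, and the round-robin placement are all assumptions made for illustration (real HDFS uses much larger blocks and its own placement policy).

```python
def split_into_blocks(data, block_size):
    """Break the data into fixed-size parts (the last part may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]

def place_replicas(num_blocks, nodes, replication):
    """Assign each block to `replication` distinct nodes, round-robin (toy policy)."""
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    return {
        b: [nodes[(b + r) % len(nodes)] for r in range(replication)]
        for b in range(num_blocks)
    }

blocks = split_into_blocks("abcdefghij", block_size=4)
placement = place_replicas(len(blocks), ["node1", "node2", "node3", "node4"], replication=3)
print(blocks)        # ['abcd', 'efgh', 'ij']
print(placement[2])  # ['node3', 'node4', 'node1']
```

Because every block lives on three different nodes, losing any single node still leaves two copies of each part available.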

Big Data Technologies--Hadoop (3 of 3)
• Hadoop Technical Components
– Hadoop Distributed File System (HDFS)
– Name Node (primary facilitator)
– Secondary Node (backup to Name Node)
– Job Tracker
– Slave Nodes (the grunts of any Hadoop cluster)
– Additionally, the Hadoop ecosystem is made up of a
number of complementary sub-projects: NoSQL
(Cassandra, HBase), DW (Hive), …
▪ NoSQL = not only SQL

Big Data Technologies--MapReduce (1 of 2)
• MapReduce distributes the processing of very large multi-
structured data files across a large cluster of ordinary
machines/processors
• Goal - achieving high performance with “simple”
computers
• Developed and popularized by Google
• Good at processing and analyzing large volumes of multi-
structured data in a timely manner
• Example tasks: indexing the Web for search, graph
analysis, text analysis, machine learning, …

Big Data Technologies--MapReduce (2 of 2)
• How does MapReduce work?
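A common way to answer this question is the word-count example: a map step emits key/value pairs, a shuffle step groups them by key, and a reduce step aggregates each group. The sketch below runs the three phases in a single Python process; the function names and the two input splits are hypothetical, and a real cluster would run the map and reduce steps in parallel on different machines.

```python
from collections import defaultdict

def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    """Reduce: aggregate the values of each key (here, sum the counts)."""
    return {key: sum(values) for key, values in grouped.items()}

splits = ["big data big analytics", "data is big"]   # hypothetical input splits
mapped = [pair for doc in splits for pair in map_phase(doc)]
word_counts = reduce_phase(shuffle(mapped))
print(word_counts)  # {'big': 3, 'data': 2, 'analytics': 1, 'is': 1}
```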

Technology Insights 3.2
A Few Demystifying Facts about Hadoop
1. Hadoop consists of multiple products
2. Hadoop is open source but available from vendors, too
3. Hadoop is an ecosystem, not a single product
4. HDFS is a file system, not a DBMS
5. Hive resembles SQL but is not standard SQL
6. Hadoop and MapReduce are related but not the same
7. MapReduce provides control for analytics, not analytics
8. Hadoop is about data diversity, not just data volume
9. Hadoop complements a DW; it’s rarely a replacement
10. Hadoop enables many types of analytics, not just Web
analytics
Spark Versus Hadoop (1 of 2)
• Both of these open-source frameworks are developed by the
Apache Software Foundation (in 2004 and 2009)
• Hadoop is for large volumes and varied types of data
• Spark is for in-memory processing for speed/efficiency
1. Order-of-magnitude faster processing of big data
2. A unified engine that supports highly efficient SQL queries,
streaming data, machine learning, and graph processing
3. Revamped APIs designed for ease of use, especially for
processing unstructured and semi-structured data
• Comparison dimensions: Performance, Cost, Parallel
processing, Scalability, Security, and Analytics
• NoSQL – Not only SQL!

Spark Versus Hadoop (2 of 2)
• Use Hadoop when …
– Processing big data sets in environments where data size
exceeds available memory
– Batch processing with tasks that exploit disk read and write
operations
– Building data analysis infrastructure with a limited budget
– Completing jobs that are not time-sensitive
– Historical and archive data analysis
• Use Spark when …
– Dealing with parallel operations of iterative algorithms
– Achieving quick results with in-memory computations
– Analyzing stream data in real time
– Graph-parallel processing to model data
– All ML applications
Stream Analytics Applications
• e-Commerce
• Telecommunication
• Law Enforcement and Cyber Security
• Power Industry
• Financial Services
• Health Services
• Government

Statistical Modeling for Business
Analytics (1 of 2)

Statistical Modeling for Business
Analytics (2 of 2)
• Statistics
– A collection of mathematical techniques to
characterize and interpret data
• Descriptive Statistics
– Describing the data (as it is)
• Inferential statistics
– Drawing inferences about the population based on
sample data
• Descriptive statistics for descriptive analytics

Descriptive Statistics Measures of
Central Tendency
• Arithmetic mean
$$\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{\sum_{i=1}^{n} x_i}{n}$$
• Median
– The number in the middle
• Mode
– The most frequent observation
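The three measures can be computed directly with Python's standard `statistics` module. The `sales` values below are made up for illustration; note how the single large value (40) pulls the mean above the median.

```python
from statistics import mean, median, mode

sales = [12, 15, 15, 18, 40]   # hypothetical daily sales figures

avg = mean(sales)      # (12 + 15 + 15 + 18 + 40) / 5 = 20
mid = median(sales)    # middle value of the sorted data = 15
most = mode(sales)     # most frequent observation = 15
print(avg, mid, most)  # 20 15 15
```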

Descriptive Statistics Measures of
Dispersion (1 of 2)
• Dispersion
– Degree of variation in a given
variable
• Range
– Max - Min
• Variance
$$s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$$
• Standard deviation
$$s = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}}$$
• Mean Absolute Deviation (MAD)
– Average absolute deviation from the mean
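A short worked example of the dispersion measures on this slide, using the standard `statistics` module (its `variance` and `stdev` functions use the sample formulas with n - 1 in the denominator). The data values are hypothetical.

```python
from statistics import mean, stdev, variance

data = [2, 4, 4, 4, 5, 5, 7, 9]   # hypothetical observations, mean = 5

data_range = max(data) - min(data)            # Max - Min = 7
var = variance(data)                          # sum of squared deviations 32, divided by n - 1 = 7
sd = stdev(data)                              # square root of the sample variance
m = mean(data)
mad = mean([abs(x - m) for x in data])        # average absolute deviation from the mean = 12 / 8

print(data_range, round(var, 3), round(sd, 3), mad)  # 7 4.571 2.138 1.5
```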
Descriptive Statistics Measures of
Dispersion (2 of 2)
Figure 3.12 Understanding
the Specifics about Box-and-
Whiskers Plots
• a.k.a. box-and-whiskers
plot
• Versatile / informative
• Quartiles
• Median, mean, outliers
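The quantities a box-and-whiskers plot displays can be computed without drawing anything. The sketch below uses `statistics.quantiles` for the quartiles and flags outliers with the 1.5 × IQR rule; that rule is a common convention assumed here, not something stated on the slide, and quartile values can differ slightly between software packages because several interpolation methods exist. The data are hypothetical.

```python
from statistics import median, quantiles

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]     # hypothetical observations

q1, q2, q3 = quantiles(data, n=4)           # the three quartile cut points
iqr = q3 - q1                               # interquartile range (height of the box)
lower_fence = q1 - 1.5 * iqr                # assumed convention for whisker limits
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print(q2 == median(data))  # True: the second quartile is the median
print(outliers)            # [30]
```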

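Descriptive Statistics - Shape of a Distribution (1 of 2)
• Skewness
– Degree of asymmetry of the distribution
$$\text{Skewness} = S = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^3}{(n - 1)s^3}$$
• Kurtosis
– Peak/tall/skinny nature of the distribution
$$\text{Kurtosis} = K = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^4}{ns^4} - 3$$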
Descriptive Statistics - Shape of a
Distribution (2 of 2)
Figure 3.13 Relationship
between Dispersion and
Shape Properties
• Skewness – positive
versus negative
• Kurtosis – tall versus
short

Regression Modeling for Inferential
Statistics
• Regression
– A part of inferential statistics
– The most widely known and used analytics technique
in statistics
– Used to characterize relationship between explanatory
(input) and response (output) variable
• It can be used for
– Hypothesis testing (explanation)
– Forecasting (prediction)

Regression Modeling (1 of 3)
• Correlation versus Regression
– What is the difference (or relationship)?
• Simple Regression versus Multiple Regression
– Based on the number of input variables
• How do we develop linear regression models?
– Scatter plots (visualization—for simple regression)
– Ordinary least squares method
▪ A line that minimizes the sum of the squared errors

Regression Modeling (3 of 3)
• x: input, y: output
• Simple Linear Regression
$$y = \beta_0 + \beta_1 x$$
• Multiple Linear Regression
$$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \cdots + \beta_n x_n$$
• The meaning of Beta ($\beta$) coefficients
– Sign (+ or -) and magnitude
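For simple linear regression, the ordinary least squares estimates have a closed form: the slope is the sum of cross-deviations divided by the sum of squared deviations of x, and the intercept follows from the means. The sketch below applies it to made-up data that lie exactly on a line, so the recovered coefficients are easy to check; the variable names are hypothetical.

```python
from statistics import mean

def fit_simple_ols(xs, ys):
    """Return (beta0, beta1) that minimize the sum of squared errors for y = beta0 + beta1 * x."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    beta1 = sxy / sxx                 # slope
    beta0 = y_bar - beta1 * x_bar     # intercept
    return beta0, beta1

ad_spend = [1, 2, 3, 4, 5]            # hypothetical input variable x
sales = [3, 5, 7, 9, 11]              # hypothetical output variable, exactly y = 1 + 2x
beta0, beta1 = fit_simple_ols(ad_spend, sales)
print(beta0, beta1)  # 1.0 2.0
```

Here the positive sign of the slope says sales rise with ad spend, and its magnitude says by how much: two units of sales per unit of spend.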

Process of Developing a Regression
Model
• How do we know if the
model is good enough?
– $R^2$ (R-squared)
– p-values
– Error measures (for
prediction problems)
▪ MSE, MAD, RMSE
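The fit and error measures listed above can be computed from actual and predicted values as follows. In this context MAD is taken to mean the mean absolute deviation of the prediction errors (often called MAE); that reading is an assumption, and the numbers are hypothetical.

```python
from math import sqrt

actual = [3, 5, 7, 9]           # hypothetical observed values
predicted = [2, 5, 8, 10]       # hypothetical model predictions

errors = [a - p for a, p in zip(actual, predicted)]   # [1, 0, -1, -1]
n = len(actual)

mse = sum(e ** 2 for e in errors) / n       # mean squared error = 3 / 4
mad = sum(abs(e) for e in errors) / n       # mean absolute deviation of errors = 3 / 4
rmse = sqrt(mse)                            # root mean squared error

y_bar = sum(actual) / n
ss_res = sum(e ** 2 for e in errors)                # unexplained variation = 3
ss_tot = sum((a - y_bar) ** 2 for a in actual)      # total variation = 20
r_squared = 1 - ss_res / ss_tot                     # share of variation explained

print(mse, mad, round(rmse, 3), round(r_squared, 2))  # 0.75 0.75 0.866 0.85
```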

Logistic Regression Modeling (1 of 2)
• A very popular statistics-based classification algorithm
• Employs supervised learning
• Developed in the 1940s
• The difference between Linear Regression and Logistic
Regression
– In Logistic Regression, the output/target variable is a
binomial (binary classification) variable (as opposed to a
numeric variable)
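One way to see the difference is that logistic regression passes the linear combination of inputs through the logistic (sigmoid) function, which yields a probability between 0 and 1 that is then thresholded into a class. The sketch below shows only the prediction step; the coefficient values and the 0.5 threshold are hypothetical and are not fitted from any data.

```python
from math import exp

def logistic(z):
    """Logistic (sigmoid) function: maps any real number to a value between 0 and 1."""
    return 1.0 / (1.0 + exp(-z))

def predict_class(x, beta0, beta1, threshold=0.5):
    """Return (class label, probability) for one input value."""
    p = logistic(beta0 + beta1 * x)       # probability of class 1
    return (1 if p >= threshold else 0), p

# Hypothetical coefficients; a real model would estimate them from labeled data.
label, p = predict_class(x=3.0, beta0=-1.0, beta1=0.5)   # linear part z = 0.5
print(label, round(p, 3))  # 1 0.622
```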

Analytics In Action 3.3 (2 of 6)
• The analytics process to develop prediction models (both
regression and classification type) for NCAA Bowl Game
outcomes

Analytics In Action 3.3 (3 of 6)
• Prediction Results
1. Classification (directly predicting “Win” versus “Loss”)
▪ Simple binary classification
2. Regression (predicting the score difference and then
converting the results into “Win” versus “Loss”)
▪ Regression-based classification
– Which one would be more accurate?

*The output variable is a binary categorical variable (Win or Loss); differences were significant (** p < 0.01).

*The output variable is a numerical/integer variable (point-diff); differences were significant (** p < 0.01).
Sources: Delen, D., Cogdell, D., & Kasap, N. (2012). A comparative analysis of data mining
methods in predicting NCAA bowl outcomes. International Journal of Forecasting, 28, 543–
552; Freeman, K. M., & Brewer, R. M. (2016). The politics of American college football.
Journal of Applied Business and Economics, 18(2), 97–101.
Copyright

This work is protected by United States copyright laws and is
provided solely for the use of instructors in teaching their courses
and assessing student learning. Dissemination or sale of any part
of this work (including on the World Wide Web) will destroy the
integrity of the work and is not permitted. The work and materials
from it should never be made available to students except by
instructors using the accompanying text in their classes. All
recipients of this work are expected to abide by these restrictions
and to honor the intended pedagogical purposes and the needs of
other instructors who rely on these materials.
