DATA MINING Chapter 1 and 2 Lect Slide
Alternative names
Knowledge discovery (mining) in databases (KDD), knowledge extraction, data/pattern analysis, data archeology, data dredging, information harvesting, business intelligence, etc.
[Figure: the KDD process — data cleaning and data integration performed on the source databases]
Steps of the KDD process:
1. Data cleaning (to remove noise and inconsistent data)
2. Data integration (where multiple data sources may be combined)
3. Data selection (where data relevant to the analysis task are retrieved from the database)
4. Data transformation (where data are transformed or consolidated into forms appropriate for mining, by performing summary or aggregation operations, for instance)
5. Data mining (an essential process where intelligent methods are applied in order to extract data patterns)
6. Pattern evaluation (to identify the truly interesting patterns representing knowledge, based on some interestingness measures)
7. Knowledge presentation (where visualization and knowledge representation techniques are used to present the mined knowledge to the user)
Components of a typical data mining system architecture:
- Database, data warehouse, World Wide Web, or other information repository: one or a set of databases, data warehouses, spreadsheets, or other kinds of information repositories. Data cleaning and data integration techniques may be performed on the data.
- Database or data warehouse server: responsible for fetching the relevant data, based on the user's data mining request.
- Knowledge base: the domain knowledge that is used to guide the search or evaluate the interestingness of resulting patterns. Such knowledge can include concept hierarchies, used to organize attributes or attribute values into different levels of abstraction.
- Data mining engine: essential to the data mining system; ideally consists of a set of functional modules for tasks such as characterization, association and correlation analysis, classification, prediction, cluster analysis, outlier analysis, and evolution analysis.
- Pattern evaluation module: typically employs interestingness measures and interacts with the data mining modules so as to focus the search toward interesting patterns.
Data Preprocessing
Why preprocess the data?
Major tasks in data preprocessing:
- Data cleaning: fill in missing values, smooth noisy data, identify or remove outliers, and resolve inconsistencies
- Data integration: integration of multiple databases, data cubes, or files
- Data transformation: normalization and aggregation
- Data reduction: obtains a reduced representation in volume that produces the same or similar analytical results
- Data discretization: part of data reduction, but of particular importance, especially for numerical data
Data Cleaning
Data cleaning tasks
- Fill in missing values
- Identify outliers and smooth out noisy data
- Correct inconsistent data
Missing Data
Data is not always available
E.g., many tuples have no recorded value for several attributes, such as customer income in sales data
Missing data may be due to:
- equipment malfunction
- inconsistency with other recorded data (and thus deletion)
- data not entered due to misunderstanding
- certain data not being considered important at the time of entry
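As a minimal sketch of the common remedies (the DataFrame and its column names are hypothetical), missing values can be ignored, filled with a global constant, or filled with the attribute mean using pandas:

```python
import numpy as np
import pandas as pd

# Hypothetical sales data with missing customer income values
df = pd.DataFrame({
    "customer": ["A", "B", "C", "D"],
    "income": [52000.0, np.nan, 61000.0, np.nan],
})

dropped = df.dropna(subset=["income"])                  # ignore incomplete tuples
constant = df.fillna({"income": 0.0})                   # fill with a global constant
mean_fill = df.fillna({"income": df["income"].mean()})  # fill with the attribute mean
print(mean_fill)
```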
Noisy Data
Noise: random error or variance in a measured variable. Incorrect attribute values may be due to:
- faulty data collection instruments
- data entry problems
- data transmission problems
- technology limitations
- inconsistency in naming conventions

Other data problems that require data cleaning:
- duplicate records
- incomplete data
- inconsistent data
- Binning: first sort the data and partition it into (equal-frequency) bins; then one can smooth by bin means, smooth by bin medians, smooth by bin boundaries, etc. (see the sketch below)
- Clustering: detect and remove outliers
- Combined computer and human inspection: detect suspicious values and have them checked by a human
- Regression: smooth by fitting the data to regression functions
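For instance, a minimal sketch of smoothing by bin means over equal-frequency bins (the price values are made up):

```python
import numpy as np

prices = np.array([4, 8, 15, 21, 21, 24, 25, 28, 34])  # already sorted

# Partition into 3 equal-frequency (equal-depth) bins
bins = np.array_split(prices, 3)

# Smoothing by bin means: replace every value by the mean of its bin
smoothed = np.concatenate([np.full(len(b), b.mean()) for b in bins])
print(smoothed)  # [ 9.  9.  9. 22. 22. 22. 29. 29. 29.]
```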
Cluster Analysis
Unlike classification, the class labels are not present in the training data, simply because they are not known to begin with. Clustering can be used to generate such labels. Objects are clustered or grouped based on the principle of maximizing intra-class similarity and minimizing inter-class similarity. Typical algorithms: k-means, k-medoids (a minimal k-means sketch follows).
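A minimal k-means sketch, assuming simple 2-D numeric data (the points are synthetic):

```python
import numpy as np

def kmeans(X, k, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]  # k initial centers
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest center
        labels = np.argmin(((X[:, None, :] - centers) ** 2).sum(axis=2), axis=1)
        # Update step: each center moves to the mean of its cluster
        new_centers = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(size=(20, 2)), rng.normal(size=(20, 2)) + 5.0])
labels, centers = kmeans(X, k=2)
print(centers)  # two centers, near (0, 0) and (5, 5)
```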
[Figure: cluster analysis — objects grouped into clusters]
[Figure: regression — data smoothed by fitting the line y = x + 1 (axes X1 and Y1)]
Different commercial tools can aid in the discrepancy detection step.
Data scrubbing tools use simple domain knowledge (e.g., knowledge of postal addresses and spell-checking) to detect errors and make corrections in the data. These tools rely on parsing and fuzzy matching techniques when cleaning data from multiple sources.
Data auditing tools find discrepancies by analyzing the data to discover rules and relationships, and detecting data that violate such conditions. They are variants of data mining tools.
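To illustrate the kind of fuzzy matching such tools rely on, a small sketch with Python's standard difflib (the city list and misspellings are made up):

```python
import difflib

known_cities = ["Chicago", "Houston", "Phoenix", "Philadelphia"]  # domain knowledge

for entry in ["Chcago", "Houston", "Pheonix"]:
    if entry not in known_cities:
        # Suggest the closest valid value as a correction
        match = difflib.get_close_matches(entry, known_cities, n=1, cutoff=0.6)
        print(f"discrepancy: {entry!r} -> suggested correction: {match}")
```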
Data Integration
Data integration: combines data from multiple sources into a coherent store.
- Schema integration: integrate metadata from different sources
- Entity identification problem: identify real-world entities from multiple data sources, e.g., A.cust-id ≡ B.cust-#
- Detecting and resolving data value conflicts: for the same real-world entity, attribute values from different sources differ; possible reasons include different representations and different scales, e.g., metric vs. British units
One attribute may be a derived (and hence redundant) attribute in another table, e.g., annual revenue.
Redundant data may be detected by correlation analysis (see the sketch below). Careful integration of the data from multiple sources may help reduce or avoid redundancies and inconsistencies and improve mining speed and quality.
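As a minimal sketch of detecting a redundant numeric attribute via the Pearson correlation coefficient (the revenue figures are made up):

```python
import numpy as np

monthly_revenue = np.array([10.0, 12.0, 9.0, 15.0, 11.0])
annual_revenue = monthly_revenue * 12  # derived, hence redundant

# Pearson correlation: values near +1 or -1 suggest redundancy
r = np.corrcoef(monthly_revenue, annual_revenue)[0, 1]
if abs(r) > 0.95:  # the threshold is an arbitrary choice for this sketch
    print(f"r = {r:.2f}: attributes look redundant; consider dropping one")
```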
Data Transformation
Data transformation routines convert the data into forms appropriate for mining. For example, attribute data may be normalized so as to fall within a small range, such as 0.0 to 1.0.
- Smoothing: remove noise from the data
- Aggregation: summarization, data cube construction
- Generalization: concept hierarchy climbing
- Normalization: scale values to fall within a small, specified range
  - min-max normalization
  - z-score normalization
  - normalization by decimal scaling
Min-max normalization: $v' = \frac{v - min_A}{max_A - min_A}(new\_max_A - new\_min_A) + new\_min_A$
z-score normalization: $v' = \frac{v - \bar{A}}{\sigma_A}$
Normalization by decimal scaling: $v' = \frac{v}{10^j}$, where $j$ is the smallest integer such that $\max(|v'|) < 1$
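A minimal sketch implementing the three normalizations above (the attribute values and the [0, 1] target range are assumptions):

```python
import numpy as np

v = np.array([200.0, 300.0, 400.0, 600.0, 1000.0])

# Min-max normalization to [new_min, new_max] = [0, 1]
minmax = (v - v.min()) / (v.max() - v.min()) * (1.0 - 0.0) + 0.0

# z-score normalization
zscore = (v - v.mean()) / v.std()

# Decimal scaling: smallest j such that max(|v'|) < 1
j = 0
while np.abs(v / 10 ** j).max() >= 1:
    j += 1
decimal = v / 10 ** j

print(minmax, zscore, decimal, sep="\n")
```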
Strategies for data reduction include the following:
1. Data cube aggregation, where aggregation operations are applied to the data in the construction of a data cube.
2. Attribute subset selection, where irrelevant, weakly relevant, or redundant attributes or dimensions may be detected and removed.
3. Dimensionality reduction, where encoding mechanisms are used to reduce the data set size.
4. Numerosity reduction, where the data are replaced or estimated by alternative, smaller data representations such as parametric models (which need store only the model parameters instead of the actual data) or nonparametric methods such as clustering, sampling, and the use of histograms (a sampling sketch follows the figure below).
5. Discretization and concept hierarchy generation, where raw data values for attributes are replaced by ranges or higher conceptual levels. Data discretization is a form of numerosity reduction that is very useful for the automatic generation of concept hierarchies.
[Figure: attribute subset selection by decision tree induction — internal nodes test attributes (A1?, A6?); leaves assign Class 1 or Class 2]
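As a sketch of numerosity reduction by simple random sampling without replacement (strategy 4 above; the data is synthetic):

```python
import numpy as np

rng = np.random.default_rng(42)
data = rng.normal(loc=50.0, scale=10.0, size=100_000)  # synthetic attribute values

# Keep a 1% simple random sample without replacement (SRSWOR)
sample = rng.choice(data, size=len(data) // 100, replace=False)

# The reduced representation yields similar analytical results
print(f"full mean = {data.mean():.2f}, sample mean = {sample.mean():.2f}")
```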
Data Compression
- String compression
  - There are extensive theories and well-tuned algorithms
  - Typically lossless
  - But only limited manipulation is possible without expansion
- Audio/video compression
  - Typically lossy compression, with progressive refinement
  - Sometimes small fragments of the signal can be reconstructed without reconstructing the whole
- Time sequences (which are not audio)
  - Typically short, and vary slowly with time
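A quick demonstration of lossless string compression using Python's standard zlib:

```python
import zlib

text = b"data mining " * 100  # repetitive strings compress very well
compressed = zlib.compress(text)

print(len(text), "->", len(compressed))     # large reduction in size
assert zlib.decompress(compressed) == text  # lossless: exact reconstruction
```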
[Figure: data compression — original data is transformed into compressed data; lossless compression allows exact reconstruction of the original]
Linear regression: $Y = \alpha + \beta X$. The two parameters $\alpha$ and $\beta$ specify the line and are estimated from the data at hand, using the least squares criterion on the known values $Y_1, Y_2, \ldots$ and $X_1, X_2, \ldots$ (see the sketch below). Multiple regression: $Y = b_0 + b_1 X_1 + b_2 X_2$. Many nonlinear functions can be transformed into this form. Log-linear models: the multi-way table of joint probabilities is approximated by a product of lower-order tables, e.g., $p(a, b, c, d) \approx \alpha_{ab}\,\beta_{ac}\,\gamma_{ad}\,\delta_{bcd}$.
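A sketch of estimating $\alpha$ and $\beta$ by least squares with numpy (the data points are made up):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])

# Least squares estimates for Y = alpha + beta * X
beta = ((x - x.mean()) * (y - y.mean())).sum() / ((x - x.mean()) ** 2).sum()
alpha = y.mean() - beta * x.mean()

# Storing only (alpha, beta) instead of the raw data is numerosity reduction
print(f"Y = {alpha:.2f} + {beta:.2f} X")
```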
Histograms
A popular data reduction technique: divide the data into buckets and store the average (or sum) for each bucket. Histograms can be constructed optimally in one dimension using dynamic programming, and are related to quantization problems (a sketch follows).
[Figure: histogram with buckets over values in the range ~50,000–90,000]
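A sketch of an equal-width histogram as a reduced representation (the prices are synthetic):

```python
import numpy as np

rng = np.random.default_rng(0)
prices = rng.uniform(50_000, 90_000, size=10_000)

# Store only bucket edges plus per-bucket counts and averages
counts, edges = np.histogram(prices, bins=4)
idx = np.clip(np.digitize(prices, edges) - 1, 0, len(counts) - 1)
averages = [prices[idx == b].mean() for b in range(len(counts))]

for b in range(len(counts)):
    print(f"[{edges[b]:,.0f}, {edges[b + 1]:,.0f}): "
          f"count={counts[b]}, avg={averages[b]:,.0f}")
```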
Discretization
Three types of attributes:
- Nominal: values from an unordered set
- Ordinal: values from an ordered set
- Continuous: real numbers

Discretization divides the range of a continuous attribute into intervals.
- Some classification algorithms only accept categorical attributes
- Reduces data size
- Prepares for further analysis
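A sketch contrasting equal-width and equal-frequency discretization with pandas (the ages are made up):

```python
import pandas as pd

ages = pd.Series([22, 25, 27, 31, 38, 42, 45, 51, 63, 70])

equal_width = pd.cut(ages, bins=3)  # intervals of equal length
equal_freq = pd.qcut(ages, q=3)     # intervals holding equal counts

print(pd.concat({"width": equal_width, "freq": equal_freq}, axis=1))
```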
[Figure: automatic concept hierarchy generation based on the number of distinct values per attribute, e.g., country (15 distinct values) < ... < city < street]
THANKS.