Lecture_2
Big data
• What Is Big Data
• Big Data refers to large data sets (volume) that are created from a
variety of sources (variety) and at high speed (a.k.a. velocity).
• Any data set that has one of these attributes can be called Big Data.
• Two further attributes are also considered: veracity (how trustworthy
the data is) and value (how useful it is).
• The Data Analytics Lifecycle is designed for Big Data problems and data
science projects.
• It has six phases, and project work can occur in several phases
simultaneously.
• The cycle is iterative, to reflect how a real project proceeds.
• Work can return to earlier phases as new information is uncovered.
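The iterative nature of the lifecycle described above can be sketched in a few lines of Python. This is only an illustrative model, not part of the lecture material. The slides name only Phase 1 (Discovery) and Phase 6 (Operationalize), so the names for phases 2 to 5 are an assumption taken from the commonly cited six-phase version of the lifecycle. The helpers `next_phases` and `is_valid_path` are hypothetical. The rule that work moves forward one phase at a time is also a simplifying assumption, and work happening in several phases at once is not modelled.

```python
from enum import IntEnum
from typing import List


class Phase(IntEnum):
    DISCOVERY = 1
    DATA_PREPARATION = 2      # assumed name, not given in the slides
    MODEL_PLANNING = 3        # assumed name, not given in the slides
    MODEL_BUILDING = 4        # assumed name, not given in the slides
    COMMUNICATE_RESULTS = 5   # assumed name, not given in the slides
    OPERATIONALIZE = 6


def next_phases(current: Phase) -> List[Phase]:
    """Return the phases work may move to from `current`.

    Simplified rule (an assumption): work may return to any earlier
    phase, or advance to the next phase if one exists.
    """
    earlier = [p for p in Phase if p < current]
    forward = [Phase(current + 1)] if current < Phase.OPERATIONALIZE else []
    return earlier + forward


def is_valid_path(path: List[Phase]) -> bool:
    """Check that every step in `path` follows the rule in next_phases."""
    return all(b in next_phases(a) for a, b in zip(path, path[1:]))


# A project that returns to data preparation after model planning
# uncovers new information, then carries on to the end.
project = [
    Phase.DISCOVERY,
    Phase.DATA_PREPARATION,
    Phase.MODEL_PLANNING,
    Phase.DATA_PREPARATION,
    Phase.MODEL_PLANNING,
    Phase.MODEL_BUILDING,
    Phase.COMMUNICATE_RESULTS,
    Phase.OPERATIONALIZE,
]
print(is_valid_path(project))
```

Under this rule, a path that jumps straight from Discovery to Model Building is rejected, while any return to an earlier phase is allowed.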
Data Analytics Lifecycle
Data Analytics Lifecycle Overview
Phase 1: Discovery
Phase 6: Operationalize