Data Science and Analytics
Data Science and Analytics
• Data science is a field about processes and systems, to extract meaning from
data from various forms, whether it is an unstructured or structured form.
• Data science is the study of data, like biological sciences are the study of
biology; physical sciences, it's the study of physical reactions. Data is real, data
has real properties, and we need to study them, if we need to work on them.
• Data science involves data and some science.
1.3 COMPONENTS OF DATA SCIENCE
• Various sectors use data science to extract the information they need to
create different services and products.
1.7 DATA SCIENCE USE C ASES
1.8 WHAT ARE THE CHALLENGES FACED BY
DATA SCIENTISTS?
• Some of the challenges data scientists face in the real world are:
• Data quality doesn't conform to the set standards.
• Data integration is .a complex task.
• Data is distributed into large clusters in HDFS, which is difficult to
integrate and analyze.
• Unstructured and semi-structured data are harder to analyze