0% found this document useful (0 votes)
51 views

Session1 Foundations of Data Science

The document discusses data science, including its definition, origins, applications, and career opportunities. It provides an overview of the data science workflow and lifecycle. Key points covered include common data science job roles and required skills, with an emphasis on Python for data science. The document is presented as a lecture on data science by Dr. Nagaraju Baydeti from the National Institute of Technology Nagaland.

Uploaded by

t1m0thy
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
51 views

Session1 Foundations of Data Science

The document discusses data science, including its definition, origins, applications, and career opportunities. It provides an overview of the data science workflow and lifecycle. Key points covered include common data science job roles and required skills, with an emphasis on Python for data science. The document is presented as a lecture on data science by Dr. Nagaraju Baydeti from the National Institute of Technology Nagaland.

Uploaded by

t1m0thy
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

Dr.

Nagaraju Baydeti
Department of Computer Science and Engineering
National Institute of Technology Nagaland
1. What is Data Science?
2. Where it started - The Origin of Data Science
3. Why Data Science?
4. The Data Science - Workflow
5. Life Cycle of Data Science
6. Data Science – Application Areas
7. Job Opportunities
8. Anatomy of Data Scientist
9. Wish to be a Data Scientist – Skillset Required
10. How to start? – Data Science using Python
2
• Study of data
• Process of deriving knowledge
• Gain insights from a huge and diverse set of
data
• Organizing, processing and analyzing the data
• Works with Structured and Un-structured
data
• Decision making
• An inter-discipline field
3
• John Wilder Tukey, an American mathematician.
• Developed Fast Fourier Transform algorithm and box plot.
• He introduced the word ‘bit’ while working with John Von Neumann.
• In 1962, he described a field called ‘data analysis’.

• Peter Naur, a Danish computer science pioneer & Turing award winner.
• Designed ALGOL 60 (Algorithmic Language 1960)
• Well known for the contributor of Backus-Naur form (BNF) notation
used in describing the syntax for most programming languages.
• He disliked the term ‘computer science’ and suggested it to be called
as ‘data science’
For more details about the history of Data Science: https://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-science/#6ab0108855cf
4
• Data – the new fuel for
Industries
• Data-driven approach
• According to the U.S. Bureau
of Labor Statistics
– The number of roles for
Data Scientists has grown
by 650% since 2012
– 11.5 million jobs will be
created by 2026
• Data Scientist ranks top
emerging jobs on LinkedIn

5
6
• Define a Problem
• Obtain the Data
• Scrubbing / Cleaning the Data
– Missing Values
– Data out of Range
– Time Zone Differences
• Exploratory Data Analytics
– Compute Descriptive Statistics to Extract
Features and Test Significant Variables
• Data Modeling
– Create an Efficient Method to Store
Information
• Data Visualization

7
8
9
Source: https://data-flair.training/blogs/data-science-applications/ 10
Various Job roles in Data Science domain:
• Data Analyst
• Machine Learning Expert
• Data Engineer
• Data Scientist
• Data Administrator
• Data Architect
• Business Analyst
• Business Intelligence Manager

11
12
13
14
1. https://datascience.berkeley.edu/about/what-is-data-science/
2. https://techterms.com/definition/data_science
3. https://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-science/#6ab0108855cf
4. http://sudeep.co/data-science/Understanding-the-Data-Science-Lifecycle/
5. https://towardsdatascience.com/5-steps-of-a-data-science-project-lifecycle-26c50372b492
6. https://www.javatpoint.com/data-science
7. https://www.mygreatlearning.com/blog/difference-data-science-machine-learning-ai/
8. https://www.dataquest.io/blog/what-is-data-science/
9. https://www.mygreatlearning.com/blog/different-data-science-jobs-roles-industry/
10. https://www.datacamp.com/community/blog/data-science-past-present-future
11. https://www.javatpoint.com/data-science
12. https://www.proschoolonline.com/blog/data-science-skills

15
Email: [email protected]
[email protected]

Contact: +919986502452
16

You might also like