SlideShare a Scribd company logo
2
Most read
17
Most read
18
Most read
A GENTLE INTRO TO
DATA SCIENCE & AI
Computer Science HotTopics
• Mid 1980s – 1990s: desktop applications
• Networking, graphics & graphical user interfaces (GUIs), some AI / ML
• Mid 1990s – 2006: websites & web applications
• 2007 – 2014: mobile apps
• 2012 – 2017: data science
• Maybe some virtual reality (VR) and augmented reality (AR)
• 2016 – current: artificial intelligence (AI) & machine learning (ML)
• 2017 – early 2018: Bitcoin! (Crypto-currencies)
• IMHO, passing fad & pure speculation
DATA SCIENCE
Processing data has gotten better in the past decade because of:
(1) More data (2) Better use of statistics & other fields in CS (3) Faster &
more specialized hardware (4) Distributed networks & computing (5)
Contributions (papers & software) by Google, Facebook, etc
Rise of Data Science
• 1970s – 2000: Data in expensive databases
• “Small” data: millions of data points = large
• Programmers write code to process data
• Jobs: software engineers & database
administrators
• Expensive, took a long time to run, limited to
companies with expertise and resources
• 2006 – current: Data in a variety of places
• “Big data”: billions of data points. Per day.
• {Various job titles} write code to process data
• Jobs: data analysts, data scientists, data
engineers (software engineers, DB admins)
• Variety of inexpensive (and expensive), user-
friendly systems to analyze data
Small, Medium, Big Data
SMALL DATA
• 100,000s to
millions of records
• Can be handled by
databases
MEDIUM DATA
• Millions to 10s / 100s of
millions of records
• Databases start to creak
• Mix of DBs & Big Data
BIG DATA
• Billions or 10s of billions of
records. Per day.
• Big Data tools.
Job trends in Data Science
Indeed.com postings, Sep 2017 Quora, Dec 2017
ARTIFICIAL
INTELLIGENCE (AI)
How to get computers to think and learn like humans.
This field has been there since the early days of computing, but has sped
up over past few years due to better hardware and data processing.
AI Evolution Over theYears
https://www.slideshare.net/yanaioron1/vertex-perspectives-artificial-intelligence
AI, Machine Learning & Deep Learning
https://www.slideshare.net/yanaioron1/vertex-perspectives-artificial-intelligence
Machine Learning (ML)
• Limited AI; machine learns from existing
data to give (hopefully) correct response
• Build model that outputs correct
information given training on input data
• Model often built by a human and computer
is “trained” using existing data
ML Fields: Supervised, Unsupervised &
Reinforcement Learning
• Supervised Learning: develop model to
predict output based on existing input-output
(done by human)
• Classification: Is this a cat or not?
• Regression:Will user click on this ad?
• Unsupervised Learning: group & interpret
data based on input only
• Clustering: Identify patterns that are not
obviously visible.
• Reinforcement Learning: actions to
maximize “rewards”
• Recommendations on shopping websites
(Amazon), videos (Netflix)
• Computer vs human gaming. Chess (IBM
Watson). Go (Google’s AlphaGo).
Deep Learning
• ML requires
human input to
train and classify
• Deep Learning
uses multiple
levels of CNNs
(Convolutional
Neural Networks)
to learn by itself
• But: setting up &
programming
deep learning
neural network is
hard.
https://medium.com/swlh/ill-tell-you-why-deep-learning-is-so-popular-and-in-demand-5aca72628780
Deep Learning: Face Recognition
https://cdn.edureka.co/blog/wp-content/uploads/2017/05/Deep-Neural-Network-What-is-Deep-Learning-Edureka.png
CLOUD COMPUTING
Rent large computer “farms” by hourly use without having to pay upfront.
(Also) use online tools and services without installation.
Cloud makes it less expensive to build large computing tools and use
online services & platforms.
Cloud Computing
• Run computer services on a “cloud”
• Remote location run by service provider
• IaaS = Infrastructure as a service
• “Rent” computers, storage, networking.
Install your own software.
• PaaS = Platform as a service
• Higher level services, such as databases, web
servers, etc.Web hosting.
• SaaS = Software as a service
• Run software from the cloud.Websites,
online applications, e-mail, IM / messaging,
social media.
Cloud Computing -> Data Science
• “Rent” computing servers instead of buying outright
• Zero setup time; no setting up hardware
• “… building a 50,000 core cluster could easily cost $20
million to $30 million, he said.The Schrödinger project, by
contrast, cost about $4,850 per hour to run.”
• GigaOm: Cycle Computing spins up 50K core Amazon
cluster
• “By leveraging Cycle Computing software and AWS Cloud
infrastructure, Novartis was able to accomplish the same
work faster, and for far less money.
• $44 Million in infrastructure; 10 Million compounds
screened; 39 drug design years in 11 hours for a cost of
$4,232; 3 compounds identified for future work”
• Chef: Novartis Conducts 39Yrs of Computing in 11 Hours
w/Cycle Computing and Chef
Data Science <->AI / ML
https://twitter.com/ipfconline1/status/887043009568788480
Conclusion & QA
• Touched upon 3 “trending” areas in Computer Science today
• Data Science: the merging of CS and other math fields has allowed for us to
process data better and get more meaningful insight into vast volumes of data.
• Artificial Intelligence: an old field has been recently invigorated by advances in
data processing and large / fast computer networks
• Cloud Computing: reduces cost and setup / startup time in using large computing
resources or online services. Get faster to solving the business problem.

More Related Content

What's hot (20)

PDF
Introduction To Artificial Intelligence Powerpoint Presentation Slides
SlideTeam
 
PPT
Machine Learning
Vivek Garg
 
PPTX
Machine learning
Saurabh Agrawal
 
PPTX
Machine Learning
Rabab Munawar
 
PPTX
Introduction to data science.pptx
SadhanaParameswaran
 
PPTX
introduction to data science
bhavesh lande
 
PDF
Credit card fraud detection through machine learning
dataalcott
 
PPT
Machine learning
Rajib Kumar De
 
PPTX
Introduction of Data Science
Jason Geng
 
PDF
Introduction to Data Science and Analytics
Srinath Perera
 
PPTX
Introduction to data science
Sampath Kumar
 
PPTX
Machine learning ppt
Rajat Sharma
 
PPTX
Data science life cycle
Manoj Mishra
 
PPTX
Machine Learning
Darshan Ambhaikar
 
PPTX
Introduction to Machine Learning
Lior Rokach
 
PDF
AI and Data Science.pdf
MohammadMuzammilAnsa2
 
PDF
Introduction to Artificial Intelligence and few examples
BMS Institute of Technology and Management
 
PPTX
Introduction To Machine Learning
Knoldus Inc.
 
PPTX
Data science
SwapnilDahake2
 
PPTX
Data science
SouravSadhukhan6
 
Introduction To Artificial Intelligence Powerpoint Presentation Slides
SlideTeam
 
Machine Learning
Vivek Garg
 
Machine learning
Saurabh Agrawal
 
Machine Learning
Rabab Munawar
 
Introduction to data science.pptx
SadhanaParameswaran
 
introduction to data science
bhavesh lande
 
Credit card fraud detection through machine learning
dataalcott
 
Machine learning
Rajib Kumar De
 
Introduction of Data Science
Jason Geng
 
Introduction to Data Science and Analytics
Srinath Perera
 
Introduction to data science
Sampath Kumar
 
Machine learning ppt
Rajat Sharma
 
Data science life cycle
Manoj Mishra
 
Machine Learning
Darshan Ambhaikar
 
Introduction to Machine Learning
Lior Rokach
 
AI and Data Science.pdf
MohammadMuzammilAnsa2
 
Introduction to Artificial Intelligence and few examples
BMS Institute of Technology and Management
 
Introduction To Machine Learning
Knoldus Inc.
 
Data science
SwapnilDahake2
 
Data science
SouravSadhukhan6
 

Similar to Data science and Artificial Intelligence (20)

PDF
Data Science - NXT Level_Dr.Arun.pdf
Dr. G. Arun Sampaul Thomas
 
PPTX
2020 03-spine summit
Irfan Essa
 
PPTX
Alexander Sokolov “How Data Science and Big Data are changing the World”
Dakiry
 
PPTX
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...
DataScienceConferenc1
 
PDF
10 Things Every Entrepreneur Needs to Know About Artificial Intelligence
Christopher Mohritz
 
PDF
Big Data & Artificial Intelligence
Zavain Dar
 
PDF
Data Science for Beginner by Chetan Khatri and Deptt. of Computer Science, Ka...
Chetan Khatri
 
PPTX
BI, AI/ML, Use Cases, Business Impact and how to get started
Karthick S
 
PDF
10 Things Every Entrepreneur Needs to Know About Artificial Intelligence
10x Nation
 
PDF
nVidia Presentation in OpenPOWER workshop Brazil
Ganesan Narayanasamy
 
PDF
SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...
NUS-ISS
 
PDF
2018 learning approach-digitaltrends
Abhilash Gopalakrishnan
 
PDF
Data Science versus Artificial Intelligence: a useful distinction
Christoforos Anagnostopoulos
 
PDF
Machine Learning & AI - 2022 intro for pre-college students.pdf
Ed Fernandez
 
PDF
Defining a Practical Path to Artificial Intelligence
Roman Chanclor
 
PDF
Top 10 Trends to Watch for In Data Science.pdf
Edtech Learning
 
PDF
Artificial Intelligence-Machine Learning Explained.pdf
DexDio
 
PDF
The Truth About Artificial Intelligence
Jon Whittle
 
PDF
DataScience_introduction.pdf
SouravBiswas747273
 
PDF
[Srijan Wednesday Webinars] Artificial Intelligence & the Future of Business
Srijan Technologies
 
Data Science - NXT Level_Dr.Arun.pdf
Dr. G. Arun Sampaul Thomas
 
2020 03-spine summit
Irfan Essa
 
Alexander Sokolov “How Data Science and Big Data are changing the World”
Dakiry
 
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...
DataScienceConferenc1
 
10 Things Every Entrepreneur Needs to Know About Artificial Intelligence
Christopher Mohritz
 
Big Data & Artificial Intelligence
Zavain Dar
 
Data Science for Beginner by Chetan Khatri and Deptt. of Computer Science, Ka...
Chetan Khatri
 
BI, AI/ML, Use Cases, Business Impact and how to get started
Karthick S
 
10 Things Every Entrepreneur Needs to Know About Artificial Intelligence
10x Nation
 
nVidia Presentation in OpenPOWER workshop Brazil
Ganesan Narayanasamy
 
SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...
NUS-ISS
 
2018 learning approach-digitaltrends
Abhilash Gopalakrishnan
 
Data Science versus Artificial Intelligence: a useful distinction
Christoforos Anagnostopoulos
 
Machine Learning & AI - 2022 intro for pre-college students.pdf
Ed Fernandez
 
Defining a Practical Path to Artificial Intelligence
Roman Chanclor
 
Top 10 Trends to Watch for In Data Science.pdf
Edtech Learning
 
Artificial Intelligence-Machine Learning Explained.pdf
DexDio
 
The Truth About Artificial Intelligence
Jon Whittle
 
DataScience_introduction.pdf
SouravBiswas747273
 
[Srijan Wednesday Webinars] Artificial Intelligence & the Future of Business
Srijan Technologies
 
Ad

More from Suman Srinivasan (10)

PPTX
PHP, LAMP Stack & WordPress
Suman Srinivasan
 
PDF
My PhD thesis defense presentation
Suman Srinivasan
 
PDF
My PhD Thesis
Suman Srinivasan
 
PPT
Real-Time Video Analytics Using Hadoop and HBase (HBaseCon 2013)
Suman Srinivasan
 
PPTX
OSGi summary
Suman Srinivasan
 
PPTX
ActiveCDN on NetServ
Suman Srinivasan
 
PPT
Suman's PhD Candidacy Talk
Suman Srinivasan
 
PPT
7DS Version 1
Suman Srinivasan
 
PPT
BonAHA framework - Lab presentation
Suman Srinivasan
 
PPT
BonAHA framework - IEEE CCNC 2009
Suman Srinivasan
 
PHP, LAMP Stack & WordPress
Suman Srinivasan
 
My PhD thesis defense presentation
Suman Srinivasan
 
My PhD Thesis
Suman Srinivasan
 
Real-Time Video Analytics Using Hadoop and HBase (HBaseCon 2013)
Suman Srinivasan
 
OSGi summary
Suman Srinivasan
 
ActiveCDN on NetServ
Suman Srinivasan
 
Suman's PhD Candidacy Talk
Suman Srinivasan
 
7DS Version 1
Suman Srinivasan
 
BonAHA framework - Lab presentation
Suman Srinivasan
 
BonAHA framework - IEEE CCNC 2009
Suman Srinivasan
 
Ad

Recently uploaded (20)

PPTX
05_Jelle Baats_Tekst.pptx_AI_Barometer_Release_Event
FinTech Belgium
 
PPTX
thid ppt defines the ich guridlens and gives the information about the ICH gu...
shaistabegum14
 
PDF
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
PPTX
BinarySearchTree in datastructures in detail
kichokuttu
 
PDF
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
PPTX
办理学历认证InformaticsLetter新加坡英华美学院毕业证书,Informatics成绩单
Taqyea
 
PPTX
Krezentios memories in college data.pptx
notknown9
 
PDF
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
PPTX
Data anlytics Hospitals Research India.pptx
SayantanChakravorty2
 
PPTX
03_Ariane BERCKMOES_Ethias.pptx_AIBarometer_release_event
FinTech Belgium
 
PDF
UNISE-Operation-Procedure-InDHIS2trainng
ahmedabduselam23
 
PPTX
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
PDF
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
PDF
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
PPTX
Comparative Study of ML Techniques for RealTime Credit Card Fraud Detection S...
Debolina Ghosh
 
PDF
GOOGLE ADS (1).pdf THE ULTIMATE GUIDE TO
kushalkeshwanisou
 
PDF
IT GOVERNANCE 4-2 - Information System Security (1).pdf
mdirfanuddin1322
 
PDF
A GraphRAG approach for Energy Efficiency Q&A
Marco Brambilla
 
PPTX
01_Nico Vincent_Sailpeak.pptx_AI_Barometer_2025
FinTech Belgium
 
PPTX
美国史蒂文斯理工学院毕业证书{SIT学费发票SIT录取通知书}哪里购买
Taqyea
 
05_Jelle Baats_Tekst.pptx_AI_Barometer_Release_Event
FinTech Belgium
 
thid ppt defines the ich guridlens and gives the information about the ICH gu...
shaistabegum14
 
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
BinarySearchTree in datastructures in detail
kichokuttu
 
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
办理学历认证InformaticsLetter新加坡英华美学院毕业证书,Informatics成绩单
Taqyea
 
Krezentios memories in college data.pptx
notknown9
 
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
Data anlytics Hospitals Research India.pptx
SayantanChakravorty2
 
03_Ariane BERCKMOES_Ethias.pptx_AIBarometer_release_event
FinTech Belgium
 
UNISE-Operation-Procedure-InDHIS2trainng
ahmedabduselam23
 
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
Comparative Study of ML Techniques for RealTime Credit Card Fraud Detection S...
Debolina Ghosh
 
GOOGLE ADS (1).pdf THE ULTIMATE GUIDE TO
kushalkeshwanisou
 
IT GOVERNANCE 4-2 - Information System Security (1).pdf
mdirfanuddin1322
 
A GraphRAG approach for Energy Efficiency Q&A
Marco Brambilla
 
01_Nico Vincent_Sailpeak.pptx_AI_Barometer_2025
FinTech Belgium
 
美国史蒂文斯理工学院毕业证书{SIT学费发票SIT录取通知书}哪里购买
Taqyea
 

Data science and Artificial Intelligence

  • 1. A GENTLE INTRO TO DATA SCIENCE & AI
  • 2. Computer Science HotTopics • Mid 1980s – 1990s: desktop applications • Networking, graphics & graphical user interfaces (GUIs), some AI / ML • Mid 1990s – 2006: websites & web applications • 2007 – 2014: mobile apps • 2012 – 2017: data science • Maybe some virtual reality (VR) and augmented reality (AR) • 2016 – current: artificial intelligence (AI) & machine learning (ML) • 2017 – early 2018: Bitcoin! (Crypto-currencies) • IMHO, passing fad & pure speculation
  • 3. DATA SCIENCE Processing data has gotten better in the past decade because of: (1) More data (2) Better use of statistics & other fields in CS (3) Faster & more specialized hardware (4) Distributed networks & computing (5) Contributions (papers & software) by Google, Facebook, etc
  • 4. Rise of Data Science • 1970s – 2000: Data in expensive databases • “Small” data: millions of data points = large • Programmers write code to process data • Jobs: software engineers & database administrators • Expensive, took a long time to run, limited to companies with expertise and resources • 2006 – current: Data in a variety of places • “Big data”: billions of data points. Per day. • {Various job titles} write code to process data • Jobs: data analysts, data scientists, data engineers (software engineers, DB admins) • Variety of inexpensive (and expensive), user- friendly systems to analyze data
  • 5. Small, Medium, Big Data SMALL DATA • 100,000s to millions of records • Can be handled by databases MEDIUM DATA • Millions to 10s / 100s of millions of records • Databases start to creak • Mix of DBs & Big Data BIG DATA • Billions or 10s of billions of records. Per day. • Big Data tools.
  • 6. Job trends in Data Science Indeed.com postings, Sep 2017 Quora, Dec 2017
  • 7. ARTIFICIAL INTELLIGENCE (AI) How to get computers to think and learn like humans. This field has been there since the early days of computing, but has sped up over past few years due to better hardware and data processing.
  • 8. AI Evolution Over theYears https://www.slideshare.net/yanaioron1/vertex-perspectives-artificial-intelligence
  • 9. AI, Machine Learning & Deep Learning https://www.slideshare.net/yanaioron1/vertex-perspectives-artificial-intelligence
  • 10. Machine Learning (ML) • Limited AI; machine learns from existing data to give (hopefully) correct response • Build model that outputs correct information given training on input data • Model often built by a human and computer is “trained” using existing data
  • 11. ML Fields: Supervised, Unsupervised & Reinforcement Learning • Supervised Learning: develop model to predict output based on existing input-output (done by human) • Classification: Is this a cat or not? • Regression:Will user click on this ad? • Unsupervised Learning: group & interpret data based on input only • Clustering: Identify patterns that are not obviously visible. • Reinforcement Learning: actions to maximize “rewards” • Recommendations on shopping websites (Amazon), videos (Netflix) • Computer vs human gaming. Chess (IBM Watson). Go (Google’s AlphaGo).
  • 12. Deep Learning • ML requires human input to train and classify • Deep Learning uses multiple levels of CNNs (Convolutional Neural Networks) to learn by itself • But: setting up & programming deep learning neural network is hard. https://medium.com/swlh/ill-tell-you-why-deep-learning-is-so-popular-and-in-demand-5aca72628780
  • 13. Deep Learning: Face Recognition https://cdn.edureka.co/blog/wp-content/uploads/2017/05/Deep-Neural-Network-What-is-Deep-Learning-Edureka.png
  • 14. CLOUD COMPUTING Rent large computer “farms” by hourly use without having to pay upfront. (Also) use online tools and services without installation. Cloud makes it less expensive to build large computing tools and use online services & platforms.
  • 15. Cloud Computing • Run computer services on a “cloud” • Remote location run by service provider • IaaS = Infrastructure as a service • “Rent” computers, storage, networking. Install your own software. • PaaS = Platform as a service • Higher level services, such as databases, web servers, etc.Web hosting. • SaaS = Software as a service • Run software from the cloud.Websites, online applications, e-mail, IM / messaging, social media.
  • 16. Cloud Computing -> Data Science • “Rent” computing servers instead of buying outright • Zero setup time; no setting up hardware • “… building a 50,000 core cluster could easily cost $20 million to $30 million, he said.The Schrödinger project, by contrast, cost about $4,850 per hour to run.” • GigaOm: Cycle Computing spins up 50K core Amazon cluster • “By leveraging Cycle Computing software and AWS Cloud infrastructure, Novartis was able to accomplish the same work faster, and for far less money. • $44 Million in infrastructure; 10 Million compounds screened; 39 drug design years in 11 hours for a cost of $4,232; 3 compounds identified for future work” • Chef: Novartis Conducts 39Yrs of Computing in 11 Hours w/Cycle Computing and Chef
  • 17. Data Science <->AI / ML https://twitter.com/ipfconline1/status/887043009568788480
  • 18. Conclusion & QA • Touched upon 3 “trending” areas in Computer Science today • Data Science: the merging of CS and other math fields has allowed for us to process data better and get more meaningful insight into vast volumes of data. • Artificial Intelligence: an old field has been recently invigorated by advances in data processing and large / fast computer networks • Cloud Computing: reduces cost and setup / startup time in using large computing resources or online services. Get faster to solving the business problem.