Lect1
CSE, PEC
About the course
◻ Why Python?
Data All Around
Lots of data is being collected and warehoused
Web data, e-commerce
Financial transactions, bank/credit transactions
Online trading and purchasing
Social Network
Limitations of the File-Based Approach
Web scraping
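As a minimal sketch of what web scraping involves, the example below pulls link targets and link text out of an HTML string using only Python's standard library. The page content, the URLs in it, and the names `SAMPLE_HTML`, `LinkCollector`, and `extract_links` are hypothetical; a real scraper would first download the page over HTTP and would need to respect the site's terms of use.

```python
from html.parser import HTMLParser

# Hypothetical page content; a real scraper would download this over HTTP.
SAMPLE_HTML = """
<html><body>
  <h1>Datasets</h1>
  <a href="https://example.com/a.csv">Dataset A</a>
  <a href="https://example.com/b.csv">Dataset B</a>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collects (href, link text) pairs from <a> tags that carry an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open link.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Return a list of (href, text) tuples found in the HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


if __name__ == "__main__":
    for href, text in extract_links(SAMPLE_HTML):
        print(text, "->", href)
```

The standard-library parser is used here only to keep the sketch dependency-free; third-party parsing libraries are a common choice in practice.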
Secondary data
GitHub
Kaggle
KDnuggets
UCI Machine Learning Repository
US Government’s Open Data
FiveThirtyEight
Amazon Web Services
BuzzFeed
Data is Plural
Harvard HCI
Application Programming Interface (API)
HTTP request/response cycle
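The request/response cycle behind an API call can be sketched with the standard library alone. The example starts a throwaway server on the local machine, sends it one GET request, and reads back the status code, a header, and a JSON body. The handler `DemoAPIHandler`, the function `fetch_once`, the `/datasets` path, and the payload are all assumptions made for illustration; they do not correspond to any real service.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.request import ProxyHandler, build_opener


class DemoAPIHandler(BaseHTTPRequestHandler):
    """Hypothetical API: answers every GET with a small JSON document."""

    def do_GET(self):
        body = json.dumps({"path": self.path, "message": "hello"}).encode("utf-8")
        self.send_response(200)                      # status line
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()                           # blank line ends the headers
        self.wfile.write(body)                       # response body

    def log_message(self, format, *args):
        pass  # keep the demo quiet


def fetch_once(path="/datasets"):
    """Run one full request/response cycle against a local server."""
    server = HTTPServer(("127.0.0.1", 0), DemoAPIHandler)  # port 0: pick a free port
    port = server.server_address[1]
    threading.Thread(target=server.serve_forever, daemon=True).start()
    opener = build_opener(ProxyHandler({}))  # bypass any configured proxy
    try:
        # Client side: send the request, then read the response.
        with opener.open(f"http://127.0.0.1:{port}{path}", timeout=5) as response:
            status = response.status
            content_type = response.headers.get("Content-Type")
            payload = json.loads(response.read().decode("utf-8"))
    finally:
        server.shutdown()
        server.server_close()
    return status, content_type, payload


if __name__ == "__main__":
    print(fetch_once())
```

Calling a real API follows the same pattern on the client side: build a URL, send the request, check the status code, and parse the body.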
Types of Secondary data
• Administrative and Monitoring Data
• Illogical Values
• Typos
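Illogical values and typos can often be flagged with simple checks. The sketch below assumes a tiny, made-up set of records, a plausible age range of 0 to 120, and a hypothetical list of valid city names; none of these come from the lecture. Out-of-range ages are reported, and misspelled cities are matched to their closest valid spelling with `difflib.get_close_matches`.

```python
from difflib import get_close_matches

# Hypothetical reference list and records, invented for illustration.
VALID_CITIES = ["Chandigarh", "Delhi", "Mumbai"]

RECORDS = [
    {"name": "A", "age": 34, "city": "Delhi"},
    {"name": "B", "age": -5, "city": "Mumbai"},       # illogical: negative age
    {"name": "C", "age": 29, "city": "Chandigrah"},   # typo in city name
    {"name": "D", "age": 240, "city": "Delhi"},       # illogical: implausible age
]


def find_illogical_ages(rows, low=0, high=120):
    """Return the names of rows whose age falls outside [low, high]."""
    return [r["name"] for r in rows if not (low <= r["age"] <= high)]


def suggest_typo_fixes(rows, valid_values, cutoff=0.8):
    """Map each unrecognised city to its closest valid spelling, if any."""
    fixes = {}
    for r in rows:
        if r["city"] not in valid_values:
            match = get_close_matches(r["city"], valid_values, n=1, cutoff=cutoff)
            if match:
                fixes[r["city"]] = match[0]
    return fixes


if __name__ == "__main__":
    print(find_illogical_ages(RECORDS))
    print(suggest_typo_fixes(RECORDS, VALID_CITIES))
```

Suggested fixes are only candidates; whether to correct, drop, or keep a flagged record is a judgment that depends on the dataset.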