Kaggle Survey Story

Uploaded by

Minh Học

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views2 pages

Kaggle Survey Story

Uploaded by

Minh Học

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Description

The Kaggle team set out to conduct an industry-wide survey that presents a truly
comprehensive view of the state of data science and machine learning. The survey
was live for one week in October, and after cleaning the data we finished with 23,859
responses, a 49% increase over last year!
There's a lot to explore here. The results include raw numbers about who is working
with data, what’s happening with machine learning in different industries, and the
best ways for new data scientists to break into the field. We've published the data in
as raw a format as possible without compromising anonymization, which makes it an
unusual example of a survey dataset.
Tell a data story about a subset of the data science community represented in this
survey, through a combination of both narrative text and data exploration. A “story”
could be defined any number of ways, and that’s deliberate. The challenge is to
deeply explore (through data) the impact, priorities, or concerns of a specific group of
data science and machine learning practitioners. That group can be defined in the
macro (for example: anyone who does most of their coding in Python) or the micro
(for example: female data science students studying machine learning in masters
programs). This is an opportunity to be creative and tell the story of a community you
identify with or are passionate about!

Survey Methodology
● This survey received 23,859 usable respondents from 147 countries and
territories. If a country or territory received less than 50 respondents, we
grouped them into a group named “Other” for anonymity.
● We excluded respondents who were flagged by our survey system as “Spam”.
● Most of our respondents were found primarily through Kaggle channels, like
our email list, discussion forums and social media channels.
● The survey was live from October 22nd to October 29th. We allowed
respondents to complete the survey at any time during that window. The
median response time for those who participated in the survey was 15-20
minutes.
● Not every question was shown to every respondent. You can learn more
about the different segments we used in the schema.csv file.
● To protect the respondents’ identity, the answers to multiple choice questions
have been separated into a separate data file from the open-ended
responses. We do not provide a key to match up the multiple choice and free
form responses. Further, the free form responses have been randomized
column-wise such that the responses that appear on the same row did not
necessarily come from the same survey-taker.

For the questions, see this notebook:

https://www.kaggle.com/paultimothymooney/2018-kaggle-machine-learning-data-
science-survey
Package 1.
1. Create a table, where
a. rows: age groups
b. columns: top 5 countries from where we have the most respondents
c. values: percentage of the respondents from that age group and from
that country
2. Create a Venn-diagram showing the number of respondents using the
following programming languages on a regular basis: Python, R, SQL, Java
3. Create a Sankey diagram, which has 3 levels: education level, gender and job
profession (in this order). The bands are simply created from the frequency
values. You can use plotly for this task.

Package 2.
1. Create a heatmap-style matrix plot, where rows and columns represent age
groups and countries (top 5 in number of respondents), respectively and
elements are colored by the percentage of the respondents.
2. Create a population pyramid about online platforms where the respondents
begun or completed data science courses groupped by whether the
respondent is younger than 30 years or not.
3. Create a map visualization about how many respondents are located in the
different countries. You can use plotly for this task.

Kaggle Book
No ratings yet
Kaggle Book
57 pages
Atm Hacking
100% (1)
Atm Hacking
21 pages
Data Analysis and Machine Learning with Kaggle (2021) - Banachewicz & Massaron
No ratings yet
Data Analysis and Machine Learning with Kaggle (2021) - Banachewicz & Massaron
51 pages
JAVA CODING AND C PROGRAMMING EXAMPLES - PROGRAMMING FOR BEGINNERS (BooksRack - Net)
No ratings yet
JAVA CODING AND C PROGRAMMING EXAMPLES - PROGRAMMING FOR BEGINNERS (BooksRack - Net)
62 pages
Kaggle Machine Learning Projects Ashok Kumar Harnal: FORE School of Management, New Delhi
No ratings yet
Kaggle Machine Learning Projects Ashok Kumar Harnal: FORE School of Management, New Delhi
52 pages
Palo Alto Networks Zero Trust Maturity Model
100% (1)
Palo Alto Networks Zero Trust Maturity Model
2 pages
1st PG Merged Compressed Compressed Merged
No ratings yet
1st PG Merged Compressed Compressed Merged
380 pages
Kaggle Survey 2022 Answer Choices
No ratings yet
Kaggle Survey 2022 Answer Choices
18 pages
Phase 2
No ratings yet
Phase 2
6 pages
Kaggle's State of Data Science and Machine Learning 2019: Enterprise Executive Summary
No ratings yet
Kaggle's State of Data Science and Machine Learning 2019: Enterprise Executive Summary
23 pages
Exploratory Data Analysis Using Python
No ratings yet
Exploratory Data Analysis Using Python
41 pages
Kaggle State of Machine Learning and Data Science Report 2022
No ratings yet
Kaggle State of Machine Learning and Data Science Report 2022
25 pages
Data Science Manual
No ratings yet
Data Science Manual
155 pages
05_kaggle_competition
No ratings yet
05_kaggle_competition
37 pages
Problem Statement
No ratings yet
Problem Statement
6 pages
Kaggle Rtdrghdhdtrdrthfffgdjqerwrwregsfdhxcghgvcand Hackathonvcbcvbfvdfdv
No ratings yet
Kaggle Rtdrghdhdtrdrthfffgdjqerwrwregsfdhxcghgvcand Hackathonvcbcvbfvdfdv
27 pages
Introduction
No ratings yet
Introduction
10 pages
Datasets
No ratings yet
Datasets
5 pages
Girls-Who-Code-At-Home-Data-Playground-1
No ratings yet
Girls-Who-Code-At-Home-Data-Playground-1
29 pages
DBDAL LAB - MANUAL - Final
No ratings yet
DBDAL LAB - MANUAL - Final
93 pages
Kaggle's State of Machine Learning and Data Science 2021
No ratings yet
Kaggle's State of Machine Learning and Data Science 2021
45 pages
CAM Assignment
No ratings yet
CAM Assignment
6 pages
Data Analysis and Machine Learning With Kaggle How To Win Competitions On Kaggle and Build A Successful Career in Data Science 1801817472 9781801817479
No ratings yet
Data Analysis and Machine Learning With Kaggle How To Win Competitions On Kaggle and Build A Successful Career in Data Science 1801817472 9781801817479
48 pages
Kaggle Tutorial 1
No ratings yet
Kaggle Tutorial 1
29 pages
SL-III Lab Manual
No ratings yet
SL-III Lab Manual
74 pages
3.Data (1)
No ratings yet
3.Data (1)
23 pages
Data Science Week 6 Discussion
No ratings yet
Data Science Week 6 Discussion
4 pages
QB for DS - V Sem Students
No ratings yet
QB for DS - V Sem Students
23 pages
Best Portfolio Projects for Data Science _ by Data Scian by Imad Adrees _ Medium
No ratings yet
Best Portfolio Projects for Data Science _ by Data Scian by Imad Adrees _ Medium
17 pages
1-Introduction To Embedded Linux
No ratings yet
1-Introduction To Embedded Linux
46 pages
Win Kaggle Competition Course
No ratings yet
Win Kaggle Competition Course
14 pages
SystemVerilog Assertions and Functional Coverage Guide to Language Methodology and Applications 2nd Edition Ashok B. Mehta (Auth.) all chapter instant download
50% (2)
SystemVerilog Assertions and Functional Coverage Guide to Language Methodology and Applications 2nd Edition Ashok B. Mehta (Auth.) all chapter instant download
65 pages
Untitled document (12)
No ratings yet
Untitled document (12)
23 pages
Agenda NED University
No ratings yet
Agenda NED University
13 pages
Data Science Mcqs- Hamza Zahoor
No ratings yet
Data Science Mcqs- Hamza Zahoor
9 pages
Chapter One Part II
No ratings yet
Chapter One Part II
99 pages
Case Study DSBDA
No ratings yet
Case Study DSBDA
12 pages
Syllabus AIML
No ratings yet
Syllabus AIML
14 pages
Data Science and Machine Learning Project Ideas
100% (2)
Data Science and Machine Learning Project Ideas
20 pages
Chapter 4 Software Architecture
No ratings yet
Chapter 4 Software Architecture
33 pages
FDS - 1 SOLVED
No ratings yet
FDS - 1 SOLVED
17 pages
Mapping Global Data Sets - Json
100% (1)
Mapping Global Data Sets - Json
15 pages
Activity
No ratings yet
Activity
4 pages
Loske Guide
No ratings yet
Loske Guide
96 pages
Top 60 Python Projects For All Levels of Expertise
No ratings yet
Top 60 Python Projects For All Levels of Expertise
9 pages
Project List Data Analytics
No ratings yet
Project List Data Analytics
13 pages
Bda Survey Assignment: Parta - Rollnumbers - Ipynb Parta - Rollnumbers - Ipynb Part A
No ratings yet
Bda Survey Assignment: Parta - Rollnumbers - Ipynb Parta - Rollnumbers - Ipynb Part A
3 pages
Getting-Started - WCFM Documentation
No ratings yet
Getting-Started - WCFM Documentation
15 pages
AUTOMATED LOOP CHECKING FOR INSTRUMENTATION
No ratings yet
AUTOMATED LOOP CHECKING FOR INSTRUMENTATION
13 pages
Chap 04
No ratings yet
Chap 04
48 pages
Data Science QnA
No ratings yet
Data Science QnA
15 pages
Datanest - Data Science Interview
No ratings yet
Datanest - Data Science Interview
19 pages
A Study On Penetration Testing Process and Tools
No ratings yet
A Study On Penetration Testing Process and Tools
7 pages
Vdres
No ratings yet
Vdres
43 pages
data science
No ratings yet
data science
10 pages
An Empirical Study Assessing Software Modeling in Alloy
No ratings yet
An Empirical Study Assessing Software Modeling in Alloy
11 pages
Ex - No:01 Rotate An Image Date
No ratings yet
Ex - No:01 Rotate An Image Date
11 pages
BioMini Plus 2 Manual
No ratings yet
BioMini Plus 2 Manual
11 pages
Database Setup and Management Guide
No ratings yet
Database Setup and Management Guide
34 pages
What Are Stiffness Modifiers in The ETABS Tool
No ratings yet
What Are Stiffness Modifiers in The ETABS Tool
6 pages
Project 3 - Social Media
No ratings yet
Project 3 - Social Media
3 pages
SmartRF Flash Programmer 2 SLA
No ratings yet
SmartRF Flash Programmer 2 SLA
6 pages
PMM-WH-EXE-EE-060 - Installation of Add-On Board For Solar Power of Warehouse Building Approved
No ratings yet
PMM-WH-EXE-EE-060 - Installation of Add-On Board For Solar Power of Warehouse Building Approved
6 pages
(Datasheet) HUAWEI IdeaHub B2
No ratings yet
(Datasheet) HUAWEI IdeaHub B2
10 pages
Datascience
No ratings yet
Datascience
8 pages
DATA SCIENCE SAMPLE
No ratings yet
DATA SCIENCE SAMPLE
5 pages
Python Machine Learning For Beginners Learning From Scratch Numpy Pandas Matplotlib Seaborn SKle
100% (1)
Python Machine Learning For Beginners Learning From Scratch Numpy Pandas Matplotlib Seaborn SKle
277 pages
Home Assignment Dataliteracy
No ratings yet
Home Assignment Dataliteracy
4 pages
GOT1000 - GT15 Document Display Function - Technical
No ratings yet
GOT1000 - GT15 Document Display Function - Technical
11 pages
Mpi Lab1
No ratings yet
Mpi Lab1
6 pages
DBATU Student User Manual
No ratings yet
DBATU Student User Manual
5 pages
Kaggle: Your Machine Learning and Data Science Community
No ratings yet
Kaggle: Your Machine Learning and Data Science Community
7 pages
Chapter 1 and 2
No ratings yet
Chapter 1 and 2
11 pages
DXC - MF - SD - 2267395 - S4TWL - Miscellaneous Minor Functionalities in SD Area
No ratings yet
DXC - MF - SD - 2267395 - S4TWL - Miscellaneous Minor Functionalities in SD Area
4 pages
COEN Requirement 2023-24
No ratings yet
COEN Requirement 2023-24
1 page
Homo Ludens in the Loop: Playful Human Computation Systems
From Everand
Homo Ludens in the Loop: Playful Human Computation Systems
Markus Krause
No ratings yet
Human-Centered Data Science: An Introduction
From Everand
Human-Centered Data Science: An Introduction
Cecilia Aragon
No ratings yet
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
Deep Learning
From Everand
Deep Learning
John D. Kelleher
3.5/5 (7)
The Left Hand of Data: Designing Education Data for Justice
From Everand
The Left Hand of Data: Designing Education Data for Justice
Matthew Berland
No ratings yet
Cracking the New York City SHSAT (Specialized High Schools Admissions Test), 3rd Edition: Fully Updated for the New Exam
From Everand
Cracking the New York City SHSAT (Specialized High Schools Admissions Test), 3rd Edition: Fully Updated for the New Exam
The Princeton Review
No ratings yet
Mastering Categorical Data Analysis
From Everand
Mastering Categorical Data Analysis
Pasquale De Marco
No ratings yet
The power of AI and ML to transform Social Science Research
From Everand
The power of AI and ML to transform Social Science Research
Zemelak Goraga
No ratings yet
Data Science Projects for thesis and Portfolio: Solving Political Problems
From Everand
Data Science Projects for thesis and Portfolio: Solving Political Problems
Dr. Zemelak Goraga
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
The Psychology of Survey Data: Revealing the Secrets of Human Responses
From Everand
The Psychology of Survey Data: Revealing the Secrets of Human Responses
Pasquale De Marco
No ratings yet
Technology & Globalization Gr. 5-8
From Everand
Technology & Globalization Gr. 5-8
Erika Gasper
No ratings yet
Participatory Action Research for Evidence-driven Community Development
From Everand
Participatory Action Research for Evidence-driven Community Development
AK Azad
No ratings yet
Time Series Forecasting using Deep Learning: Combining PyTorch, RNN, TCN, and Deep Neural Network Models to Provide Production-Ready Prediction Solutions
From Everand
Time Series Forecasting using Deep Learning: Combining PyTorch, RNN, TCN, and Deep Neural Network Models to Provide Production-Ready Prediction Solutions
Ivan Gridin
No ratings yet
Question Answering: Fundamentals and Applications
From Everand
Question Answering: Fundamentals and Applications
Fouad Sabry
No ratings yet

Kaggle Survey Story

Uploaded by

Kaggle Survey Story

Uploaded by

Description

For the questions, see this notebook:

You might also like