Googling for Software Development:
What Developers Search For and
What They Find
MSR 2021
Andre Hora
Developers often search for
software resources on the web
They may spend ~20% of
their time on the web
Code examples
Novel technologies
Bug-fixes
Documentation
etc.
Stack Overflow: 50M users/month
W3Schools: 2.5B page views/year
Over 85% of their traffic comes from web search engines [alexa.com]
What do developers search for and what do they find?
Search Queries: Understand real-world search queries and developers’ needs
Search Results: Detect where search engines find software resources and explore the results
Study Design
(1) Selecting the Websites
1. stackoverflow.com
2. w3schools.com
3. geeksforgeeks.org
4. tutorialspoint.com
5. programcreek.com
(2) Collecting the Search Queries
1.3M distinct queries
(3) What Developers Search For
• RQ1: Query content
• RQ2: Query size & keywords
• RQ3: Query structure
• RQ4: Query similarity
(4) What Developers Find
• RQ5: Result resources
• RQ6: Result variation
Search API
Results
RQ1: Query Content
What is the content of the search queries?
Developers’ queries typically provide references to
technologies, such as programming languages (30%),
software technologies (24.5%), and web frameworks (5%)
RQ2: Query Size & Keywords
What is the size of the search queries? Where are the keywords located?
[Figure: Size (3) and Keyword position (49.2, 65.2, 48.7)]
Developers’ queries are short: 3 words at the median. Keywords are more likely to be the first than the last word in the query.
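As a rough illustration of the two measurements above, the sketch below computes the median query length in words and counts how often a keyword is the first versus the last word of a query. The sample queries and the `KEYWORDS` set are made up for illustration and are not data from the study.

```python
from statistics import median

# Hypothetical keyword set and sample queries (not from the study's dataset).
KEYWORDS = {"python", "java", "css"}
QUERIES = [
    "python email parser",
    "java sort list",
    "css center div",
    "read file python",
    "java string to int example",
]


def query_size(query: str) -> int:
    """Number of whitespace-separated words in the query."""
    return len(query.split())


def keyword_positions(queries, keywords):
    """Count the queries whose first word / last word is a keyword."""
    first = last = 0
    for q in queries:
        words = q.lower().split()
        if not words:
            continue
        first += words[0] in keywords
        last += words[-1] in keywords
    return first, last


median_size = median(query_size(q) for q in QUERIES)
first, last = keyword_positions(QUERIES, KEYWORDS)
print(f"median size: {median_size} words; keyword first: {first}, last: {last}")
```

On a real query log, the keyword set would have to come from a curated list of technology names; the small set here only keeps the example short.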
RQ3: Query Structure
As in general web search, developers also tend to exclude function words in their queries, which are mostly composed of content words (80.3%).
RQ4: Query Similarity
Most of the developers’ queries are similar to one another: while 40% have no similar counterpart, 60% have at least one similar peer, and 8% have 10 or more similar ones.
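The two measurements above can be sketched as follows: the share of content words among all query words, and the number of similar peers each query has. This is only a minimal sketch under stated assumptions. The `FUNCTION_WORDS` list is a tiny illustrative stop list, and "similar" is taken to mean "same words in any order", which may differ from the definition used in the study.

```python
from collections import defaultdict

# Small illustrative stop list; a real analysis would use a fuller one.
FUNCTION_WORDS = {"a", "an", "the", "to", "in", "of", "for", "how", "is", "on", "with"}


def content_word_share(queries):
    """Fraction of all query words that are not function words."""
    words = [w for q in queries for w in q.lower().split()]
    if not words:
        return 0.0
    content = [w for w in words if w not in FUNCTION_WORDS]
    return len(content) / len(words)


def similar_peer_counts(queries):
    """Map each query to the number of other queries with the same word set.

    Assumption: "similar" means "same words in any order", so only word
    swaps are detected; the study's own definition may be broader.
    """
    groups = defaultdict(list)
    for q in queries:
        groups[frozenset(q.lower().split())].append(q)
    return {q: len(group) - 1 for group in groups.values() for q in group}


# Hypothetical sample queries (distinct, as in a deduplicated query log).
queries = [
    "python email parser",
    "email parser python",
    "how to parse email in python",
    "java sort list",
]
share = content_word_share(queries)
peers = similar_peer_counts(queries)
print(f"content words: {share:.1%}")
print(peers)
```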
RQ5: Result Resources
Where does Google find software resources?
Google finds software resources mostly on Stack Overflow (11%), YouTube (6%), and W3Schools (5%). However, the results may vary according to the type of query (keyword or general).
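One way to obtain per-site shares like those above is to group result URLs by host name. The sketch below shows the idea on a handful of placeholder URLs; the URLs and the helper names `site_of` and `site_shares` are hypothetical and not taken from the study.

```python
from collections import Counter
from urllib.parse import urlparse

# Placeholder result URLs standing in for collected search results.
RESULT_URLS = [
    "https://stackoverflow.com/questions/0000/example",
    "https://www.youtube.com/watch?v=example",
    "https://www.w3schools.com/python/example.asp",
    "https://stackoverflow.com/questions/1111/example",
]


def site_of(url: str) -> str:
    """Return the host name of a URL without a leading 'www.'."""
    host = urlparse(url).netloc.lower()
    return host[4:] if host.startswith("www.") else host


def site_shares(urls):
    """Share of results per site, most frequent site first."""
    counts = Counter(site_of(u) for u in urls)
    total = sum(counts.values())
    return [(site, n / total) for site, n in counts.most_common()]


shares = site_shares(RESULT_URLS)
for site, share in shares:
    print(f"{site}: {share:.0%}")
```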
RQ6: Result Variation
How do Google results vary for similar queries?
Word swap
Query 1: python email parser
Query 2: email parser python
Word removal
Query 1: python email parser
Query 2: email parser
Synonymous word
Query 1: python email parser
Query 2: py email parser
The links and order of the top 10 Google search results are very likely to change between similar queries, whereas the top 1 result is much less affected. Overall, however, the intersection of links between similar queries is high: at least 70% in most cases.
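The comparison described above can be sketched in a few lines: derive a similar query from an original one, then compare the two result lists by top 1 match and by link intersection. This is a minimal sketch, not the study's implementation. The variant helpers simply reproduce the slide examples, and the result lists are placeholders, not real search results.

```python
def word_swap(query: str) -> str:
    """Move the first word to the end of the query."""
    words = query.split()
    return " ".join(words[1:] + words[:1])


def word_removal(query: str, word: str) -> str:
    """Drop every occurrence of one word from the query."""
    return " ".join(w for w in query.split() if w != word)


def synonymous_word(query: str, word: str, replacement: str) -> str:
    """Replace one word with a synonym or abbreviation."""
    return " ".join(replacement if w == word else w for w in query.split())


def link_intersection(results_a, results_b) -> float:
    """Share of distinct links in results_a that also appear in results_b."""
    links_a = set(results_a)
    if not links_a:
        return 0.0
    return len(links_a & set(results_b)) / len(links_a)


def same_top1(results_a, results_b) -> bool:
    """True when both result lists start with the same link."""
    return bool(results_a) and bool(results_b) and results_a[0] == results_b[0]


query = "python email parser"
variants = [
    word_swap(query),
    word_removal(query, "python"),
    synonymous_word(query, "python", "py"),
]

# Placeholder result lists for the original query and one variant.
results_q1 = ["link-1", "link-2", "link-3", "link-4", "link-5"]
results_q2 = ["link-1", "link-3", "link-2", "link-6", "link-5"]
overlap = link_intersection(results_q1, results_q2)
print(variants)
print(f"top 1 unchanged: {same_top1(results_q1, results_q2)}, intersection: {overlap:.0%}")
```

In this toy example the order of the links changes while the top 1 link and most of the link set stay the same, which is the pattern the finding describes.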
Takeaways
1. Developers’ queries are likely to include key contexts (e.g., technologies)
2. Developers’ queries share characteristics with general ones: both are short and tend to omit function words
3. Google finds software resources mostly on Stack Overflow (11%) with an over-concentration in the top 1 results (28%)
4. YouTube is a prominent source for Google (mostly top 3 results of general queries)
5. Performing minor changes to queries does not broadly affect the top 1 search results or the overall top 10 (however, there are exceptions!)
Ad

More Related Content

Similar to Googling for Software Development: What Developers Search For and What They Find (MSR 2021) (20)

What Do Developers Discuss about Code Comments?
What Do Developers Discuss about Code Comments?What Do Developers Discuss about Code Comments?
What Do Developers Discuss about Code Comments?
Pooja Rani
 
Managing Large-scale Multimedia Development Projects
Managing Large-scale Multimedia Development ProjectsManaging Large-scale Multimedia Development Projects
Managing Large-scale Multimedia Development Projects
Simon Price
 
Final Presentation
Final PresentationFinal Presentation
Final Presentation
Love Tyagi
 
HackerRank 2018 Tech Recruiting Report
HackerRank 2018 Tech Recruiting ReportHackerRank 2018 Tech Recruiting Report
HackerRank 2018 Tech Recruiting Report
HackerRank
 
IRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET- Semantic Question Matching
IRJET- Semantic Question Matching
IRJET Journal
 
Some Size factors in software engineering
Some Size factors in software engineeringSome Size factors in software engineering
Some Size factors in software engineering
V.V.Vanniaperumal College for Women
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Debdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Debdoot Mukherjee
 
Towards Reusable Research Software
Towards Reusable Research SoftwareTowards Reusable Research Software
Towards Reusable Research Software
dgarijo
 
Find the 'Unfindable' with TalentBin by Monster!
Find the 'Unfindable' with TalentBin by Monster!Find the 'Unfindable' with TalentBin by Monster!
Find the 'Unfindable' with TalentBin by Monster!
monsterindia
 
Understanding and Executing on API Developer Experience
Understanding and Executing on API Developer ExperienceUnderstanding and Executing on API Developer Experience
Understanding and Executing on API Developer Experience
Keshav Vasudevan
 
Software Engineering and Fundamentals note
Software Engineering and Fundamentals noteSoftware Engineering and Fundamentals note
Software Engineering and Fundamentals note
lissannakeryz84
 
Automatic Identification of Informative Code in Stack Overflow Posts
Automatic Identification of Informative Code in Stack Overflow PostsAutomatic Identification of Informative Code in Stack Overflow Posts
Automatic Identification of Informative Code in Stack Overflow Posts
Preetha Chatterjee
 
Quality, Cost, and Governance of Open Source Software
Quality, Cost, and Governance of Open Source SoftwareQuality, Cost, and Governance of Open Source Software
Quality, Cost, and Governance of Open Source Software
Girl Geek Dinners Milano
 
Adaptation of the technology of the static code analyzer for developing paral...
Adaptation of the technology of the static code analyzer for developing paral...Adaptation of the technology of the static code analyzer for developing paral...
Adaptation of the technology of the static code analyzer for developing paral...
PVS-Studio
 
What does open source mean for the institutional web manager?
What does open source mean for the institutional web manager?What does open source mean for the institutional web manager?
What does open source mean for the institutional web manager?
IWMW
 
Developing accessible experiences - Alison Walden
Developing accessible experiences - Alison WaldenDeveloping accessible experiences - Alison Walden
Developing accessible experiences - Alison Walden
Web à Québec
 
Is software engineering research addressing software engineering problems?
Is software engineering research addressing software engineering problems?Is software engineering research addressing software engineering problems?
Is software engineering research addressing software engineering problems?
Gail Murphy
 
Understanding and Executing on API Developer Experience
Understanding and Executing on API Developer ExperienceUnderstanding and Executing on API Developer Experience
Understanding and Executing on API Developer Experience
SmartBear
 
Product SEO and Why It Can Fail
Product SEO and Why It Can FailProduct SEO and Why It Can Fail
Product SEO and Why It Can Fail
Linda Groendyke
 
What Do Developers Discuss about Code Comments?
What Do Developers Discuss about Code Comments?What Do Developers Discuss about Code Comments?
What Do Developers Discuss about Code Comments?
Pooja Rani
 
Managing Large-scale Multimedia Development Projects
Managing Large-scale Multimedia Development ProjectsManaging Large-scale Multimedia Development Projects
Managing Large-scale Multimedia Development Projects
Simon Price
 
Final Presentation
Final PresentationFinal Presentation
Final Presentation
Love Tyagi
 
HackerRank 2018 Tech Recruiting Report
HackerRank 2018 Tech Recruiting ReportHackerRank 2018 Tech Recruiting Report
HackerRank 2018 Tech Recruiting Report
HackerRank
 
IRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET- Semantic Question Matching
IRJET- Semantic Question Matching
IRJET Journal
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Debdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Debdoot Mukherjee
 
Towards Reusable Research Software
Towards Reusable Research SoftwareTowards Reusable Research Software
Towards Reusable Research Software
dgarijo
 
Find the 'Unfindable' with TalentBin by Monster!
Find the 'Unfindable' with TalentBin by Monster!Find the 'Unfindable' with TalentBin by Monster!
Find the 'Unfindable' with TalentBin by Monster!
monsterindia
 
Understanding and Executing on API Developer Experience
Understanding and Executing on API Developer ExperienceUnderstanding and Executing on API Developer Experience
Understanding and Executing on API Developer Experience
Keshav Vasudevan
 
Software Engineering and Fundamentals note
Software Engineering and Fundamentals noteSoftware Engineering and Fundamentals note
Software Engineering and Fundamentals note
lissannakeryz84
 
Automatic Identification of Informative Code in Stack Overflow Posts
Automatic Identification of Informative Code in Stack Overflow PostsAutomatic Identification of Informative Code in Stack Overflow Posts
Automatic Identification of Informative Code in Stack Overflow Posts
Preetha Chatterjee
 
Quality, Cost, and Governance of Open Source Software
Quality, Cost, and Governance of Open Source SoftwareQuality, Cost, and Governance of Open Source Software
Quality, Cost, and Governance of Open Source Software
Girl Geek Dinners Milano
 
Adaptation of the technology of the static code analyzer for developing paral...
Adaptation of the technology of the static code analyzer for developing paral...Adaptation of the technology of the static code analyzer for developing paral...
Adaptation of the technology of the static code analyzer for developing paral...
PVS-Studio
 
What does open source mean for the institutional web manager?
What does open source mean for the institutional web manager?What does open source mean for the institutional web manager?
What does open source mean for the institutional web manager?
IWMW
 
Developing accessible experiences - Alison Walden
Developing accessible experiences - Alison WaldenDeveloping accessible experiences - Alison Walden
Developing accessible experiences - Alison Walden
Web à Québec
 
Is software engineering research addressing software engineering problems?
Is software engineering research addressing software engineering problems?Is software engineering research addressing software engineering problems?
Is software engineering research addressing software engineering problems?
Gail Murphy
 
Understanding and Executing on API Developer Experience
Understanding and Executing on API Developer ExperienceUnderstanding and Executing on API Developer Experience
Understanding and Executing on API Developer Experience
SmartBear
 
Product SEO and Why It Can Fail
Product SEO and Why It Can FailProduct SEO and Why It Can Fail
Product SEO and Why It Can Fail
Linda Groendyke
 

More from Andre Hora (15)

Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Andre Hora
 
TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...
TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...
TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...
Andre Hora
 
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
Andre Hora
 
When should internal interfaces be promoted to public? (FSE 2016)
When should internal interfaces be promoted to public? (FSE 2016)When should internal interfaces be promoted to public? (FSE 2016)
When should internal interfaces be promoted to public? (FSE 2016)
Andre Hora
 
Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)
Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)
Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)
Andre Hora
 
JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)
JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)
JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)
Andre Hora
 
Assessing Mock Classes: An Empirical Study (ICSME 2020)
Assessing Mock Classes: An Empirical Study (ICSME 2020)Assessing Mock Classes: An Empirical Study (ICSME 2020)
Assessing Mock Classes: An Empirical Study (ICSME 2020)
Andre Hora
 
What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)
What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)
What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)
Andre Hora
 
Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...
Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...
Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...
Andre Hora
 
How and Why Developers Migrate Python Tests (SANER 2022)
How and Why Developers Migrate Python Tests (SANER 2022)How and Why Developers Migrate Python Tests (SANER 2022)
How and Why Developers Migrate Python Tests (SANER 2022)
Andre Hora
 
Test Polarity: Detecting Positive and Negative Tests (FSE 2024)
Test Polarity: Detecting Positive and Negative Tests (FSE 2024)Test Polarity: Detecting Positive and Negative Tests (FSE 2024)
Test Polarity: Detecting Positive and Negative Tests (FSE 2024)
Andre Hora
 
Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...
Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...
Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...
Andre Hora
 
PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)
PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)
PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)
Andre Hora
 
Predicting Test Results without Execution (FSE 2024)
Predicting Test Results without Execution (FSE 2024)Predicting Test Results without Execution (FSE 2024)
Predicting Test Results without Execution (FSE 2024)
Andre Hora
 
SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)
SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)
SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)
Andre Hora
 
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Andre Hora
 
TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...
TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...
TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest (MSR...
Andre Hora
 
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
Andre Hora
 
When should internal interfaces be promoted to public? (FSE 2016)
When should internal interfaces be promoted to public? (FSE 2016)When should internal interfaces be promoted to public? (FSE 2016)
When should internal interfaces be promoted to public? (FSE 2016)
Andre Hora
 
Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)
Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)
Assessing the Threat of Untracked Changes in Software Evolution (ICSE 2018)
Andre Hora
 
JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)
JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)
JavaScript API Deprecation in the Wild: A First Assessment (SANER 2020)
Andre Hora
 
Assessing Mock Classes: An Empirical Study (ICSME 2020)
Assessing Mock Classes: An Empirical Study (ICSME 2020)Assessing Mock Classes: An Empirical Study (ICSME 2020)
Assessing Mock Classes: An Empirical Study (ICSME 2020)
Andre Hora
 
What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)
What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)
What Code Is Deliberately Excluded from Test Coverage and Why? (MSR 2021)
Andre Hora
 
Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...
Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...
Availability and Usage of Platform-Specific APIs: A First Empirical Study (MS...
Andre Hora
 
How and Why Developers Migrate Python Tests (SANER 2022)
How and Why Developers Migrate Python Tests (SANER 2022)How and Why Developers Migrate Python Tests (SANER 2022)
How and Why Developers Migrate Python Tests (SANER 2022)
Andre Hora
 
Test Polarity: Detecting Positive and Negative Tests (FSE 2024)
Test Polarity: Detecting Positive and Negative Tests (FSE 2024)Test Polarity: Detecting Positive and Negative Tests (FSE 2024)
Test Polarity: Detecting Positive and Negative Tests (FSE 2024)
Andre Hora
 
Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...
Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...
Monitoring the Execution of 14K Tests: Methods Tend to Have One Path that Is ...
Andre Hora
 
PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)
PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)
PathSpotter: Exploring Tested Paths to Discover Missing Tests (FSE 2024)
Andre Hora
 
Predicting Test Results without Execution (FSE 2024)
Predicting Test Results without Execution (FSE 2024)Predicting Test Results without Execution (FSE 2024)
Predicting Test Results without Execution (FSE 2024)
Andre Hora
 
SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)
SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)
SpotFlow: Tracking Method Calls and States at Runtime (ICSE 2024)
Andre Hora
 
Ad

Recently uploaded (20)

PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025
mu394968
 
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Lionel Briand
 
Top 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docxTop 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docx
Portli
 
Exploring Wayland: A Modern Display Server for the Future
Exploring Wayland: A Modern Display Server for the FutureExploring Wayland: A Modern Display Server for the Future
Exploring Wayland: A Modern Display Server for the Future
ICS
 
Revolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptxRevolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptx
nidhisingh691197
 
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
steaveroggers
 
Microsoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdf
Microsoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdfMicrosoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdf
Microsoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdf
TechSoup
 
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Dele Amefo
 
Kubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptxKubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptx
CloudScouts
 
Designing AI-Powered APIs on Azure: Best Practices& Considerations
Designing AI-Powered APIs on Azure: Best Practices& ConsiderationsDesigning AI-Powered APIs on Azure: Best Practices& Considerations
Designing AI-Powered APIs on Azure: Best Practices& Considerations
Dinusha Kumarasiri
 
Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025
kashifyounis067
 
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRYLEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
NidaFarooq10
 
F-Secure Freedome VPN 2025 Crack Plus Activation New Version
F-Secure Freedome VPN 2025 Crack Plus Activation  New VersionF-Secure Freedome VPN 2025 Crack Plus Activation  New Version
F-Secure Freedome VPN 2025 Crack Plus Activation New Version
saimabibi60507
 
Meet the Agents: How AI Is Learning to Think, Plan, and Collaborate
Meet the Agents: How AI Is Learning to Think, Plan, and CollaborateMeet the Agents: How AI Is Learning to Think, Plan, and Collaborate
Meet the Agents: How AI Is Learning to Think, Plan, and Collaborate
Maxim Salnikov
 
Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]
saniaaftab72555
 
Why Orangescrum Is a Game Changer for Construction Companies in 2025
Why Orangescrum Is a Game Changer for Construction Companies in 2025Why Orangescrum Is a Game Changer for Construction Companies in 2025
Why Orangescrum Is a Game Changer for Construction Companies in 2025
Orangescrum
 
EASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License CodeEASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License Code
aneelaramzan63
 
Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...
Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...
Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...
AxisTechnolabs
 
Secure Test Infrastructure: The Backbone of Trustworthy Software Development
Secure Test Infrastructure: The Backbone of Trustworthy Software DevelopmentSecure Test Infrastructure: The Backbone of Trustworthy Software Development
Secure Test Infrastructure: The Backbone of Trustworthy Software Development
Shubham Joshi
 
Download YouTube By Click 2025 Free Full Activated
Download YouTube By Click 2025 Free Full ActivatedDownload YouTube By Click 2025 Free Full Activated
Download YouTube By Click 2025 Free Full Activated
saniamalik72555
 
PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025
mu394968
 
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Lionel Briand
 
Top 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docxTop 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docx
Portli
 
Exploring Wayland: A Modern Display Server for the Future
Exploring Wayland: A Modern Display Server for the FutureExploring Wayland: A Modern Display Server for the Future
Exploring Wayland: A Modern Display Server for the Future
ICS
 
Revolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptxRevolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptx
nidhisingh691197
 
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
steaveroggers
 
Microsoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdf
Microsoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdfMicrosoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdf
Microsoft AI Nonprofit Use Cases and Live Demo_2025.04.30.pdf
TechSoup
 
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Googling for Software Development: What Developers Search For and What They Find (MSR 2021)

  • 1. Googling for Software Development: What Developers Search For and What They Find. MSR 2021. Andre Hora
  • 3. Developers often search for software resources on the web. They may spend ~20% of their time on the web.
  • 6. Stack Overflow: 50M users/month. W3Schools: 2.5B pageviews/year. Over 85% of their traffic comes from web search engines [alexa.com].
  • 10. What do developers search for, and what do they find?
  • 13. Search Queries: understand real-world search queries and developers’ needs. Search Results: detect where search engines find software resources and explore the results.
  • 15. Step 1, Selecting the Websites: stackoverflow.com, w3schools.com, geeksforgeeks.org, tutorialspoint.com, programcreek.com
  • 16. Step 2, Collecting the Search Queries: 1.3M distinct queries
  • 17. Step 3, What Developers Search For: RQ1: Query content; RQ2: Query size & keywords; RQ3: Query structure; RQ4: Query similarity
  • 18. Step 4, What Developers Find (Search API): RQ5: Result resources; RQ6: Result variation
  • 24. RQ1: What is the content of the search queries? Developers’ queries typically provide references to technologies, such as programming languages (30%), software technologies (24.5%), and web frameworks (5%).
  • 25. RQ2 Query Size & Keywords
  • 27. RQ2: What is the size of the search queries? Where are the keywords located? [Figure: panels "Size" and "Keyword position"] Developers’ queries are short: a median of 3 words. Keywords are more likely to be the first word than the last word in the query.
  • 29. RQ3: As in general web search, developers tend to exclude function words from their queries, which are mostly composed of content words (80.3%). RQ4: Most developers’ queries are similar to one another: 40% have no similar counterpart, 60% have at least one similar peer, and 8% have 10 or more similar ones.
  • 32. RQ5: Where does Google find software resources? Google finds software resources mostly on Stack Overflow (11%), YouTube (6%), and W3Schools (5%). However, the results may vary according to the query type (keyword or general).
  • 38. RQ6: How do Google results vary for similar queries? Word swap: "python email parser" vs. "email parser python". Word removal: "python email parser" vs. "email parser". Synonymous word: "python email parser" vs. "py email parser".
  • 40. RQ6: How do Google results vary for similar queries? The links and order of the top 10 Google search results are very likely to change between similar queries, whereas the top 1 result is much less affected. Overall, however, the intersection of links between similar queries is high: at least 70% in most cases.
  • 41. Takeaways: 1. Developers’ queries are likely to include key contexts (e.g., technologies). 2. Developers’ queries share characteristics with general ones: both are short and tend to omit function words. 3. Google finds software resources mostly on Stack Overflow (11%), with an over-concentration in the top 1 results (28%). 4. YouTube is a prominent source for Google (mostly in the top 3 results of general queries). 5. Minor changes to queries do not broadly affect the top 1 search result or the overall top 10 (however, there are exceptions!).
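
The RQ6 query variations (word swap, word removal, synonymous word) and the top 1 / top 10 comparison can be illustrated with a small Python sketch. This is not the study's tooling: the function names are hypothetical, the result lists are assumed to be already-fetched lists of URLs, and "overlap" is assumed here to mean the share of links in the first top-k list that also appear in the second, which may differ from the metric used in the talk.

```python
from __future__ import annotations


def word_swap(query: str) -> str:
    """Move the first word to the end of the query."""
    words = query.split()
    return " ".join(words[1:] + words[:1])


def word_removal(query: str, word: str) -> str:
    """Drop every occurrence of `word` from the query."""
    return " ".join(w for w in query.split() if w != word)


def synonym_swap(query: str, word: str, synonym: str) -> str:
    """Replace `word` with `synonym` wherever it appears as a whole word."""
    return " ".join(synonym if w == word else w for w in query.split())


def compare_results(links_a: list[str], links_b: list[str], k: int = 10) -> dict:
    """Compare two ranked result lists on top 1, ordering, and link overlap."""
    top_a, top_b = links_a[:k], links_b[:k]
    shared = set(top_a) & set(top_b)
    return {
        # True when both lists are non-empty and start with the same link.
        "same_top1": bool(top_a) and bool(top_b) and top_a[0] == top_b[0],
        # True when the top-k links are identical and in the same order.
        "same_order": top_a == top_b,
        # Assumed metric: fraction of distinct links in A also present in B.
        "overlap": len(shared) / max(len(set(top_a)), 1),
    }


if __name__ == "__main__":
    base = "python email parser"
    print(word_swap(base))
    print(word_removal(base, "python"))
    print(synonym_swap(base, "python", "py"))

    # Placeholder result lists, not real search results.
    results_1 = ["site-a/1", "site-b/2", "site-c/3"]
    results_2 = ["site-a/1", "site-c/3", "site-d/4"]
    print(compare_results(results_1, results_2))
```

Keeping the result lists as plain inputs leaves the choice of search backend open; any source that returns ranked URLs for a query could feed `compare_results`.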