
DESCRIPTION : OPERATION ANALYTICS & INVESTIGATING METRICS SPIKE

Operation analytics involves gathering and analyzing data to gain valuable insights
into the performance and efficiency of business operations. A metrics spike is a
sudden and significant increase in key metrics; investigating it serves as a signal
to identify underlying causes, anomalies, or areas requiring immediate attention
and action.
As the Data Analyst Trainee, we have to load the CSV files, create the database,
and run the queries to answer the questions asked in the respective case studies:
• Case Study 1 > Job Data table
• Case Study 2 > Users, Events and Email tables
TECH-STACK USED

• MySQL
• Command Prompt
• Google Drive


CASE STUDY 1
This Case Study contains the Job Data table:
• 7 attributes (columns)
• 8 records (rows)
This case study has questions like:
• Calculate the number of jobs reviewed per hour per day for November 2020?
• Calculate 7 day rolling average of throughput? For throughput, do you prefer
daily metric or 7-day rolling and why?
• Calculate the percentage share of each language in the last 30 days?
• How will you display duplicates from the table?
We have to read the questions asked by the Data Managers, answer them by running
the SQL queries, and report the results to the respective manager.

/*Picture of the table Map has been attached on the next slide.*/
APPROACH : CASE STUDY1

• Downloaded the given dataset for Case Study 1.
• Studied the dataset and noted down the necessary tables and columns on pen and
paper.
• Created a database and the respective tables (a sketch of these statements is
shown in the Insights section below).
• Imported the dataset via LOAD DATA LOCAL INFILE, because it is more efficient and
faster than the Table Data Import Wizard; this statement is especially useful for
importing large data like in Case Study 2.
• Ran the queries to find the answers to the questions asked.
INSIGHTS : CASE STUDY1
Creating the database and its tables

Database name: casestudy1
Table name: jobsdata
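
A minimal sketch of the setup, assuming the Job Data CSV has seven columns named
ds, job_id, actor_id, event, language, time_spent and org (the column names, types
and file path are assumptions for illustration; the real ones come from the
provided CSV):

-- Create the database and the jobs table (column names/types are assumed).
CREATE DATABASE IF NOT EXISTS casestudy1;
USE casestudy1;

CREATE TABLE jobsdata (
    ds          DATE,           -- date the job was reviewed
    job_id      INT,
    actor_id    INT,
    event       VARCHAR(20),
    language    VARCHAR(20),
    time_spent  INT,            -- seconds spent reviewing the job
    org         VARCHAR(5)
);

-- Fast bulk import from the CSV (requires local_infile to be enabled;
-- the file path is a placeholder).
LOAD DATA LOCAL INFILE 'C:/data/job_data.csv'
INTO TABLE jobsdata
FIELDS TERMINATED BY ','
LINES TERMINATED BY '\n'
IGNORE 1 ROWS;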
INSIGHTS : CASE STUDY1 - A
Number of jobs reviewed: amount of jobs reviewed over time.

Your task: Calculate the number of jobs reviewed per hour per day (PHPD) for
November 2020.

Date JobReviewedPHPD
30-11-2020 180
29-11-2020 180
28-11-2020 218
27-11-2020 35
26-11-2020 64
25-11-2020 80
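
A sketch of a query that produces this kind of result, using the assumed columns
ds, job_id and time_spent from the table definition above; one plausible reading of
the metric divides each day's job count by the hours spent reviewing that day:

SELECT ds AS ReviewDate,
       COUNT(job_id) / (SUM(time_spent) / 3600) AS JobsReviewedPHPD
FROM jobsdata
WHERE ds BETWEEN '2020-11-01' AND '2020-11-30'
GROUP BY ds
ORDER BY ds DESC;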
INSIGHTS : CASE STUDY1- B(PART1)
Let's say the above metric is called throughput. Calculate the 7-day rolling
average of throughput. For throughput, do you prefer the daily metric or the 7-day
rolling average, and why?

Date Daily Throughput

30-11-2020 0.02
29-11-2020 0.02
28-11-2020 0.01
27-11-2020 0.06
26-11-2020 0.05
25-11-2020 0.05
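
A sketch of the daily throughput calculation, reading throughput as jobs reviewed
per second (job count divided by total time spent; column names as assumed above):

SELECT ds AS ReviewDate,
       ROUND(COUNT(job_id) / SUM(time_spent), 2) AS DailyThroughput
FROM jobsdata
GROUP BY ds
ORDER BY ds DESC;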
INSIGHTS : CASE STUDY1-B(PART2)
Calculate the 7-day rolling average of throughput. For throughput, do you prefer
the daily metric or the 7-day rolling average, and why?

The weekly (7-day rolling) throughput is 0.03.

So, from here we conclude that the daily throughput gives more detailed
information, and it can be preferred over the weekly throughput.
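
A sketch of the 7-day rolling average using a window function (requires MySQL 8.0+;
same assumed columns as above):

SELECT ds,
       ROUND(AVG(daily_throughput) OVER (ORDER BY ds
                                         ROWS BETWEEN 6 PRECEDING AND CURRENT ROW),
             2) AS Rolling7DayThroughput
FROM (
    SELECT ds, COUNT(job_id) / SUM(time_spent) AS daily_throughput
    FROM jobsdata
    GROUP BY ds
) AS daily;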
INSIGHTS : CASE STUDY1-C
Percentage share of each language: share of each language for different contents.

Calculate the percentage share of each language in the last 30 days.

Lang Totaljobs Lang%


English 1 12.5000
Arabic 1 12.5000
Persian 3 37.5000
Hindi 1 12.5000
French 1 12.5000
Italian 1 12.5000
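
A sketch of the percentage-share query; the reference date '2020-11-30' is an
assumption standing in for the latest date in the data:

SELECT language AS Lang,
       COUNT(job_id) AS TotalJobs,
       100 * COUNT(job_id) /
           (SELECT COUNT(*)
            FROM jobsdata
            WHERE ds > DATE_SUB('2020-11-30', INTERVAL 30 DAY)) AS LangPercent
FROM jobsdata
WHERE ds > DATE_SUB('2020-11-30', INTERVAL 30 DAY)
GROUP BY language;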
INSIGHTS : CASE STUDY1-D(PART1)
Duplicate rows: rows that have the same values present in them.

Let's say you see some duplicate rows in the data. How will you display duplicates
from the table?

If we take duplicate rows on the basis of JobID, then there are 3 duplicates of
the ID 23.
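
A sketch of the duplicate check using GROUP BY / HAVING (swap job_id for actor_id
to find duplicates on that column instead; column names are assumptions):

SELECT job_id, COUNT(*) AS Occurrences
FROM jobsdata
GROUP BY job_id
HAVING COUNT(*) > 1;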
INSIGHTS : CASE STUDY1-D(PART2)
Let's say you see some duplicate rows in the data. How will you display duplicates
from the table?

If we take duplicate rows on the basis of ActorID, then there are 2 duplicates of
the ID 1003.
CASE STUDY 2
This Case Study contains 3 tables as follows:
• Users (6 attributes and 19,066 records)
• Events (7 attributes and 340,832 records)
• Email (4 attributes and 90,389 records)
This case study has questions like:
• Calculate the weekly user engagement?
• Calculate the user growth for product?
• Calculate the weekly retention of users-sign up cohort?
• Calculate the weekly engagement per device?
• Calculate the email engagement metrics?
We have to read the questions asked by the Data Managers, answer them by running
the SQL queries, and report the results to the respective manager.

/*Picture of the table Map has been attached on the next slide.*/
APPROACH : CASE STUDY2

• Downloaded the given dataset for Case Study 2.
• Studied the dataset and noted down the necessary tables and columns on pen and
paper.
• Created a database and the respective tables.
• Imported the dataset via LOAD DATA LOCAL INFILE, because it is more efficient and
faster than the Table Data Import Wizard and is especially useful for importing
large data like in this case: MySQL Workbench takes around 5 hours to import the
data, whereas the command line takes 5 to 10 seconds (a sketch of the command-line
import is shown after this list).
• Ran the queries to find the answers to the questions asked.
• This case study uses advanced SQL, with features like CASE expressions and window
functions.
• At last, optimized the queries wherever possible.
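
A minimal sketch of the command-line import (the file path, credentials and table
name are placeholders; local_infile must be enabled on both client and server):

# Shell: start the client with LOCAL INFILE enabled (credentials are placeholders)
mysql --local-infile=1 -u root -p casestudy2

-- Inside the mysql prompt: bulk-load the largest table (~340k rows) in seconds
LOAD DATA LOCAL INFILE 'C:/data/events.csv'
INTO TABLE events
FIELDS TERMINATED BY ','
LINES TERMINATED BY '\n'
IGNORE 1 ROWS;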
INSIGHTS : CASE STUDY-2
Creating the database and its tables

Database name: casestudy2

Table names:
• Users
• Events
• Emails
INSIGHTS : CASE STUDY-2 A
Calculate the weekly user engagement?

WeekNum TotalUsers
17 8019
18 17341
19 17224
20 17911
21 17151
22 18413
23 18280
24 19052
25 18642
26 19061
27 19881
28 20776
29 20067
30 21533
31 18556
32 16612
33 16145
34 16127
35 784
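
A sketch of the weekly engagement query (columns occurred_at, event_type and
user_id in the events table are assumptions; add DISTINCT around user_id if unique
weekly users are wanted rather than engagement events):

SELECT WEEK(occurred_at) AS WeekNum,
       COUNT(user_id) AS TotalUsers
FROM events
WHERE event_type = 'engagement'
GROUP BY WeekNum
ORDER BY WeekNum;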
INSIGHTS : CASE STUDY-2 B

User Growth: amount of users growing over time for a product.

Calculate the user growth for the product.

Snippet → Output
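
A sketch of the user-growth query as a cumulative count of activated users per week
(columns user_id and activated_at in the users table are assumptions; window
functions require MySQL 8.0+):

SELECT yr, wk,
       SUM(weekly_users) OVER (ORDER BY yr, wk) AS CumulativeUsers
FROM (
    SELECT YEAR(activated_at) AS yr,
           WEEK(activated_at) AS wk,
           COUNT(DISTINCT user_id) AS weekly_users
    FROM users
    WHERE activated_at IS NOT NULL
    GROUP BY yr, wk
) AS weekly
ORDER BY yr, wk;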
INSIGHTS : CASE STUDY-2 C

Weekly Retention: users getting retained weekly after signing up for a product.
Calculate the weekly retention of users on a sign-up cohort basis.

Snippets 1–4 → Output
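
A simplified sketch of the retention logic that the multi-part query (Snippets 1–4)
implements: for each sign-up week, count how many of those users are active in
later weeks (column names created_at, occurred_at, event_type are assumptions):

SELECT signup_week,
       later_week,
       COUNT(DISTINCT e.user_id) AS RetainedUsers
FROM (
    SELECT user_id, WEEK(created_at) AS signup_week
    FROM users
) AS u
JOIN (
    SELECT user_id, WEEK(occurred_at) AS later_week
    FROM events
    WHERE event_type = 'engagement'
) AS e ON e.user_id = u.user_id
      AND e.later_week >= u.signup_week
GROUP BY signup_week, later_week
ORDER BY signup_week, later_week;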
OUTPUT OF C :
INSIGHTS : CASE STUDY-2 D
Weekly Engagement: measures the activeness of users, i.e., whether they find value
in the product/service each week. Calculate the weekly engagement per device?

Snippet → Output
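
A sketch of the per-device engagement query (columns occurred_at, device, user_id
and event_type in the events table are assumptions):

SELECT YEAR(occurred_at) AS Yr,
       WEEK(occurred_at) AS Wk,
       device,
       COUNT(DISTINCT user_id) AS ActiveUsers
FROM events
WHERE event_type = 'engagement'
GROUP BY Yr, Wk, device
ORDER BY Yr, Wk, device;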
INSIGHTS : CASE STUDY-2 E
Email Engagement: Users engaging with the email service. Calculate the email engagement
metrics?

Snippet → Output
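
A sketch of email engagement metrics such as open rate and click-through rate (the
exact table name, Email or Emails in the slides, and the action values
sent_weekly_digest, email_open and email_clickthrough are assumptions about the
dataset):

SELECT
    100.0 * SUM(action = 'email_open') / SUM(action = 'sent_weekly_digest')
        AS OpenRatePct,
    100.0 * SUM(action = 'email_clickthrough') / SUM(action = 'sent_weekly_digest')
        AS ClickRatePct
FROM email;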
RESULT: OUTCOME

• Learnt how to use the command shell by running queries from the command prompt.
• Improved a little more on MySQL (advanced SQL); will work on a few more databases
to gain more clarity.
• Answering all the questions in Case Studies 1 & 2 gave me much better conceptual
clarity on advanced SQL.
• Learnt about presenting inferences via PPT reports.
• This point doesn't hold much value, but being from a non-IT background I learnt a
lot about the computer system.
