0% found this document useful (0 votes)

16 views

Crime Data Analysis in Toronto - Group 4

Uploaded by

hajeraunnisa188

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

Crime Data Analysis in Toronto - Group 4

Uploaded by

hajeraunnisa188

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

CRIME DATA ANALYSIS IN

TORONTO
Crime Data Analysis and Exploratory Report
Toronto Police Services
Group 4
Dev Wadiker
Chinmay Wadhavkar
Hajera Unnisa
Deepanshi

1
AGENDA

•Introduction
•Data Description
•Methodology
•Data Cleaning and Preparation
•Exploratory Data Analysis (EDA)
•Correlation Analysis
•Model Analysis-Predictive Modeling
•Conclusions
•Recommendations

2
EXECUTIVE SUMMARY

Objective: Analyze Toronto crime data to uncover trends,

patterns, and actionable insights.

Key Findings:
•Data cleaning improved dataset quality.
•Identified significant temporal and spatial trends.
•Assaults and theft are the most prevalent crimes.
•Certain neighborhoods are consistent crime hotspots.

Conclusions: Insights will aid in better resource allocation and

targeted interventions by Toronto Police Services.
3
INTRODUCTION
Background:
Overview of crime challenges in Toronto and the importance of
understanding crime patterns.

Problem Statement:
The need to identify temporal and spatial trends in crime to
inform resource allocation and public safety measures.

Objectives:
1.Clean and preprocess the dataset.
2.Conduct exploratory data analysis (EDA).
3.Provide insights into crime distribution.
4.Offer recommendations for crime prevention and resource
allocation.
4
DATA DESCRIPTION

Data Source: Major Crime Indicators Open Data by Toronto

Police Services.
Variables:
OCC_DATE: Date and time of crime occurrence.
REPORT_DATE: Date and time the crime was reported.
Geographical Data: Longitude (LONG_WGS84) and
Latitude (LAT_WGS84).
Crime Types: OFFENCE and MCI_CATEGORY.
Data Quality: Cleaning steps included imputation,
standardization, and removal of duplicates.

5
METHODOLOGY

Approach: Systematic analysis starting from data

cleaning to exploratory data analysis.

Tools: Python (Pandas, Matplotlib, Seaborn), Jupyter

Notebooks, Geopandas.

Assumptions and Limitations:

•Assumed data completeness and temporal consistency.
•Geographic resolution and lack of socio-economic data
were noted limitations.

6
DATA CLEANING AND PREPARATION

Process:

1.Handling Missing Data: Mode, mean imputation, and

backfill method.
2.Dropping Redundant Columns: Removed
unnecessary or duplicate columns.
3.Data Standardization: Standardized categorical data
for consistency.
4.Handling Duplicates: Ensured each record was
unique.
•Outcome: A clean and well-structured dataset ready for
analysis.

7
EXPLORATORY DATA ANALYSIS (EDA)
1. Descriptive Statistics
Objective: Provide an overview of the main
statistics of the dataset, such as the count, mean,
median, standard deviation, minimum, and
maximum values for key variables.

Summary:

REPORT_YEAR: The data spans from 2000 to

2024, with a mean year of approximately 2019.

REPORT_DAY: The day of the report varies from 1

to 31, with an average of about 15.

OCC_YEAR: The occurrence year ranges from 2000

to 2024, with similar statistics to the report year.

LONG_WGS84 and LAT_WGS84: The geographic

coordinates have a standard deviation indicating
varying locations, with some invalid data points
(longitude 0). 8
2. TEMPORAL ANALYSIS
A. Crime Trends Over Time:

crime_trends_over_years
visualization shows an
increase in crime rates up
until 2020, with a significant
drop in 2024. This could
indicate a data anomaly or the
effect of external factors such
as the COVID-19 pandemic.

9
B. Crime Distribution by Day
of the Week:
 The
crime_distribution_by_dow.pn
g chart indicates a fairly
uniform distribution of crimes
across the week, with slightly
higher incidents on Fridays
and Saturdays.

10
C. Crime Distribution by Hour
of the Day:

crime_distribution_by_hour
visualization highlights peak
crime hours between midnight
and 2 AM, with another rise in
the late afternoon to early
evening

11
3. SPATIAL ANALYSIS

A. Crime Hotspots:
The crime_hotspots.png map
reveals high concentrations of
crime in certain areas of Toronto,
with noticeable hotspots.

12
B. Neighborhood Analysis:

Crime_distribution_top_neighbo
rhoods chart identifies the top
20 neighborhoods with the
highest crime rates, with West
Humber-Clairville and Moss
Park leading.

13
4. CRIME CATEGORY ANALYSIS

A. Offense Types:
 Frequency_of_offense_types
chart shows that assault is the
most frequent crime type,
followed by vehicle-related
offenses and theft.

14
B. Location and Premises Types:
 Distribution_of_crimes_by_loc
ation visualization shows that
most crimes occur in condos
and mobile homes, followed by
public spaces.

15
CORRELATION ANALYSIS

•Correlation Matrix:
 The correlation_matrix.png
visualization shows strong
correlations between certain
variables, like OBJECT ID and
REPORT_YEAR, which could
indicate data recording patterns
rather than meaningful insights.

16
MODEL ANALYSIS
Performance Metrics:
Objective:
• Precision, Recall, F1-Score:
 Evaluate the performance of a
Significant variation across different crime
Random Forest model used to types.
predict crime types based on
Higher performance for frequent categories
temporal and spatial features. (e.g., "Assault with Weapon").
Model Summary: • Macro Average:
Hyperparameter Tuning: Precision: 0.29
Optimized using Randomized Recall: 0.08
Search.Best Parameters:Number
of Estimators: 200Maximum F1-Score: 0.11
Depth: 20Minimum Samples Split: • Weighted Average:
2Minimum Samples Leaf: 1
Overall Accuracy: Precision: 0.42
 44.19% (Moderate Performance) Recall: 0.44
F1-Score: 0.38
17
1. CONFUSION MATRIX

•Insight:

The matrix highlights the model's

strengths in predicting high-
frequency crimes but shows
challenges with less frequent ones.
Normalization provides a clearer
understanding of relative
performance.

18
2. FEATURE IMPORTANCE:
• Insight:
Spatial features (LAT_WGS84,
LONG_WGS84) are the most critical
predictors, followed by temporal
features (OCC_DAY, OCC_HOUR).

• Conclusion:
The Random Forest model is a solid
foundation with an accuracy of 44.19%.
• Limitations:
The model struggles with less frequent
crime categories, indicating room for
improvement.

19
CONCLUSION
Summary of Findings:
•Temporal Trends: Crime rates increased notably post-2014, with a peak in
2019-2021.
•Spatial Trends: High crime concentration in specific neighborhoods like
Moss Park and West Humber-Clairville.
•Crime Categories: Assaults are the most prevalent crime type.
•Modeling: Random Forest effectively identifies key predictors such as
location and time.

Implications:
•Strategic Resource Deployment: Allocate more resources to high-risk
areas and peak times.
•Predictive Policing: Utilize and refine predictive models to anticipate crime
trends and allocate resources proactively. 20
RECOMMENDATIONS
Strategic Resource Deployment:
• Focus on high-crime neighborhoods such
as Moss Park and West Humber-
Clairville.
• Optimize patrol schedules to cover peak
crime hours, especially late at night.
Predictive Policing:
• Model Integration: Incorporate the
Random Forest model into daily
operations for proactive crime prevention.
• Model Enhancement: Continuously
refine the model with additional data (e.g.,
socioeconomic factors, weather patterns).
Community Engagement:
• Strengthen community relations in high-
crime areas through increased presence
and outreach programs.
• Promote public awareness on crime
prevention strategies tailored to specific 21
neighborhoods.
THANK YOU

Hourglass Workout Program by Luisagiuliet 2
76% (21)
Hourglass Workout Program by Luisagiuliet 2
51 pages
12 Week Program: Summer Body Starts Now
87% (46)
12 Week Program: Summer Body Starts Now
70 pages
Read People Like A Book by Patrick King-Edited
57% (82)
Read People Like A Book by Patrick King-Edited
12 pages
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
77% (13)
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
260 pages
Cheat Code To The Universe
94% (79)
Cheat Code To The Universe
34 pages
Facial Gains Guide (001 081)
91% (45)
Facial Gains Guide (001 081)
81 pages
Curse of Strahd
95% (467)
Curse of Strahd
258 pages
The Psychiatric Interview - Daniel Carlat
91% (34)
The Psychiatric Interview - Daniel Carlat
473 pages
The Borax Conspiracy
91% (57)
The Borax Conspiracy
14 pages
The Secret Language of Attraction
86% (108)
The Secret Language of Attraction
278 pages
How To Develop and Write A Grant Proposal
83% (542)
How To Develop and Write A Grant Proposal
17 pages
Penis Enlargement Secret
60% (124)
Penis Enlargement Secret
12 pages
Workbook For The Body Keeps The Score
89% (53)
Workbook For The Body Keeps The Score
111 pages
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
83% (1016)
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
13 pages
KamaSutra Positions
78% (69)
KamaSutra Positions
55 pages
7 Hermetic Principles
93% (30)
7 Hermetic Principles
3 pages
27 Feedback Mechanisms Pogil Key
77% (13)
27 Feedback Mechanisms Pogil Key
6 pages
Frank Hammond - List of Demons
92% (92)
Frank Hammond - List of Demons
3 pages
Phone Codes
79% (28)
Phone Codes
5 pages
36 Questions That Lead To Love
91% (35)
36 Questions That Lead To Love
3 pages
How 2 Setup Trust
97% (307)
How 2 Setup Trust
3 pages
The 36 Questions That Lead To Love - The New York Times
94% (34)
The 36 Questions That Lead To Love - The New York Times
3 pages
100 Questions To Ask Your Partner
78% (36)
100 Questions To Ask Your Partner
2 pages
Satanic Calendar
25% (56)
Satanic Calendar
4 pages
The 36 Questions That Lead To Love - The New York Times
95% (21)
The 36 Questions That Lead To Love - The New York Times
3 pages
Jeffrey Epstein39s Little Black Book Unredacted PDF
75% (12)
Jeffrey Epstein39s Little Black Book Unredacted PDF
95 pages
14 Easiest & Hardest Muscles To Build (Ranked With Solutions)
100% (8)
14 Easiest & Hardest Muscles To Build (Ranked With Solutions)
27 pages
EBook Crime Analysis With Crime Mapping 4Th Edition Ebook PDF PDF Docx Kindle Full Chapter
100% (50)
EBook Crime Analysis With Crime Mapping 4Th Edition Ebook PDF PDF Docx Kindle Full Chapter
61 pages
1001 Songs
70% (73)
1001 Songs
1,798 pages
Service Manual RS2000 Toyota
100% (2)
Service Manual RS2000 Toyota
48 pages
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
23% (954)
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
38 pages
Zodiac Sign & Their Most Common Addictions
63% (30)
Zodiac Sign & Their Most Common Addictions
9 pages
Us Crime Data Exploration and Analysis
No ratings yet
Us Crime Data Exploration and Analysis
4 pages
Chi Kung - Qigong Program Five Animals
100% (12)
Chi Kung - Qigong Program Five Animals
137 pages
Koos-12 Knee Survey: INSTRUCTIONS: This Survey Asks For Your Views About Your Knee. Answer Every
100% (1)
Koos-12 Knee Survey: INSTRUCTIONS: This Survey Asks For Your Views About Your Knee. Answer Every
2 pages
Crime Data Analysis Presentation v2
No ratings yet
Crime Data Analysis Presentation v2
10 pages
Final Capstone Project - Group 4 - TPS
No ratings yet
Final Capstone Project - Group 4 - TPS
27 pages
Spatial Statistics in Crime Analysis:: Using Crimestat Iii®
No ratings yet
Spatial Statistics in Crime Analysis:: Using Crimestat Iii®
151 pages
Crimes Tati I I Work Book
No ratings yet
Crimes Tati I I Work Book
151 pages
Demographics Crime Trends
No ratings yet
Demographics Crime Trends
6 pages
Arcgis For Utilities Brochure
No ratings yet
Arcgis For Utilities Brochure
37 pages
Enhancing Public Safety A Data-Driven Approach
No ratings yet
Enhancing Public Safety A Data-Driven Approach
12 pages
Scopus 001 Ok
No ratings yet
Scopus 001 Ok
37 pages
1-Crime Mapping and Spatial Analysis PDF
No ratings yet
1-Crime Mapping and Spatial Analysis PDF
64 pages
1 Crime Mapping and Spatial Analysis PDF
No ratings yet
1 Crime Mapping and Spatial Analysis PDF
64 pages
Crime Analysis with Crime Mapping Rachel Boba Santos - Read the ebook online or download it for the best experience
100% (3)
Crime Analysis with Crime Mapping Rachel Boba Santos - Read the ebook online or download it for the best experience
65 pages
ADV Exp 4 2022301014
No ratings yet
ADV Exp 4 2022301014
6 pages
Example 3
No ratings yet
Example 3
29 pages
Exercise: Detecting and Quantifying Patterns: Personal and Property Crime
0% (1)
Exercise: Detecting and Quantifying Patterns: Personal and Property Crime
36 pages
Final Report
No ratings yet
Final Report
18 pages
Crime Analysis
No ratings yet
Crime Analysis
29 pages
Sachin Project
No ratings yet
Sachin Project
13 pages
Article3
No ratings yet
Article3
8 pages
Velasco Et Al. (2000) - Manual of Crime Analysis Map Production
No ratings yet
Velasco Et Al. (2000) - Manual of Crime Analysis Map Production
36 pages
Heat Map Generation For Crime Rate - Research Paper
100% (1)
Heat Map Generation For Crime Rate - Research Paper
7 pages
FINAL-RECITATION-COVERAGE
No ratings yet
FINAL-RECITATION-COVERAGE
2 pages
Heat Map Generation For Crime Rate - Report
No ratings yet
Heat Map Generation For Crime Rate - Report
11 pages
Crime Mapping Research
100% (1)
Crime Mapping Research
11 pages
GIS and Crime Mapping
100% (1)
GIS and Crime Mapping
45 pages
Exploring Transfer Learning For Crime Prediction
No ratings yet
Exploring Transfer Learning For Crime Prediction
3 pages
Devil Crime Rate Prediction Using K-Means
No ratings yet
Devil Crime Rate Prediction Using K-Means
14 pages
Baltimore Crime Report.
No ratings yet
Baltimore Crime Report.
11 pages
8.3 Crime-Data-Mining-Threat-Analysis-and-Prediction
No ratings yet
8.3 Crime-Data-Mining-Threat-Analysis-and-Prediction
16 pages
CRIME MAPPING (Autosaved)
100% (2)
CRIME MAPPING (Autosaved)
18 pages
Big Data and Cloud Computing -New Copy 8 Updated-1
No ratings yet
Big Data and Cloud Computing -New Copy 8 Updated-1
12 pages
Crime Analysis and Prediction
No ratings yet
Crime Analysis and Prediction
17 pages
MAPPING
No ratings yet
MAPPING
4 pages
GEOSPATIAL CRIME HOTSPOT DETECTION: A ROBUST FRAMEWORK USING BIRCH CLUSTERING OPTIMAL PARAMETER TUNING
No ratings yet
GEOSPATIAL CRIME HOTSPOT DETECTION: A ROBUST FRAMEWORK USING BIRCH CLUSTERING OPTIMAL PARAMETER TUNING
13 pages
Crime Analysis and Prediction Using Datamining: A Review
No ratings yet
Crime Analysis and Prediction Using Datamining: A Review
20 pages
Crime Analysis Report - Template
No ratings yet
Crime Analysis Report - Template
6 pages
Batch 3 Final
No ratings yet
Batch 3 Final
29 pages
Analyzing Crime Patterns Insights for Safer Communities
No ratings yet
Analyzing Crime Patterns Insights for Safer Communities
14 pages
Social Media Policy Presentation 20241205 234806 0000
No ratings yet
Social Media Policy Presentation 20241205 234806 0000
19 pages
Crime Analytics: Exploring Analysis of Crimes Through R Programming Language
No ratings yet
Crime Analytics: Exploring Analysis of Crimes Through R Programming Language
5 pages
Crimen y Análisis Sig
No ratings yet
Crimen y Análisis Sig
16 pages
us_crime_data_exploration_and_analysis
No ratings yet
us_crime_data_exploration_and_analysis
4 pages
New Content
No ratings yet
New Content
45 pages
Crime-Analysis-Report-Format-G9-3C (1)
No ratings yet
Crime-Analysis-Report-Format-G9-3C (1)
20 pages
Saraswati-Crime and Special Structure of Cities
No ratings yet
Saraswati-Crime and Special Structure of Cities
5 pages
Crimestat Iii: Susan C. Smith Christopher W. Bruce
No ratings yet
Crimestat Iii: Susan C. Smith Christopher W. Bruce
145 pages
Leapcm Report
No ratings yet
Leapcm Report
17 pages
Crime Analysis and Prediction Using Machine Learning
No ratings yet
Crime Analysis and Prediction Using Machine Learning
5 pages
D2-(5G)
No ratings yet
D2-(5G)
7 pages
Law enforcement operations and planning with crime mapping
No ratings yet
Law enforcement operations and planning with crime mapping
4 pages
Addarsh Chandrasekar - Crime Prediction and Classification in San Francisco City
No ratings yet
Addarsh Chandrasekar - Crime Prediction and Classification in San Francisco City
6 pages
Forecasting of Crime Ppt1
No ratings yet
Forecasting of Crime Ppt1
18 pages
The Utility of Hotspot Mapping For Predicting Spatial Patterns of Crime - SpringerLink
No ratings yet
The Utility of Hotspot Mapping For Predicting Spatial Patterns of Crime - SpringerLink
10 pages
Information Clearinghouse 8th Edition
No ratings yet
Information Clearinghouse 8th Edition
71 pages
Crime Mapping
No ratings yet
Crime Mapping
23 pages
Crime Hotspot Prediction
No ratings yet
Crime Hotspot Prediction
14 pages
Criminal Justice Statistics: Essential Methods
From Everand
Criminal Justice Statistics: Essential Methods
Sandeep Krishnamurthy
No ratings yet
Quantitative Criminology Handbook
From Everand
Quantitative Criminology Handbook
Neeraj Venkataraman
No ratings yet
Urban Expedition Tips
From Everand
Urban Expedition Tips
Maxwell Chen
No ratings yet
Presentation
No ratings yet
Presentation
25 pages
9.Differences in Customer Satisfaction and Repurchase
No ratings yet
9.Differences in Customer Satisfaction and Repurchase
7 pages
Project Report On Wto Organization
67% (3)
Project Report On Wto Organization
31 pages
m4-2 Main Air Compressor Instruction Manual
No ratings yet
m4-2 Main Air Compressor Instruction Manual
88 pages
Fast Food Chain Project Report
67% (3)
Fast Food Chain Project Report
2 pages
Site Acceptance Testing (S.A.T) : Quality Control Department
100% (1)
Site Acceptance Testing (S.A.T) : Quality Control Department
2 pages
Super Senses
50% (2)
Super Senses
8 pages
Coronavirus Disease (COVID-19) : Case Investigation Form
No ratings yet
Coronavirus Disease (COVID-19) : Case Investigation Form
1 page
Well Productivity in An Iranian Gas-Cond
No ratings yet
Well Productivity in An Iranian Gas-Cond
11 pages
433058-sample
No ratings yet
433058-sample
13 pages
Artificial Insemination
No ratings yet
Artificial Insemination
26 pages
Online Asset Integrity Management Training Course
No ratings yet
Online Asset Integrity Management Training Course
4 pages
Comparison HT TT Reiki
No ratings yet
Comparison HT TT Reiki
1 page
Manitou Work Platforms Atj 46 160 Atj 180 Atj Rnc 2rd t4 s1 Genuine Parts Catalogue 647697en 08 2018
No ratings yet
Manitou Work Platforms Atj 46 160 Atj 180 Atj Rnc 2rd t4 s1 Genuine Parts Catalogue 647697en 08 2018
22 pages
Cad 150 2 - 10 0370 CPR 0994 PDF
No ratings yet
Cad 150 2 - 10 0370 CPR 0994 PDF
3 pages
Pmfme Loan Policy
No ratings yet
Pmfme Loan Policy
7 pages
Monetizing Natural Gas: Qatar's Chemical Industry
100% (1)
Monetizing Natural Gas: Qatar's Chemical Industry
4 pages
Socioeconomic Status and Issues
No ratings yet
Socioeconomic Status and Issues
26 pages
BIA
No ratings yet
BIA
29 pages
Jane Doe vs. Miami-Dade County Public School Board - 1501871495226 - 10236595 - Ver1.0
No ratings yet
Jane Doe vs. Miami-Dade County Public School Board - 1501871495226 - 10236595 - Ver1.0
20 pages
HRHC - Innovative Healthcare & Software Solutions
No ratings yet
HRHC - Innovative Healthcare & Software Solutions
4 pages
Piggyback 20 Song 20 Book
No ratings yet
Piggyback 20 Song 20 Book
37 pages
Libro - Biodiversity, Ecosystem Functioning and Human Wellbeing
No ratings yet
Libro - Biodiversity, Ecosystem Functioning and Human Wellbeing
387 pages
Assignment Module 3 Healthcare Professionals
No ratings yet
Assignment Module 3 Healthcare Professionals
5 pages
Ep 95 0300 - (Hemp)
100% (2)
Ep 95 0300 - (Hemp)
86 pages
Trivial Pursuit - Questions
No ratings yet
Trivial Pursuit - Questions
6 pages
Earthing of Conductors
No ratings yet
Earthing of Conductors
48 pages