Terminal Assessment 2 DAP
Terminal Assessment 2 DAP
PYTHON
https://www.kaggle.com/datasets/karanprajapati7/titanic-survival-
REFERENCE: dataset?resource=download
DATASET
DESCRIPTION
TITANIC SURVIVAL
The Titanic Survival Dataset contains 1,300 rows and
DATASET
12 columns of records of passenger data
from the ill-fated Titanic voyage. Key features include
Survived (indicating survival status),
Dataset Name
Pclass (passenger class), Sex (gender), Age (age of
the passenger), and Fare (ticket fare). The
goal is to analyze the factors influencing survival
rates among passengers.
3. DATA CLEANING &
PREPARATION
OUTPUT:
DATA CLEANING &
PREPARATION
HANDLE MISSING VALUES
OUTPUT:
DATA CLEANING &
PREPARATION
REMOVE DUPLICATES
OUTPUT:
DATA CLEANING &
PREPARATION
ENCODING CATEGORICAL VARIABLES OUTPUT:
4. EXPLORATORY
DATA ANALYSIS (EDA)
4.2
UNIVARIATE
VISUALIZATIONS
4.3.
BIVARIATE
VISUALIZATIONS
4.4.
MULTIVARIATE
VISUALIZATION
5. STATISTICAL
ANALYSIS
The Titanic dataset's EDA reveals most Statistical tests revealed significant findings:
passengers were 20–40 years old, averaging T-test: Higher survival rates in 1st vs. 3rd class (p <
THE SIGNIFICANT AGE VARIATION BETWEEN CLASSES SHOWS THAT 1ST CLASS
PASSENGERS WERE TYPICALLY OLDER, POTENTIALLY WITH MORE FINANCIAL
MEANS AND RESOURCES FOR SURVIVAL.
BARPLOT OF
SURVIVAL RATES
BY PCLASS AND
EMBARKED
Discuss how the results could be applied in real-world scenarios (e.g., targeting
high-performing branches or improving underperforming ones).
01
These findings not only help to better understand the Titanic dataset but also provide a
framework for applying statistical analysis and visual insights to real-world business
scenarios. By targeting high-performing groups and addressing challenges in
underperforming segments, businesses can make more informed decisions that optimize
performance and improve overall outcomes.
7. CONCLUSION AND
KEY TAKEAWAYS
ALAMEDA, KHAIZA
MIBULOS, PRINCE RVIC
RUBIO, ANGELICA ELLAINE