Random Forest Algorithm - Titanic Dataset
Random Forest Algorithm - Titanic Dataset
This Algorithm uses the Titanic dataset to predict survival of passengers based on features
like age, sex, fare, and class.
We used a machine learning algorithm called Random Forest Classifier to build a prediction
model.
2️⃣
Recall
Out of all the people who actually survived, how many did the model correctly predict?
•8 people actually survived.
•Model correctly found 6 of them.
•Recall = 6 / 8 = 0.75 or 75% correct.
High recall = The model doesn’t miss survivors.
3️⃣F1-Score 3️⃣⚖️
A balanced score between precision and recall.
It’s like the average of the two — but more careful about balance.
•If precision is 0.6 and recall is 0.75 → F1 is around 0.67
•F1-score is high only if both precision and recall are good
📊 Final Conclusion from Feature Importance Graph
•✅ Top 3 important features:
• Age
• Sex
• Fare
These had the highest influence on survival predictions.
• Sex and Age:
• The model learned that women and younger passengers had
higher survival chances.
•💰 Fare:
• Passengers who paid more (likely in higher classes) had a better
chance of survival.
•📉 Less important features:
• Embarked, Parch, and SibSp had lower importance, meaning
they played a smaller role in the model’s decisions.
•🧠 Overall:
• The Random Forest model made accurate predictions by focusing
more on key survival factors.