Breast Cancer Prediction (Final)
Breast Cancer Prediction (Final)
Mrs Ramya.B.S
Presented by
Asif H [1NT15IS020]
Deivanai A [1NT15IS026]
Dhiraj M G [1NT15IS028]
Gautham N [1NT15IS032]
Introduction
● Breast cancer is cancer that forms in the cells of the breasts.
● It can occur in both men and women, but it’s far more common in
women.
● Data mining which is basically extraction of useful information in a pool of
data and looking for patterns among this data, can be used to develop a
tool to predict the prevalence of breast cancer in a patient.
● Here it is used to create a predictive model that can be accessed on a
website
Motivation
● The rate of incidence was found to be 25.8 in 100,000 women in India
● Studies show India still has a low breast cancer survival rate of 66%
● The major reason for low survival rates of breast cancer in India is that
the awareness about cancer and its treatment is very low
● More than 90% of women diagnosed with breast cancer at the earliest
stage survive their disease for at least 5 years
● Early detection can help cure the disease before it becomes advanced
Objective
● Appropriate analysis of the given dataset after removing null values
● Obtain the clean dataset without null, redundant values etc
● To use the following algorithms to predict the possibility of breast cancer
○ Support Vector Machine
○ Logistic Regression
○ Linear Regression
Fig 2. Workflow
Linear Regression
● Linear regression is a kind of statistical analysis that attempts to show a
relationship between two variables.
● Linear regression looks at various data points and plots a trend line.
Accuracy : 77.07%
E.g. When we have to predict if a student passes or fails in an exam when the number of
hours spent studying is given as a feature, the response variable has two values, pass
and fail.
Accuracy : 96.10%
Accuracy : 98.04%
- Python
- Windows/Ubuntu
- Web browser
Hardware Requirements :
- i5 processor.
- 4 - 8GB RAM.
Results - Web application
Comparison of the algorithms - Time taken
Comparison of the algorithms - Accuracy
Test case
Result for Support vector machine
Result for Logistic regression
Result for Linear regression
References
● [1] Uma Ojha, Savita Goel, A study on prediction of Breast Cancer
recurrence using Data Mining Techniques
● [2] Mohammed H.Tafish, Dr. Alaa M.El-Halees, Breast Cancer Severity
Prediction Using Data Mining Techniques in the Gaza Strip.
● [3] Jabeena Sultana, Abdul Khader Jilani, Predicting Breast Cancer Using
Logistic Regression and Multi-Class Classifiers.
● [6] https://www.tutorialspoint.com/flask
● [7] https://towardsdatascience.com
● [8] https://flask.pocoo.org
THANK
YOU