SlideShare a Scribd company logo
PEOPLE’S DEMOCRATIC REPUBLIC OFALGERIA
MINISTERY OF HIGHER EDUCATION AND SCIENTIFIC RESEARCH
UNIVERSITY OF MOHAMED BOUDIAF - M’SILA
DOMAIN: Mathematics and Computer
Science
FIELD: Computer Science
SUB-FIELD: Information Communication
Technologies
FACULTY: Mathematics and
Computer Science
DEPARTMENT of Computer
Science
A Dissertation in Fulfillment
For the Requirements of the Degree of Master
SUBJECT
Academic year: 2016 /2017
Mr: Y. ARIOUAT University of M’sila Chairman
Mr: TAHAR MEHENNI University of M’sila Supervisor
Mr: M. KAMEL University of M’sila Examiner
By: Yassine Bensaoucha
A data mining tool for the detection of
suicide in social networks
1
Introduction
Suicide
Social Network
Data mining
2
Naive Bayes Classifiers
Proposed classification
Implementation Of The System
Conclusion
plan
Introduction
At present, the suicide phenomenon is raising, having a relevant impact on our
society. each year millions of people die as a result of suicidal, behavior becoming
an economic social and human problem. on the other hand, the use of SN as a means
of communication is becoming extremely popular, and people find that writing about
their feeling and sharing it to the world through this SNs is much easier then talking
about it in real life. the increasing number of publications in SN has made the field
of data mining evolving.
Based on these considerations, we have developed a program that can
detect the suicidal tendencies of people via Twitter.
3
4
Suicide
Suicide is when people direct violence at themselves with the intent to end their
lives, and they die as a result of their actions. Suicide is a leading cause of death
in the United States.
According to the analysis carried out by the World Health Organization
(WHO) in 2012 every 40 seconds a person died by suicide, The WHO estimates
that suicide is the 13th leading cause of death in the world and the third one
between youth aged 15-44
5
The Social Network: is a network structure composed of social
entities (might be people, organisations or groups) as nodes. And
connections in the form of edges, that represent the connection or
relationship amongst these entities. Such connections might be
friendships, or business connection etc.
Social Network
Figure 1: Social Network
Data mining is the process of sorting through large datasets to identify patterns
and establish relationships to solve problems through data analysis.
Data mining
Text Mining
the process at which text is transferred into data that can be analysed. this
incurs the procedures of creating an index for the individual terms, based on the
location of the term within the original text, or based on other techniques or
protocols. The words and indexes can then be used for a variety of analysis
methods.
Sentiment Analysis
the process of determining the emotional tone behind a series of words, used to
gain an understanding of the the attitudes, opinions and emotions expressed
within an online mention.
6
Figure 2 : Machine learning techniques
7
Naive Bayes Classifiers
The Naive Bayes algorithm is a widely used algorithm for document
classification. Given a feature vector table, the algorithm computes the posterior
probability that the document belongs to different classes and assigns it to the
class with the highest posterior probability. There are two commonly used models
( multinomial model and multi-variate Bernoulli model) for using Naive Bayes
approach for text categorization.
8
Figure 3: trainining multinomial naive bayes
Figure 4: test multinomial naive bayes
9
Example multinomial naive Bayes classifier:
Figure 5 Example multinomial naive Bayes classifier
Decide:
whether document d5 belonging to class c=China?
10
Figure 6: resulut of the example multinomial naive Bayes classifier
11
Advantages of Naïve Bayes
Easy to implement
Requires a small amount of training data to estimate the parameters
Good results obtained in most of the cases
12
Data Pre-processing:
The goal behind preprocessing is to
represent each document as a
feature vector, to separate the text
into individual words.
Preprocessing text is called
tokenization or text
normalization.
Figure 7: Data Pre-processing
13
Proposed classification:
Each sentence is first preprocessed and then passed into three categories of
classifiers, each deciding whether the sentence belongs to the
corresponding category or not.
This category contains positive
and regular sentences
Contains negative sentences that do not contain suicidal
phrases like
"the taste of pizza today very bad".
contains suicidal phrases that are classifed as
emergency phrases in the previous search
“Each day, each hour, each minute is just torture. I want it to
end.”
“Waking up every day wishing I hadn’t “
“I feel like the only way to no longer carry this pain is to die.”
“I am weird and slow. Every social interaction is painfully
awkward. “
“I want to finish my life.”
“No one understands me in this life, I'm leaving.” 14
positive negative suicidal
Proposal Solution for Zero problem(document):
This happens in case the probability of each category is 0, that is when
we pass a new text (tweet) entirely on our classification and we do not
have any word from this text in the dataset
first solution: is switching each word from the new tweet (sentence )
with synonyms, then we re-categorize the new tweet (sentence with the
same meaning ) .
second solution:
the solution depends on the extraction of sentiment from the sentence
using the textblob library.
Sentiment(polarity,subjectivity).
The polarity score is a float within the
range [-1.0, 1.0].
The subjectivity is a float within the
range [0.0, 1.0]
if the number of “polarity” is greater than 0 Then positive else negative 15
Implementation
Of The System
download
tweets
pre-process
the tweets
Classification
Positive negative suicidal
16
Analysis of the results of
Classification
The user has
suicidal tendencies
The user has no
suicidal
tendencies
Username of the person
that we want to apply the
calassification on it
Username
Twitter
API
17
download
tweets
18
Analysis of the results of
Classification
Calculate the percentage of positiveTweets = positive
Calculate the percentage of negativeTweets = negative
Calculate the percentage of suicidalTweets = suicidal
elif suicidal > 20%
and negative > 60%
If suicidal > 70% The user has
suicidal
tendencies
else
The user has
no suicidal
tendencies
19
20
This work aims to create a program, which will be capable of detecting suicide
in social networks, so the results obtained from this study or this program were
acceptable.
While we have identified some interesting and promising results, future work
will be compare the results of this work with other classification. and also
merging our classification with decision tree classification.
CONCLUSION
21
thank you
Ad

Recommended

Cloud of things (IoT + Cloud Computing)
Cloud of things (IoT + Cloud Computing)
Zakaria Hossain
 
The future of IOT
The future of IOT
Arti Parab Academics
 
Presentation web 3.0(part 1)
Presentation web 3.0(part 1)
Abhishek Roy
 
Semantic Web
Semantic Web
Adarsh Kumar Yadav
 
Webservices
Webservices
Gerard Sylvester
 
IOT DATA AND BIG DATA
IOT DATA AND BIG DATA
Vellore institute of technology, Vellore
 
Crop prediction using machine learning
Crop prediction using machine learning
dataalcott
 
Web 1.0 2.0 3.0
Web 1.0 2.0 3.0
Nur Garriga
 
Advanced Web Development
Advanced Web Development
Robert J. Stein
 
Web Development
Web Development
Lena Petsenchuk
 
Tools for artificial intelligence
Tools for artificial intelligence
Olivier Teytaud
 
Physical Design of IoT.pdf
Physical Design of IoT.pdf
JoshuaKimmich1
 
Internet of Things (IOT)
Internet of Things (IOT)
Kunal Adhikari
 
Internet of things (IoT)
Internet of things (IoT)
Prakash Honnur
 
Latest trends in information technology
Latest trends in information technology
Atifa Aqueel
 
Presentation on social media on young generation in Bangladesh
Presentation on social media on young generation in Bangladesh
Shastry Rahman
 
IOT and Characteristics of IOT
IOT and Characteristics of IOT
AmberSinghal1
 
Internet and intranet
Internet and intranet
Subrata Kumer Paul
 
ACCIDENT DETECTION AND VEHICLE TRACKING USING GPS,GSM AND MEMS
ACCIDENT DETECTION AND VEHICLE TRACKING USING GPS,GSM AND MEMS
Krishna Moparthi
 
Artificial Intelligence and Future of Work
Artificial Intelligence and Future of Work
Oleksandr Krakovetskyi
 
Twitter sentimentanalysis report
Twitter sentimentanalysis report
Savio Aberneithie
 
IOT Platform Design Methodology
IOT Platform Design Methodology
poonam kumawat
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
bodapatigopi8531
 
Internet of things (IoT)- Introduction, Utilities, Applications
Internet of things (IoT)- Introduction, Utilities, Applications
Tarika Verma
 
Introduction to IOT
Introduction to IOT
Zubayer Al Billal Khan
 
Social networking
Social networking
Abdulla Aman
 
Social networking
Social networking
Bharat Sureka
 
Web 3.0
Web 3.0
JoshuaDominick
 
76201960
76201960
IJRAT
 
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
SaurabhMishra450
 

More Related Content

What's hot (20)

Advanced Web Development
Advanced Web Development
Robert J. Stein
 
Web Development
Web Development
Lena Petsenchuk
 
Tools for artificial intelligence
Tools for artificial intelligence
Olivier Teytaud
 
Physical Design of IoT.pdf
Physical Design of IoT.pdf
JoshuaKimmich1
 
Internet of Things (IOT)
Internet of Things (IOT)
Kunal Adhikari
 
Internet of things (IoT)
Internet of things (IoT)
Prakash Honnur
 
Latest trends in information technology
Latest trends in information technology
Atifa Aqueel
 
Presentation on social media on young generation in Bangladesh
Presentation on social media on young generation in Bangladesh
Shastry Rahman
 
IOT and Characteristics of IOT
IOT and Characteristics of IOT
AmberSinghal1
 
Internet and intranet
Internet and intranet
Subrata Kumer Paul
 
ACCIDENT DETECTION AND VEHICLE TRACKING USING GPS,GSM AND MEMS
ACCIDENT DETECTION AND VEHICLE TRACKING USING GPS,GSM AND MEMS
Krishna Moparthi
 
Artificial Intelligence and Future of Work
Artificial Intelligence and Future of Work
Oleksandr Krakovetskyi
 
Twitter sentimentanalysis report
Twitter sentimentanalysis report
Savio Aberneithie
 
IOT Platform Design Methodology
IOT Platform Design Methodology
poonam kumawat
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
bodapatigopi8531
 
Internet of things (IoT)- Introduction, Utilities, Applications
Internet of things (IoT)- Introduction, Utilities, Applications
Tarika Verma
 
Introduction to IOT
Introduction to IOT
Zubayer Al Billal Khan
 
Social networking
Social networking
Abdulla Aman
 
Social networking
Social networking
Bharat Sureka
 
Web 3.0
Web 3.0
JoshuaDominick
 
Advanced Web Development
Advanced Web Development
Robert J. Stein
 
Tools for artificial intelligence
Tools for artificial intelligence
Olivier Teytaud
 
Physical Design of IoT.pdf
Physical Design of IoT.pdf
JoshuaKimmich1
 
Internet of Things (IOT)
Internet of Things (IOT)
Kunal Adhikari
 
Internet of things (IoT)
Internet of things (IoT)
Prakash Honnur
 
Latest trends in information technology
Latest trends in information technology
Atifa Aqueel
 
Presentation on social media on young generation in Bangladesh
Presentation on social media on young generation in Bangladesh
Shastry Rahman
 
IOT and Characteristics of IOT
IOT and Characteristics of IOT
AmberSinghal1
 
ACCIDENT DETECTION AND VEHICLE TRACKING USING GPS,GSM AND MEMS
ACCIDENT DETECTION AND VEHICLE TRACKING USING GPS,GSM AND MEMS
Krishna Moparthi
 
Artificial Intelligence and Future of Work
Artificial Intelligence and Future of Work
Oleksandr Krakovetskyi
 
Twitter sentimentanalysis report
Twitter sentimentanalysis report
Savio Aberneithie
 
IOT Platform Design Methodology
IOT Platform Design Methodology
poonam kumawat
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
bodapatigopi8531
 
Internet of things (IoT)- Introduction, Utilities, Applications
Internet of things (IoT)- Introduction, Utilities, Applications
Tarika Verma
 

Similar to A data mining tool for the detection of suicide in social networks (20)

76201960
76201960
IJRAT
 
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
SaurabhMishra450
 
Predicting depression using deep learning and ensemble algorithms on raw twit...
Predicting depression using deep learning and ensemble algorithms on raw twit...
IJECEIAES
 
Hybrid sentiment and network analysis of social opinion polarization icoict
Hybrid sentiment and network analysis of social opinion polarization icoict
Andry Alamsyah
 
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
CSEIJJournal
 
Mimeme Attribute Classification using LDV Ensemble Multimodel Learning
Mimeme Attribute Classification using LDV Ensemble Multimodel Learning
CSEIJJournal
 
Automatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature Review
Dr. Amarjeet Singh
 
Depression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression Model
ijtsrd
 
Paper-SentimentAnalysisofTweetshhhjjjjjjjj
Paper-SentimentAnalysisofTweetshhhjjjjjjjj
nvnvnv0288
 
DEPRESSIVE-AND-NON-DEPRESSIVE-TWEETS-CLASSIFICATION-USING-MACHINE-LEARNING-MO...
DEPRESSIVE-AND-NON-DEPRESSIVE-TWEETS-CLASSIFICATION-USING-MACHINE-LEARNING-MO...
arpitgzpbhu
 
Combating propaganda texts using transfer learning
Combating propaganda texts using transfer learning
IAESIJAI
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...
IRJET Journal
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...
IRJET Journal
 
Abstract
Abstract
Suresh Prabhu
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET Journal
 
ashu ppt final.pptx
ashu ppt final.pptx
VekariyaDarshana2
 
final review ppt of engineering hypothetic arm
final review ppt of engineering hypothetic arm
ssuserd24233
 
Text to Emotion Extraction Using Supervised Machine Learning Techniques
Text to Emotion Extraction Using Supervised Machine Learning Techniques
TELKOMNIKA JOURNAL
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysis
ijfcstjournal
 
A review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptx
voicemail1
 
76201960
76201960
IJRAT
 
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
SaurabhMishra450
 
Predicting depression using deep learning and ensemble algorithms on raw twit...
Predicting depression using deep learning and ensemble algorithms on raw twit...
IJECEIAES
 
Hybrid sentiment and network analysis of social opinion polarization icoict
Hybrid sentiment and network analysis of social opinion polarization icoict
Andry Alamsyah
 
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
MIMEME ATTRIBUTE CLASSIFICATION USING LDV ENSEMBLE MULTIMODEL LEARNING
CSEIJJournal
 
Mimeme Attribute Classification using LDV Ensemble Multimodel Learning
Mimeme Attribute Classification using LDV Ensemble Multimodel Learning
CSEIJJournal
 
Automatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature Review
Dr. Amarjeet Singh
 
Depression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression Model
ijtsrd
 
Paper-SentimentAnalysisofTweetshhhjjjjjjjj
Paper-SentimentAnalysisofTweetshhhjjjjjjjj
nvnvnv0288
 
DEPRESSIVE-AND-NON-DEPRESSIVE-TWEETS-CLASSIFICATION-USING-MACHINE-LEARNING-MO...
DEPRESSIVE-AND-NON-DEPRESSIVE-TWEETS-CLASSIFICATION-USING-MACHINE-LEARNING-MO...
arpitgzpbhu
 
Combating propaganda texts using transfer learning
Combating propaganda texts using transfer learning
IAESIJAI
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...
IRJET Journal
 
A study of cyberbullying detection using Deep Learning and Machine Learning T...
A study of cyberbullying detection using Deep Learning and Machine Learning T...
IRJET Journal
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET Journal
 
final review ppt of engineering hypothetic arm
final review ppt of engineering hypothetic arm
ssuserd24233
 
Text to Emotion Extraction Using Supervised Machine Learning Techniques
Text to Emotion Extraction Using Supervised Machine Learning Techniques
TELKOMNIKA JOURNAL
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysis
ijfcstjournal
 
A review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptx
voicemail1
 
Ad

Recently uploaded (20)

presentation4.pdf Intro to mcmc methodss
presentation4.pdf Intro to mcmc methodss
SergeyTsygankov6
 
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
gun3awan88
 
ppt somu_Jarvis_AI_Assistant_presen.pptx
ppt somu_Jarvis_AI_Assistant_presen.pptx
MohammedumarFarhan
 
Allotted-MBBS-Student-list-batch-2021.pdf
Allotted-MBBS-Student-list-batch-2021.pdf
subhansaifi0603
 
Shifting Focus on AI: How it Can Make a Positive Difference
Shifting Focus on AI: How it Can Make a Positive Difference
1508 A/S
 
Presentation by Tariq & Mohammed (1).pptx
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
 
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
CristineGraceAcuyan
 
Measurecamp Copenhagen - Consent Context
Measurecamp Copenhagen - Consent Context
Human37
 
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
Taqyea
 
最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
taqyea
 
Artigo - Playing to Win.planejamento docx
Artigo - Playing to Win.planejamento docx
KellyXavier15
 
Residential Zone 4 for industrial village
Residential Zone 4 for industrial village
MdYasinArafat13
 
Predicting Titanic Survival Presentation
Predicting Titanic Survival Presentation
praxyfarhana
 
lecture12.pdf Introduction to bioinformatics
lecture12.pdf Introduction to bioinformatics
SergeyTsygankov6
 
Data Visualisation in data science for students
Data Visualisation in data science for students
confidenceascend
 
Boost Business Efficiency with Professional Data Entry Services
Boost Business Efficiency with Professional Data Entry Services
eloiacs eloiacs
 
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
 
Indigo dyeing Presentation (2).pptx as dye
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
美国毕业证范本中华盛顿大学学位证书CWU学生卡购买
美国毕业证范本中华盛顿大学学位证书CWU学生卡购买
Taqyea
 
Indigo_Airlines_Strategy_Presentation.pptx
Indigo_Airlines_Strategy_Presentation.pptx
mukeshpurohit991
 
presentation4.pdf Intro to mcmc methodss
presentation4.pdf Intro to mcmc methodss
SergeyTsygankov6
 
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
gun3awan88
 
ppt somu_Jarvis_AI_Assistant_presen.pptx
ppt somu_Jarvis_AI_Assistant_presen.pptx
MohammedumarFarhan
 
Allotted-MBBS-Student-list-batch-2021.pdf
Allotted-MBBS-Student-list-batch-2021.pdf
subhansaifi0603
 
Shifting Focus on AI: How it Can Make a Positive Difference
Shifting Focus on AI: How it Can Make a Positive Difference
1508 A/S
 
Presentation by Tariq & Mohammed (1).pptx
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
 
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
CristineGraceAcuyan
 
Measurecamp Copenhagen - Consent Context
Measurecamp Copenhagen - Consent Context
Human37
 
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
Taqyea
 
最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
taqyea
 
Artigo - Playing to Win.planejamento docx
Artigo - Playing to Win.planejamento docx
KellyXavier15
 
Residential Zone 4 for industrial village
Residential Zone 4 for industrial village
MdYasinArafat13
 
Predicting Titanic Survival Presentation
Predicting Titanic Survival Presentation
praxyfarhana
 
lecture12.pdf Introduction to bioinformatics
lecture12.pdf Introduction to bioinformatics
SergeyTsygankov6
 
Data Visualisation in data science for students
Data Visualisation in data science for students
confidenceascend
 
Boost Business Efficiency with Professional Data Entry Services
Boost Business Efficiency with Professional Data Entry Services
eloiacs eloiacs
 
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
 
Indigo dyeing Presentation (2).pptx as dye
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
美国毕业证范本中华盛顿大学学位证书CWU学生卡购买
美国毕业证范本中华盛顿大学学位证书CWU学生卡购买
Taqyea
 
Indigo_Airlines_Strategy_Presentation.pptx
Indigo_Airlines_Strategy_Presentation.pptx
mukeshpurohit991
 
Ad

A data mining tool for the detection of suicide in social networks

  • 1. PEOPLE’S DEMOCRATIC REPUBLIC OFALGERIA MINISTERY OF HIGHER EDUCATION AND SCIENTIFIC RESEARCH UNIVERSITY OF MOHAMED BOUDIAF - M’SILA DOMAIN: Mathematics and Computer Science FIELD: Computer Science SUB-FIELD: Information Communication Technologies FACULTY: Mathematics and Computer Science DEPARTMENT of Computer Science A Dissertation in Fulfillment For the Requirements of the Degree of Master SUBJECT Academic year: 2016 /2017 Mr: Y. ARIOUAT University of M’sila Chairman Mr: TAHAR MEHENNI University of M’sila Supervisor Mr: M. KAMEL University of M’sila Examiner By: Yassine Bensaoucha A data mining tool for the detection of suicide in social networks 1
  • 2. Introduction Suicide Social Network Data mining 2 Naive Bayes Classifiers Proposed classification Implementation Of The System Conclusion plan
  • 3. Introduction At present, the suicide phenomenon is raising, having a relevant impact on our society. each year millions of people die as a result of suicidal, behavior becoming an economic social and human problem. on the other hand, the use of SN as a means of communication is becoming extremely popular, and people find that writing about their feeling and sharing it to the world through this SNs is much easier then talking about it in real life. the increasing number of publications in SN has made the field of data mining evolving. Based on these considerations, we have developed a program that can detect the suicidal tendencies of people via Twitter. 3
  • 4. 4 Suicide Suicide is when people direct violence at themselves with the intent to end their lives, and they die as a result of their actions. Suicide is a leading cause of death in the United States. According to the analysis carried out by the World Health Organization (WHO) in 2012 every 40 seconds a person died by suicide, The WHO estimates that suicide is the 13th leading cause of death in the world and the third one between youth aged 15-44
  • 5. 5 The Social Network: is a network structure composed of social entities (might be people, organisations or groups) as nodes. And connections in the form of edges, that represent the connection or relationship amongst these entities. Such connections might be friendships, or business connection etc. Social Network Figure 1: Social Network
  • 6. Data mining is the process of sorting through large datasets to identify patterns and establish relationships to solve problems through data analysis. Data mining Text Mining the process at which text is transferred into data that can be analysed. this incurs the procedures of creating an index for the individual terms, based on the location of the term within the original text, or based on other techniques or protocols. The words and indexes can then be used for a variety of analysis methods. Sentiment Analysis the process of determining the emotional tone behind a series of words, used to gain an understanding of the the attitudes, opinions and emotions expressed within an online mention. 6
  • 7. Figure 2 : Machine learning techniques 7
  • 8. Naive Bayes Classifiers The Naive Bayes algorithm is a widely used algorithm for document classification. Given a feature vector table, the algorithm computes the posterior probability that the document belongs to different classes and assigns it to the class with the highest posterior probability. There are two commonly used models ( multinomial model and multi-variate Bernoulli model) for using Naive Bayes approach for text categorization. 8
  • 9. Figure 3: trainining multinomial naive bayes Figure 4: test multinomial naive bayes 9
  • 10. Example multinomial naive Bayes classifier: Figure 5 Example multinomial naive Bayes classifier Decide: whether document d5 belonging to class c=China? 10
  • 11. Figure 6: resulut of the example multinomial naive Bayes classifier 11
  • 12. Advantages of Naïve Bayes Easy to implement Requires a small amount of training data to estimate the parameters Good results obtained in most of the cases 12
  • 13. Data Pre-processing: The goal behind preprocessing is to represent each document as a feature vector, to separate the text into individual words. Preprocessing text is called tokenization or text normalization. Figure 7: Data Pre-processing 13
  • 14. Proposed classification: Each sentence is first preprocessed and then passed into three categories of classifiers, each deciding whether the sentence belongs to the corresponding category or not. This category contains positive and regular sentences Contains negative sentences that do not contain suicidal phrases like "the taste of pizza today very bad". contains suicidal phrases that are classifed as emergency phrases in the previous search “Each day, each hour, each minute is just torture. I want it to end.” “Waking up every day wishing I hadn’t “ “I feel like the only way to no longer carry this pain is to die.” “I am weird and slow. Every social interaction is painfully awkward. “ “I want to finish my life.” “No one understands me in this life, I'm leaving.” 14 positive negative suicidal
  • 15. Proposal Solution for Zero problem(document): This happens in case the probability of each category is 0, that is when we pass a new text (tweet) entirely on our classification and we do not have any word from this text in the dataset first solution: is switching each word from the new tweet (sentence ) with synonyms, then we re-categorize the new tweet (sentence with the same meaning ) . second solution: the solution depends on the extraction of sentiment from the sentence using the textblob library. Sentiment(polarity,subjectivity). The polarity score is a float within the range [-1.0, 1.0]. The subjectivity is a float within the range [0.0, 1.0] if the number of “polarity” is greater than 0 Then positive else negative 15
  • 16. Implementation Of The System download tweets pre-process the tweets Classification Positive negative suicidal 16 Analysis of the results of Classification The user has suicidal tendencies The user has no suicidal tendencies
  • 17. Username of the person that we want to apply the calassification on it Username Twitter API 17 download tweets
  • 18. 18 Analysis of the results of Classification Calculate the percentage of positiveTweets = positive Calculate the percentage of negativeTweets = negative Calculate the percentage of suicidalTweets = suicidal elif suicidal > 20% and negative > 60% If suicidal > 70% The user has suicidal tendencies else The user has no suicidal tendencies
  • 19. 19
  • 20. 20 This work aims to create a program, which will be capable of detecting suicide in social networks, so the results obtained from this study or this program were acceptable. While we have identified some interesting and promising results, future work will be compare the results of this work with other classification. and also merging our classification with decision tree classification. CONCLUSION

Editor's Notes

  • #8: So we looked for the best technique to classify texts, and we found that the best technique is "yas" Which gives us good results, so we applied them and work to improve the result
  • #12: Priors and conditional probabilities of each word Likely houds
  • #16: But what if we don't find any word(synonym) that matches the text we have in the dataset?
  • #19: Of course we need to consult doctors in this proportion