Detecting Stress Based On Social Interactions in Social Networks
Detecting Stress Based On Social Interactions in Social Networks
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 1
Abstract—Psychological stress is threatening people’s health. It is non-trivial to detect stress timely for proactive care. With the
popularity of social media, people are used to sharing their daily activities and interacting with friends on social media platforms,
making it feasible to leverage online social network data for stress detection. In this paper, we find that users stress state is closely
related to that of his/her friends in social media, and we employ a large-scale dataset from real-world social platforms to systematically
study the correlation of users’ stress states and social interactions. We first define a set of stress-related textual, visual, and social
attributes from various aspects, and then propose a novel hybrid model - a factor graph model combined with Convolutional Neural
Network to leverage tweet content and social interaction information for stress detection. Experimental results show that the proposed
model can improve the detection performance by 6-9% in F1-score. By further analyzing the social interaction data, we also discover
several intriguing phenomena, i.e. the number of social structures of sparse connections (i.e. with no delta connections) of stressed
users is around 14% higher than that of non-stressed users, indicating that the social structure of stressed users’ friends tend to be
less connected and less complicated than that of non-stressed users.
Index Terms—Stress detection, factor graph model, micro-blog, social media, healthcare, social interaction.
1.1 Motivation
# #"*%!$#HH.YCXG- # #"*%!$#=aWJ.A>5F7b{j
#%&# %!&$* (!#$ KL KL
Psychological stress is becoming a threat to people’s $# #"*%!# m]_phl:P.XPRd`96+
Don’t worry, you will get better with more efforts!
health nowadays. With the rapid pace of life, more and $ $%' , !%!$*!&%#&$* (!# KL Feeling stupid, I had tried my best but couldn’t do well. KL
more people are feeling stressed. According to a worldwide #&"#(% !#*!&%! KL Scores are not everything! KL
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 2
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 3
The contributions of this paper are as following. states from social media by learning user-level presentation
via a deep convolution network on sequential tweet series
• We propose a unified hybrid model integrating CNN
in a certain time period. Motivated by the principle of
with FGM to leverage both tweet content attributes
homophily, [38] incorporated social relationships to improve
and social interactions to enhance stress detection.
user-level sentiment analysis in Twitter. Though some user-
• We build several stressed-twitter-posting datasets by
level emotion detection studies have been done, the role that
different ground-truth labeling methods from several
social relationships plays in one’s psychological stress states, and
popular social media platforms and thoroughly eval-
how we can incorporate such information into stress detection
uate our proposed method on multiple aspects.
have not been examined yet.
• We carry out in-depth studies on a real-world large-
Research on leveraging social interactions for social
scale dataset and gain insights on correlations be-
media analysis. Social interaction is one of the most im-
tween social interactions and stress, as well as social
portant features of social media platforms. Now many
structures of stressed users.
researchers are focusing on leveraging social interaction
The rest of this paper is organized as follows. Section information to help improve the effectiveness of social me-
2 gives an overview of related works. Section 3 presents dia analysis. [12] analyzed the relationships between social
our problem statement. Then in Section 4, we introduce interactions and users’ thinking and behaviors, and found
the definitions of the proposed attributes. Section 5 presents out that Twitter-based interaction can trigger effectual cog-
the hybrid model and training method for stress detection. nitions. [49] leveraged comments on Flickr to help predict
Experimental results are shown in Section 6. Then in Section emotions expressed by images posted on Flickr. However,
7, we present several in-depth studies on our dataset for these work mainly focused on the content of social inter-
further insights. Finally, we make some conclusions and actions, e.g., textual comment content, while ignoring the
discuss in Section 8. inherent structural information like how users are connected.
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 4
TABLE 1
Summary of tweet-level attributes. The column “#“ indicates the feature vector length for each type of feature
Definition 4. Time-varying attribute-augmented network. degree adverbs. We have also tested other linguistic re-
An attribute-augmented network at time t is comprised of sources including NRC5 and HowNet6 , and found that the
four elements, including 1) a user set V t , 2) an edge set E t , performances were relatively the same, so we adopted the
3) a user-level attribute matrix set X t , and 4) a stress state set commonly used LIWC2007 dictionary for experiments. Fur-
for all users Y t at time t, denoted as Gt = (V t , E t , X t , Y t ). thermore, we extract linguistic attributes of emoticons (e.g.,
and ) and punctuation marks (’!’, ’?’, ’...’, ’.’). Weibo
Given a sequence of labeled time-varying attribute-
defines every emoticon in square brackets (e.g., they use
augmented networks at different times, our goal is to learn a
[haha] for “laugh”), so we can map the keyword in square
model that can best fit the relationships among users’ stress
brackets to find the emoticons. Twitter adopts Unicode
states, user-level attributes, and users’ social linkage, and
as the representation for all emojis [15], [24], which can
then detect users’ unknown stress states with the model.
be extracted directly. The list of linguistic attributes and
Problem 1. Psychological stress detection: Given a series descriptions are shown in Table 1.
of T partially labeled time-varying attribute-augmented net- As for the visual attributes, we use API from OpenCV 7
works {Gt = (VLt , VUt , E t , YLt ) | t ∈ {1, 2, · · · , T }}, VLt is a to perform picture processing and color-related attributes
set of users with labeled stress states YLt at time t, and VUt is computation, e.g., saturation, brightness, warm/cool color,
a set of unlabeled users, the objective is to learn a function clear/dull color in Table 1. For a special class of attributes
named five-color theme, we adopt algorithm from papers
f : {G1 , G2 , . . . GT } → {YU1 , YU2 , . . . YUT } on affective image classification [32] and color psychology
theories [23], [45]. In this work, we did not adopt the
to predict unlabeled users’ stress states.
direct emotional detection results as visual features because
we need multi-dimensional visual features for deep model
learning, while a direct visual emotional classification result
4 ATTRIBUTES C ATEGORIZATION AND D EFINITION only gives a single or very few dimensions as features.
To address the problem of stress detection, we first define However, with the development of emotion-sensitive visual
two sets of attributes to measure the differences of the representation techniques, it would be possibility to adopt
stressed and non-stressed users on social media platforms: automatic visual features in the future. The details of tweet-
1) tweet-level attributes from a user’s single tweet; 2) user- level attributes are summarized in Table 1.
level attributes summarized from a user’s weekly tweets.
4.2 User-Level Attributes
4.1 Tweet-level Attributes Compared to tweet-level attributes extracted from a sin-
Tweet-level attributes describe the linguistic and visual gle tweet, user-level attributes are extracted from a list of
content, as well as social attention factors (being liked, user’s tweets in a specific sampling period. We use one
commented, and retweeted) of a single tweet. week as the sampling period in this paper. On one hand,
For linguistic attributes, we take the most commonly psychological stress often results from cumulative events or
used linguistic features in sentiment analysis research. mental states. On the other hand, users may express their
Specifically, we first adopt LTP [4] — A Chinese Language chronic stress in a series of tweets rather than one. Besides,
Technology Platform — to perform lexical analysis, e.g., the aforementioned social interaction patterns of users in
tokenize and lemmatize, and then explore the use of a a period of time also contain useful information for stress
Chinese LIWC dictionary — LIWC2007 [14], to map the detection. Moreover, as aforementioned, the information in
words into positive/negative emotions. LIWC2007 is a dic- 5
http://www.saifmohammad.com/WebPages/NRC-Emotion-
tionary which categorizes words based on their linguistic Lexicon.htm
or psychological meanings, so we can classify words into 6
http://www.keenage.com
different categories, e.g. positive/negative emotion words, 7
http://opencv.org
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 5
tweets is limited and sparse, we need to integrate more The user-level attributes of a user at time t are denoted
complementary information around tweets, e.g., users’ so- by xti (i=1,2,· · · ) in Figure 3. The route of the other users’
cial interactions with friends. attributes in Figure 3 are similar, which finally form their
Thus, appropriately designed user-level attributes can user-level attributes. We focus on the attribute flow of the
provide a macro-scope of a user’s stress states, and avoid user with red star and omit the detailed route of other users’
noise or missing data. Here, we define user-level attributes attributes in the figure. The stress state of each user at time
from two aspects to measure the differences between t is denoted by yit (i=1,2,· · · ), respectively. The user-level
stressed and non-stressed states based on users’ weekly attributes and the stress states are connected by an attribute
tweet postings: 1) user-level posting behavior attributes [29] factor, while stress states of different users are connected by
from the user’s weekly tweet postings; and 2) user-level social factors. Stress states of the same user at adjacent times
social interaction attributes from the user’s social interactions are connected by dynamic factors. We define the graph as a
beneath his/her weekly tweet postings. The details of user- (PFG). By calculating the factors, we can finally derive all
level attributes are summarized in Table 2. users’ stress states over different weeks.
In the following, we will describe the details of the CNN
5 M ODEL F RAMEWORK with CAE and PFG used in the architecture that tackles
the tweet series with cropped modalities and leverages the
Two challenges exist in psychological stress detection. 1) social interaction information between users, respectively.
How to extract user-level attributes from user’s tweeting series
and deal with the problem of absence of modality in the tweets? 5.2 Learning Aggregated Attributes From Tweet Series
2) How to fully leverage social interaction, including interaction
To aggregate user-level attributes, we need to face two major
content and structure patterns, for stress detection? To tackle
challenges: (1) Missing modality, e.g., tweets with only text
these challenges, we propose a novel hybrid model by
but no picture AND (2) How to generate a distributed and
combining a factor graph model with a convolutional neural
modality-invariant representation for each tweets.
network (CNN), since CNN is capable of learning unified
To solve above challenges in cross-media tweet data, we
latent features from multiple modalities, and factor graph
use a cross auto-encoder (CAE) [28] to learn the modality-
model is good at modeling the correlations. In this section,
invariant representation of each single tweet with different
we will first introduce the architecture of our model, and
modalities. Denoting the text, visual, and social attributes of
then describe the details of each part of the proposed model.
a tweet by vT , vI , and vS , the CAE is formulated as follows:
5.1 Architecture u = f (wT vT + wI vI + wS vS + b)
(1)
Figure 3 shows the architecture of our model. There are three +
(vT , vI , vS ) = f (wu b)
types of information that we can use as the initial inputs, i.e., where u is the modality-invariant representation. wT , wI ,
tweet-level attributes, user-level posting behavior attributes, wS , and b are parameters in the encoder, whereas w T ,
and user-level social interaction attributes, whose detailed wI , wS , and b are parameters in the decoder. f (·) is the
computation will be described later. We address the solution activation function. We use a sigmoid activation function
through the following two key components: f (z) = 1+exp(−z) 1
in our model. vT , vI , vS are the recon-
• First, we design a CNN with cross autoencoders structed input modalities.
(CAE) to generate user-level interaction content at- The basic idea of CAE is to force the model to reconstruct
tributes from tweet-level attributes. The CNN has missing modalities in the training stage and to learn cross
been found to be effective in learning stationary modalities correlation from the data (e.g. negative words
local attributes for series like images [3], [6] and in text correlate with cool color in pictures). [18] While
audios [30], [48]. training the cross auto-encoder, we use training data that
• Then, we design a partially-labeled factor graph contains all the three modalities. We manually disable the
(PFG) to incorporate all three aspects of user-level visual modalities and/or social interaction 8 modality of
attributes for user stress detection. Factor graph the training data, and require it to reconstruct all three
model has been widely used in social network mod- modalities. We train the CAE with a cropped set of data
eling [10], [39], [44]. It is effective in leveraging social v T , v I , v S that inputs from one or two modalities are absent,
correlations for different prediction tasks. while requiring it to reconstruct all the three.
We use the stochastic gradient descent to train the CAE.
Take the user labeled with a red star in Figure 3 as an Denoting all the parameters in the CAE as θ, the energy
example. We extract attributes from each tweet of the user function is defined as follows.
to form tweet-level attributes as shown in the cylinders. ⎛ ⎞
Different colors represent different modalities and blank 1⎝ 2
J (v T , v T , v S ; θ) = vM − vM ⎠
(white color) represents modalities that are not available in 2 M ∈T,I,S
the tweet. The tweet-level attributes in the cylinder are fed ⎛ ⎞ (2)
to cross autoencoders (CAEs) [28]. The CAEs are embedded λ⎝ 2 2⎠
in a CNN [26], [29] that will integrate attributes from CAEs + wM + w M .
2 m∈T,I,S
into the aggregated user-level content attributes by pooling
each attribute map. The user-level content attributes, user- 8
Different from the social interaction attributes in this paper, the
level posting behavior attributes, and user-level social in- social interaction here is the attribute of a single tweet defined in [28].
teraction attributes together form the user-level attributes. It is simply the mean and variance of interaction numbers of a tweet.
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 6
TABLE 2
Summary of user-level attributes. The column “#“ indicates the feature vector length for each type of feature
y1t+1
h yit , yit+1
xt1 y1t ?
f xt1 , y1t
g y1t , y2t
g y1t , y3t
xt2 y2t
f xt2 , y2t
xt3
y3t
f xt3 , y3t
h yit , yit−1
y1t−1
Fig. 3. Architecture of our model. The model consists of two parts. The first part is a CNN. The second part is a FGM. The CNN will generate
user-level content attributes by convolution with CAE filters as input to the FGM. Take the user labeled with a red star as example. Tweet-level
attributes of the user are processed through a convolution with CAE to form the user-level content attributes. The user-level attributes are denoted
by xti in the left box. Every xti contains three aspects: user-level content attributes, user-level posting behavior attributes, and user-level social
interaction attributes. Data of other users follows the same route. In the FGM, attribute factors connect user-level attributes to corresponding stress
states. Social factors connect the stress state of different users. Dynamic factors connect stress state of a user over time. The output of the user’s
user-level stress state at time t is y1t as highlighted in red, which actually denotes the stress state of the user in weekly period in this paper.
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 7
The first term measures the reconstruction accuracy. The Input: a series of time-varying attribute augmented
second term is the weight decay regularization term that network G with stress states on some of the user
prevents parameters in the model from diverging arbitrarily. nodes, learning rate η ;
λ is the regularization weight. Using data with different Output: parameter value θ = {α, {βc }, γ} and full stress
state vector Y ;
modalities as input, the CAE can be trained and learn a
modality-invariant representation u. Randomly initialize Y ;
The attributes of tweets, which come from a user’s Initialize model parameters θ;
repeat
weekly tweets in timeline, form a time series. To model a
Compute gradient ∇α, ∇βc , ∇γ ;
user as a subject of series of tweets, we apply CNN [26] Update α ← α + η × ∇α;
which has large learning capacity, but has much fewer Update βc ← βc + η × ∇βc ;
connections and parameters to learn than similar-size stan- Update γ ← γ + η × ∇γ ;
dard network layers. It focuses on learning stationary local until convergence;
attributes from series like images (pixel series), audio, and Algorithm 1: Learning and inference by factor graph
other time series. We can learn user-level content attributes
from a series of individual tweets in a time series to describe
a user’s stress state over a week. All attributes of tweets in
a time series form a one-Dimensional series. We use an 1- The joint probability has three types of factor functions,
Dimension CNN in our model. corresponding to the intuitions we have discussed.
CAE units are listed in the attribute maps of the CNN. Attribute factor. We use this factor f (xti , yit ) to represent
They connect to a patch of instance. CAE units take patches the correlation between user vi ’s stress state at time t and
with missing modalities and generate modality-invariant her/his attributes xti . More specifically, we instantiate the
attribute maps. The CAE units are used as filters in the 1-D factor by an exponential-linear function:
CNN and convolute over the sequence of tweets to form one 1
feature map. Thus the latent user-level content attributes can f (xti , yit ) = exp αT xti (4)
Zα
be generated from the tweet-level attributes of single tweets. where α is a parameter of the proposed model, and Zα is
Pooling is another important step to summarize attribute a normalization term.
maps into fewer attribute instances. Though different users
have different number of tweets in different weeks, the Dynamic factor. We use this factor f (yit , yit+1 ) to represent
period of time over which the tweets are sampled are the the time correlation between user vi ’s stress state at time t
and t + 1. More specifically, we instantiate the factor by an
same. We simply pool each attribute map into one pooled exponential-linear function:
attribute. There are two commonly used pooling operations:
1
max-pooling and mean-pooling. When max pooling is used, h(yit , yit+1 ) = exp γ T h (yit , yit+1 ) (5)
Zγ
the pooled attribute unit is assigned with the maximal
activation among all units in the attribute map. When mean- where γ is the model parameters for this type of factor,
pooling is applied, the mean of activations of all units in the h (·) is defined as a vector of indicator functions, and Zγ is
attribute map is assigned to the pooled attribute unit. Since the normalization term.
we pool over the period of time rather than a certain number Social factor. We use social factor g(ye ) (where e =
of tweets, we use mean-over-time (MOT) in this paper, (vit , vjt , c) ∈ E t ) to represent the correlation between user
which can be calculated by summing up the activations, vi and vj ’s stress states according to c at time t:
since the tweet instances are sampled in the same length 1
of time intervals. g(ye ) = exp βc T g (yit , yjt ) (6)
Z βc
5.3 Learning Latent Correlations Between Tweet’s Con- where βc is the model parameters for this type of factor,
g (·) is defined as a vector of indicator functions, and Zβc is
tent And Social Interactions
the normalization term.
As the social correlation between users and time-dependent Finally, by combining Eq.4, 5, 6 into Eq.3, the objective
correlation are hard to be modeled using classic classi- function as the log-likelihood of the proposed model is:
fiers such as SVM, we use a partially-labeled factor graph
T
N
T
N
model (PFG), which was first proposed in [39], to incorpo- O= αT xti + γ T h (yit , yit+1 )
rate social interactions and tweets’ content for learning and t=1 i=1 t=1 i=1
(7)
detecting user-level stress states.
T
We define an objective function by maximizing the con- + βcT g (yit , yjt ) − log Z
t=1 e∈E t
ditional probability of users’ stress states Y given a series of
attribute-augmented networks where Z = Zα c∈C Zβc Zγ is the global normalization
term.
G = {Gt = {(V t , E t , X t , Y t )}}, t ∈ {1, . . . , T }
Learning. Learning the predictive model is to estimate a pa-
and V = V 1 = · · · = V T , |V | = N , i.e., Pθ (Y |G). The
factor graph [25] provides a way to factorize the “global” rameters configuration θ = (α, {βc }, γ) from the partially-
probability as a product of “local” factor functions, which labeled dataset and to maximize the log-likelihood objective
makes the maximization simple, i.e., function Eq. 7, i.e., θ∗ = arg maxθ O(θ).
T
N For optimization, we adopt a gradient decent method.
P (Y |G) = f (xti , yit )h(yit , yit+1 ) g(ye ) (3) Specifically, we derive the gradients with respect to each
t=1 i=1 e∈E t
parameter in our objective function of Eq. 7.
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 8
T N T N
∂O t t
t t
=E f (xi , yi ) − EPα (Y |G) f (xi , yi ) TABLE 3
∂α t=1 i=1 t=1 i=1
Overview of the Weibo-Stress dataset
⎡ ⎤ ⎡ ⎤
∂O T T
non-stressed stressed total
= E⎣ g(ye )⎦ − EPβ (Y |G) ⎣ g(ye )⎦
∂β #tweets 253, 638 239, 038 492, 676
t=1 e∈E t t=1 e∈E t
T N T N #users 12, 230 11, 074 23, 304
∂O #weeks 17, 861 19, 136 36, 997
=E h(yit , yit+1 ) − EPγ (Y |G) h(yit , yit+1 ) #tweets/week* 14.2 12.5 13.3
∂γ t=1 i=1 t=1 i=1 #weeks/user* 1.46 1.73 1.59
(8) #interacting users/week* 5.79 6.99 6.35
N * means average number.
T t t
where in the first equation, E t=1 i=1 f (x ,
i iy ) is
the expectation of the summation of the attribute factor (retweet, comment, favorite), video and etc. Despite these
functions given the data distribution
over Y and G in the user generated contents, user relationship, which takes the
T
N t t
training set, and EPα (Y |G) t=1 i=1 f (x ,
i iy ) is the ex- form of following on Sina Weibo, also contains abundant
pectation of the summation of the attribute factor functions information for data analysis. Utilizing above information
given by the estimated model. The other expectation terms and features extracted from multiple modalities, we are able
have similar meanings in the other equation. to investigate users emotions, stresses and opinions.
As the network structure in the real world may contain We then tried to identify the weekly stressed state of
cycles, it is intractable to estimate the marginal probability users. Facing the vast scale of social images, manually
in the second terms of 8. In this work, we adopt Loopy Belief labeling is powerless. Instead, we use tags and comments
Propagation (LBP) [33] to calculate the marginal probability for automatic image labeling, which is a common method
of P (Y ) and compute the expectation terms. The learning in previous work. [20], [21], [46] This is done by searching
process can then be described as an iterative algorithm. Each
iteration contains two steps. Firstly, we call LBP to calculate for tweets containing patterns like “I feel stressed this week”
marginal distributions of unknown variables Pα (Y |G). Sec- and “I feel stressed so much this week”, which are used to
ondly, we update α, β , γ with the learning rate η by Eq.9 The indicate that the users are stressed. The weeks containing
learning algorithm terminates when it reaches convergence. such sentence patterns are labeled as “stressed” weeks.
∂O Similarly, we identify “non-stressed” weeks of users by
θnew = θold + η (9) searching for tweets with patterns like “I feel relaxed” and “I
∂θ
feel non-stressed”. These sentence patterns have been shown
Detection. With the estimated parameter θ, we can now
to have high precision against user-assigned psychological
assign the value of unknown labels Y by looking for a label state labels validated by online surveys in weibo [29].
configuration that will maximize the objective function, i.e., In this way, we collected over 19,000 weeks of tweets
Y ∗ = arg max O(Y |G, θ) (10) that are labeled as stressed, and over 17,000 weeks of non-
stressed users’ tweets. There are 492,676 tweets from 23,304
In this paper, we use a max-sum algorithm [31] to solve users in total. We use this dataset for experiments, analysis
this problem. and further in-depth studies, which is represented by DB1
in this paper. Details of the dataset are shown in Table 3.
6 E XPERIMENTS Dataset DB2: We verified the reliability of the above
In this section, we will present the effectiveness and effi- ground truth labeling method through dataset DB2 in Ta-
ciency of our hybrid model on user-level stress detection. ble 4. It is a small dataset collected from the users who
have shared the score of a psychological stress scale PSTR10
designed by psychologists via Weibo. Guided by the rules
6.1 Dataset Collection of the PSTR scale, a user is taken as stressed when the score
To conduct observations and evaluate our succecive model, is larger than 80, otherwise non-stressed. We thus crawled
we first collect a set of datasets using different labeling the scores posted by users, and used the scores as ground
methods, which are listed as following: truth label for the set of tweets in +-3-day window.
Dataset DB1: It is a challenge to construct a dataset with Dataset DB3 and DB4: To further test our method, we
reliable ground truth labels from large-scale noisy social collected two more datasets from Tencent Weibo (DB3) and
media data. The data crawled from social platforms is usu- Twitter (DB4). They are again labeled using the sentence
ally massive, thus manual labeling methods are not feasible pattern labeling method as described above for DB1. In
due to the uncontrollable cost and quality. To solve this particular, as social platforms of different languages, Weibo
problem, we employed a sentence pattern labeling method and Twitter have many differences. [51]. For example, their
to automatically extract labeled data from the crawled large- top topics differs very much. Thus, experiments on Twitter
scale social media data. We first crawled 350 million tweets can validate the universality of our method. The details of
data via Sina Weibo’s REST APIs9 from Oct. 2009 to Oct. the two datasets are presented in Table 4.
2012. Sina weibo, as the biggest microblog website in China,
provides users an open online platform for information
6.2 Experimental Setup
sharing, communication and obtaining. Similar to Twitter
and Facebook, users on Sina Weibo can post contents with In the following experiments, we first train and test our
multiple modalities, including text, image, social action model on the large-scale Sina Weibo dataset DB1. We then
9 10
http://open.weibo.com http://types.yuzeli.com/survey/pstr50
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 9
TABLE 4
Details of other Datasets
Platform Stress label Number of tweets Number of users Number of weeks Tweets per week
stressed 1, 459 98 98 14.9
DB2:Sina Weibo
non-stressed 1, 845 112 112 16.5
(2010.2-2011.9)
summary 3, 304 210 210 15.7
stressed 138, 570 7, 845 8, 974 15.4
DB3:Tencent Weibo
non-stressed 172, 585 8, 239 9, 976 17.3
(2011.11-2013.3)
summary 311, 155 16, 084 18, 950 16.4
stressed 54, 748 4, 905 6, 081 9.0
DB4:Twitter
non-stressed 75, 357 4, 018 6, 545 11.5
(2009.6-2009.12)
summary 130, 105 8, 923 12, 626 10.3
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 10
95 100 100 95
FGM Accuracy(%)
FGM-S 95
90 F1-score(%)
90 FGM-D 80 90
FGM-S-D
F1-score(%)
F1-score(%)
F1-score(%)
90 70
85 85
85 60
80 50 80
80
40
75 750% 300 75
10% 20% 30% 40% 50% 60% 70% 80% 500 1000 1500 2000 2500 3000 3500 4000 DNN-1 DNN-2 DNN-3 DNN-4
(a) Factor Contribution (b) Data Scale Analysis (c) Convergence Analysis (d) DNN Layer Analysis
Fig. 4. Experiment results analysis from various perspectives. (a) Attribute contribution analysis; (b) Factor contribution analysis; (c) Results of
detection performance with different training data scales; (d) Convergence Analysis of FGM.
100
95
SVM FGM CNN-FGM in setting up DNN model. Shallow networks result in trivial
90
model that cannot catch any underlying correlation in data,
85 whereas too deep networks lead to over-complex model
F1-score(%)
80
which is difficult to tune and may suffer from problems
75
like over-fitting. To choose an appropriate DNN model
70
65
for classification, we test DNN with different number of
60 layers. Figure 4(d) summarizes the experiment results. It is
55
B UIC UIS l clear that 2-layer is not sufficient for the model to achieve
T
UP PB UIC UIS UIC UIS UIS UIS UIC UIS UIS Al
T+U T+ T+ UPB+ UPB+ UIC+ UIC+ UPB+ UPB+ UIC+ a satisfactory result. 3-layer model improve significantly
B+ + T+ T+
UP T while 4-layer model reaches the peak. 5-layer model does
not get better result. This is mainly because at 5-layer the
Fig. 5. Experiment results analysis of different attribute combinations network may be too large that it cannot be tuned well with
on different models, with T, UPB, UIC, and UIS representing Tweet-
level attributes, User-level Posting Behavior attributes, User-level social the available data and within a feasible training time.
Interaction Content attributes and User-level social Interaction Structure
attributes respectively. For example, ‘UIC+UIS’ here means a combi- Attribute Contribution Analysis. As described in Section 4,
nation of User-level social Interaction Content attributes and User-level we have defined several set of tweet-level and user-level
social Interaction Structure attributes. attributes from a single tweet’s content as well as users’
ever, integrating attribute factor with social or dynamic posting behaviors and social interactions in a weekly pe-
factor both improve the performance, revealing that both of riod. To evaluate the contribution of different attributes
the two factors are effective for stress detection. Specifically, and compare the effectiveness of our model of leveraging
incorporating social factor significantly improves the detec- different attributes, we compared the proposed model with
tion performance to around 91% on accuracy, indicating that other existing models by using different combinations of
the social factor is extremely effective. The best detection attributes as input. As described in Section 4, the proposed
performance is observed when using all three types of attributes are categorized into four groups: tweet-level at-
factors. tributes, user-level posting behavior attributes, user-level
social interaction content attributes and user-level social
Training Data Scale Analysis. To evaluate the data scala- interaction structure attributes, denoted as T, UPB, UIC, and
bility of the proposed model, we try to train the model with UIS respectively. We compare the detection performance of
different scale of training data, and compare the final detec- the proposed CNN+FGM model with SVM and CNN with
tion performance in F1-score. In this test, we use all the three traditional autoencoder, with all the possible combinations
attributes as input. Figure 4(b) shows the trend of detection of these four set of attributes. For the SVM with the tweet-
performance with different proportions of training data. It level attributes, we simply take the average of the feature
is clear that when using only 1% of all training data, our vectors from a user’s weekly tweets.
model fails to achieve meaningful detection performance. The results of this experiment are shown in Figure 5. We
When adopting approximately 30% of all training data, our see that all the models achieve the best detection perfor-
model can obtain an equally competitive performance of mance when utilizing all the three set of attributes. When
around 93% compared with that when using 50% of training using only the tweet-level features, the detection perfor-
data. Moreover, the performance keeps increasing given mance of the proposed model and the DNN model drops
more training data. These results verify the scalability of to around 86% and 82% respectively in F1-score, which is
our model on large-scale real-world social media datasets. acceptable. While for SVM, the detection performance drops
to around only 70%, which is poor for a binary classification.
Convergence Analysis. We further investigate the conver-
This result demonstrates the effectiveness of the feature
gence of the learning algorithm for FGM, and Figure 4(c)
aggregation of CNN, which is much better than simply
presents the F1-score with increasing number of iterations.
summarizing the feature vectors manually.
We see that the algorithm converges within around 2000
iterations, which is rapid enough for us to conduct efficient Figure 5 also shows the effectiveness of different at-
model training on large scale datasets in practice. tributes. We can see that by using only user-level attributes,
the detection performance of all the models drops dras-
Impact of size of network. Size of network is a critical issue tically compared to that using only tweet-level attributes,
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 11
TABLE 6 1.0
Comparison of results using different modalities. streeed
non-stressed
Text Text + visual Text + Social All 0.8
Accuracy 0.8713 0.8761 0.8628 0.9155
F1-score 0.8794 0.8865 0.8711 0.9340 0.6
Proportion
which shows the importance of the tweet-level attributes.
0.4
By combing different types of user-level attributes, the
detection performance improves by around 3-8% in F1- 0.2
score, showing that the user-level attributes are supple-
mentary to each other. Meanwhile, by combining the user- 0.0
liy nd mo ect ure sad death anx anger gemo
fami frie pose aff leis
level attributes with tweet-level attributes, the detection
ne
performance improves up to 10-20% in F1-score. This result
Fig. 6. Distribution of stress states (stressed and non-stressed) over
indicates that the user-level attributes are great supplements different word categories from tweets’ comments and retweets. Here we
to tweet-level attributes. show 10 most widely used word categories in our data set.
When using only two set of attributes, the detection per-
formance drops to around 91% in F1-score. In case of using DB4 from Twitter. We also test against the Twitter
sole attributes, we see that using solely user-level social dataset. We still use the attribute extractor trained with large
interaction attributes gets the best detection performance of scale Sina Weibo dataset and only finetune the network with
around 90% in F1-score, as compared to the other attributes. Twitter dataset in 5-fold. The accuracy is 77.43% and F1-
This reveals that the proposed user-level social interaction score is 0.8224. One reason for this modest result is that users
attributes are quite effective for stress detection. in Twitter dataset and Sina Weibo dataset come from differ-
ent language and culture background, so that the language
Impact of different modalities in content attributes: patterns and sentimental signals from these two different
Tweets content come with multiple modalities. To eval- language environments can be different, thus the attribute
uate the contribution of each data modality, we conduct extractor trained with large scale Sina Weibo dataset may
experiments with different combination of attributes. Since not be fully functional for Twitter datasets. Nevertheless,
text is the necessary part of a tweet, we test using solely we still achieved acceptable performance in Twitter dataset,
text attributes, and the two combinations of text and vi- which implies that the basic stress patterns between social
sual attributes, and text and social attributes, as well as relations can be transferred in between different language
using all attributes. The results are shown in Table 6. It environments. Another factor could be that the scale of this
is interesting to note that using only text attribute could dataset is rather small. Subjects in the Twitter dataset are on
achieve rather high performance. Simply combining visual the order of 10% than that in large-scale Sina Weibo dataset.
or social attributes with text attributes may even reduce the We look into the collected data and find that, by coincidence,
performance, especially the social attributes. This trend is all tweets in this dataset have no social activity. We conjec-
even more obvious when both types of attributes (content ture this is also one of the causes of the unsatisfactory result.
and posting behavior) are used. Nevertheless, using all
attributes together outperforms that using only the text
attributes; and the highest performance is observed when 7 S TUDIES OF S OCIAL I NTERACTION
using all attribute and working with both types of attributes.
We have presented the experimental results on stress detec-
6.4 Results on Other Datasets tion in the previous section, while in the setting of social
We further evaluate our model on other datasets, DB2-DB4, networks, it would be helpful to further analyze how a
as shown in Table 4, to show that our model is universally user’s stress status is developed and how they affect each
applicable. For these experiments, we use all the proposed other. To do so, we try to conduct several studies on DB1 to
attributes with MOT pooling, and a 4-layer DNN model. offer insights on how social interactions contribute to user
stress and the task of stress detection from the following
DB2 from Sina Weibo with PSTR label.
aspects:
We use a matured model trained with large scale Sina
(1) Content. How are users’ social interaction contents (e.g.,
Weibo dataset, and then test it against another set of subject
language used) related to users’ stress states?
independently sampled from Sina Weibo. For the test set,
we collect weekly tweets from the users that have shared (2) Structure. Compared to non-stressed users, do stressed
the score of a psychological stress scale with 50 items via users show different structural diversity patterns when they
Sina Weibo. Detection result shows that the test accuracy is behave in social networks? Do differences of social influence and
84.26% and F1-score is 0.8785, which demonstrates that the strong/weak ties exist between stressed and non-stressed users?
overall model is consistent and the sentence pattern based
ground truth labeling method is reliable. 7.1 Content
DB3 from Tencent Weibo. We test on data collected from Content of social interaction refers to the content of tweets’
another major Chinese social media platform. For this test, comments and retweets, including text, emoticons, and
we use the attribute extractor trained with large scale Sina punctuation marks. Based on a widely used psychologi-
Weibo dataset and only finetune the network with Twitter cal dictionary LIWC2007 [40], we extract emotional words
dataset in 5-fold. The accuracy is 86.18% and F1-score is from the interaction content of tweets, and categorize
0.8832 which demonstrate the capability of the model. the extracted words into corresponding groups defined in
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 12
or followed) of users that interact with each other via com- 0.08
0.20
Strong Tie-5
Strong Tie-3
ments and retweets also form a kind of social structure. For 0.07
0.15
Weak Tie
Probability
Probability
this in-depth study, we select top four users with the most 0.06
x9
x4
frequent interactions from users’ weekly tweet postings, 0.05 0.10
x6
x2
0.05
0.03
#Stressed Neighbors
5
0.00
1 2 3 4
#Stressed Neighbors
5
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 13
analysis [17], so as to capture the most recent user rela- [8] Glen Coppersmith, Craig Harman, and Mark Dredze. Measuring
tionships in a shifting environment. Figure 8(b) illustrates post traumatic stress disorder in twitter. In Proceedings of the
International Conference on Weblogs and Social Media, pages 579–582,
the results. We can see that strong ties indeed have strong 2014.
influence on users’ stress states, and the influence of weak [9] Rui Fan, Jichang Zhao, Yan Chen, and Ke Xu. Anger is more
ties is relatively weak. For example, when a user has three influential than joy: Sentiment correlation in weibo. PLoS ONE,
stressed strong-tie connections, the probability that the user 2014.
[10] Zhanpeng Fang, Xinyu Zhou, Jie Tang, Wei Shao, A.C.M. Fong,
will become stressed increases to 13%, more than twice as Longjun Sun, Ying Ding, Ling Zhou, , and Jarder Luo. Modeling
high as for a user with three stressed weak-tie connections. paying behavior in game social networks. In In Proceedings of the
Twenty-Third Conference on Information and Knowledge Management
Summary. Based on the experimental results and analyses (CIKM’14), pages 411–420, 2014.
we know that: 1) users’ stress states are not only revealed [11] Golnoosh Farnadi, Geetha Sitaraman, Shanu Sushmita, Fabio
in their own tweets, but also affected by the contents of Celli, Michal Kosinski, David Stillwell, Sergio Davalos,
their social interactions, including commenting on and re- Marie Francine Moens, and Martine De Cock. Computational
personality recognition in social media. User Modeling and User-
tweeting others’ tweets; and 2) users’ stress states are re- Adapted Interaction, pages 1–34, 2016.
vealed by the structure of their social interactions, including [12] Eileen Fischer and A. Rebecca Reuber. Social interaction via new
structural diversity, social influence, and strong/weak ties. social media: (how) can interactions on twitter affect effectual
thinking and behavior? Journal of Business Venturing, 26(1):1–18,
These insights quantitatively prove the necessity and effec- 2011.
tiveness of combining social interactions for stress detection. [13] Jerome H. Friedman. Greedy function approximation: A gradient
boosting machine. Annals of Statistics, 29(5):1189–1232, 1999.
[14] Rui Gao, Bibo Hao, He Li, Yusong Gao, and Tingshao Zhu.
8 C ONCLUSION Developing simplified chinese psychological linguistic analysis
dictionary for microblog. pages 359–368, 2013.
In this paper, we presented a framework for detecting users’ [15] Johannes Gettinger and Sabine T. Koeszegi. More Than Words: The
psychological stress states from users’ weekly social media Effect of Emoticons in Electronic Negotiations.
data, leveraging tweets’ content as well as users’ social inter- [16] Jennifer Golbeck, Cristina Robles, Michon Edmondson, and Karen
Turner. Predicting personality from twitter. In Passat/socialcom
actions. Employing real-world social media data as the ba- 2011, Privacy, Security, Risk and Trust, pages 149–156, 2011.
sis, we studied the correlation between user’ psychological [17] Mark S. Granovetter. The strength of weak ties. American Journal
stress states and their social interaction behaviors. To fully of Sociology, 1973.
leverage both content and social interaction information of [18] Quan Guo, Jia Jia, Guangyao Shen, Lei Zhang, Lianhong Cai, and
Zhang Yi. Learning robust uniform features for cross-media social
users’ tweets, we proposed a hybrid model which combines data by using cross autoencoders. Knowledge Based System, 102:64–
the factor graph model (FGM) with a convolutional neural 75, 2016.
network (CNN). [19] David W. Hosmer, Stanley Lemeshow, and Rodney X. Sturdivant.
In this work, we also discovered several intriguing phe- Applied logistic regression. Wiley series in probability and mathemat-
ical statistics, 2013.
nomena of stress. We found that the number of social struc- [20] Sung Ju Hwang. Discriminative object categorization with exter-
tures of sparse connection (i.e. with no delta connections) nal semantic knowledge. 2013.
of stressed users is around 14% higher than that of non- [21] Sepandar D. Kamvar. We feel fine and searching the emotional
web. In In Proceedings of WSDM, pages 117–126, 2011.
stressed users, indicating that the social structure of stressed
[22] Herbert C. Kelman. Compliance, identification, and internal-
users’ friends tend to be less connected and less complicated ization: Three processes of attitude change. general information,
than that of non-stressed users. These phenomena could be 1(1):51–60, 1958.
useful references for future related studies. [23] Shigenobu Kobayashi. The aim and method of the color image
scale. Color research & application, 6(2):93–107, 1981.
[24] Novak P Kralj, J Smailovi, B Sluban, and I Mozeti. Sentiment of
emojis. Plos One, 10(12), 2015.
R EFERENCES [25] Frank R Kschischang, Brendan J Frey, and H-A Loeliger. Factor
[1] Andrey Bogomolov, Bruno Lepri, Michela Ferron, Fabio Pianesi, graphs and the sum-product algorithm. Information Theory, IEEE
and Alex Pentland. Daily stress recognition from mobile phone Transactions on, 47(2):498–519, 2001.
data, weather conditions and individual traits. In ACM Interna- [26] Yann LeCun and Yoshua Bengio. Convolutional networks for
tional Conference on Multimedia, pages 477–486, 2014. images, speech, and time series. The handbook of brain theory and
[2] Chris Buckley and Ellen M Voorhees. Retrieval evaluation with in- neural networks, 3361, 1995.
complete information. In Proceedings of the 27th annual international [27] Kathy Lee, Ankit Agrawal, and Alok Choudhary. Real-time
ACM SIGIR conference on Research and development in information disease surveillance using twitter data: demonstration on flu and
retrieval, pages 25–32, 2004. cancer. In Proceedings of ACM SIGKDD international conference on
[3] Xiaojun Chang, Yi Yang, Alexander G Hauptmann, Eric P Xing, Knowledge discovery and data mining, pages 1474–1477, 2013.
and Yao-Liang Yu. Semantic concept discovery for large-scale [28] H. Lin, J. Jia, Q. Guo, Y. Xue, J. Huang, L. Cai, and L. Feng. Psy-
zero-shot event detection. In Proceedings of International Joint chological stress detection from cross-media microblog data using
Conference on Artificial Intelligence, pages 2234–2240, 2015. deep sparse neural network. In proceedings of IEEE International
[4] Wanxiang Che, Zhenghua Li, and Ting Liu. Ltp: A chinese Conference on Multimedia & Expo, 2014.
language technology platform. In Proceedings of International Con- [29] H. Lin, J. Jia, Q. Guo, Y. Xue, Q. Li, J Huang, L. Cai, and L. Feng.
ference on Computational Linguistics, pages 13–16, 2010. User-level psychological stress detection from social media using
[5] Chih chung Chang and Chih-Jen Lin. Libsvm: a library for sup- deep neural network. In Proceedings of ACM Int. Conference on
port vector machines. ACM TRANSACTIONS ON INTELLIGENT Multimedia, 2014.
SYSTEMS AND TECHNOLOGY, 2(3):389–396, 2001. [30] Li Liu and Ling Shao. Learning discriminative representations
[6] Dan C Ciresan, Ueli Meier, Jonathan Masci, Luca Maria Gam- from rgb-d video data. In Proceedings of International Joint Confer-
bardella, and Jürgen Schmidhuber. Flexible, high performance ence on Artificial Intelligence, pages 1493–1500, 2013.
convolutional neural networks for image classification. In Proceed- [31] H-A Loeliger. An introduction to factor graphs. Signal Processing
ings of International Joint Conference on Artificial Intelligence, pages Magazine, IEEE, 21(1):28–41, 2004.
1237–1242, 2011. [32] Jana Machajdik and Allan Hanbury. Affective image classification
[7] Sheldon Cohen and Thomas A. W. Stress, social support, and the using features inspired by psychology and art theory. In Proceed-
buffering hypothesis. Psychological Bulletin, 98(2):310–357, 1985. ings of the international conference on Multimedia, pages 83–92, 2010.
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2017.2686382, IEEE
Transactions on Knowledge and Data Engineering
JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 14
[33] Kevin P Murphy, Yair Weiss, and Michael I Jordan. Loopy belief Huijie Lin is currently a Ph.D. candidate of Tsinghua University.
propagation for approximate inference: An empirical study. In
Proceedings of the Fifteenth conference on Uncertainty in artificial Jia Jia is an associate professor in Department of Computer
intelligence, pages 467–475, 1999. Science and Technology, Tsinghua University. She got bachelor degree
[34] Cristian Danescu niculescu mizil, Lillian Lee, Bo Pang, and Jon at Tsinghua University in 2003, and received her Ph.D. degree from
Kleinberg. Echoes of power: Language effects and power differ- Tsinghua University in 2008. Her main research interest is social
ences in social interaction. eprint arXiv:1112.3670, 2011. affective computing and human computer speech interaction. She is
[35] Liqiang Nie, Yi-Liang Zhao, Mohammad Akbari, Jialie Shen, and serving as the secretary-general of Professional Committee of Speech
Tat-Seng Chua. Bridging the vocabulary gap between health seek- in Chinese Information Processing Society, and also a committee
ers and healthcare knowledge. Knowledge and Data Engineering, member of Multimedia Federation in China Society of Image and
IEEE Transactions on, 27(2):396–409, 2015. Graphic.
[36] Federico Alberto Pozzi, Daniele Maccagnola, Elisabetta Fersini,
and Enza Messina. Enhance user-level sentiment analysis on Jiezhong Qiu is currently an Ph.D student of Tsinghua University.
microblogs with approval relations. In AI* IA 2013: Advances in
Artificial Intelligence, pages 133–144. 2013. Yongfeng Zhang is a Postdoc Research Associate in University
[37] Neumann R and Strack F. ”mood contagion”: the automatic of Massachusetts Amherst, US.
transfer of mood between persons. Journal of Personality and Social
Psychology, pages 211–223, 2000. Lexing Xie received the B.S. degree from Tsinghua University,
Beijing, China, in 2000, and the M.S. and Ph.D. degrees from Columbia
[38] Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou,
University, in 2002 and 2005, respectively, all in electrical engineering.
and Ping Li. User-level sentiment analysis incorporating social
She is a Lecturer in the Research School of Computer Science at
networks. In Proceedings of the SIGKDD international conference on
the Australian National University; she was with the IBM T.J. Watson
Knowledge discovery and data mining, pages 1397–1405, 2011.
Research Center, Hawthorne, NY from 2005 to 2010. Her recent
[39] Wenbin Tang, Honglei Zhuang, and Jie Tang. Learning to infer research interests are in multimedia mining, machine learning and
social ties in large networks. In Machine Learning and Knowledge social media analysis. Dr. Xie has won several awards: the best
Discovery in Databases, pages 381–397. 2011. conference paper award in IEEE SOLI 2011, the best student paper
[40] Y. R. Tausczik and J. W. Pennebaker. The psychological meaning awards at JCDL 2007, ICIP 2004, ACM Multimedia 2005 and ACM
of words: Liwc and computerized text analysis methods. Journal Multimedia 2002.
of Language and Social Psychology, 29(1):24–54, 2010.
[41] Mike Thelwall, Kevan Buckley, Georgios Paltoglou, Di Cai, and Jie Tang is an associate professor at the Department of Computer
Arvid Kappas. Sentiment strength detection in short informal text. Science and Technology, Tsinghua University. His main research
Journal of the American Society for Information Science and Technology, interests include data mining algorithms and social network theories.
61(12):2544–2558, 2010. He has been visiting scholar at Cornell University, Chinese University
[42] Svetnik V. Random forest: a classification and regression tool for of Hong Kong, Hong Kong University of Science and Technology, and
compound classification and qsar modeling. Journal of Chemical Leuven University. He has published over 100 research papers in major
Information and Computer Sciences, 43(6):1947–1958, 2003. international journals and conferences including: KDD, IJCAI, AAAI,
[43] Ben Verhoeven, Walter Daelemans, and Barbara Plank. Twisty: A ICML, WWW, SIGIR, SIGMOD, ACL, Machine Learning Journal, TKDD,
multilingual twitter stylometry corpus for gender and personality and TKDE.
profiling. In Proceedings of the Tenth International Conference on
Language Resources and Evaluation LREC, pages 1632–1637, 2016. Ling Feng is a professor of computer science and technology at
[44] Chi Wang, Jie Tang, Jimeng Sun, and Jiawei Han. Dynamic Tsinghua University, Beijing. Her research interests include context-
social influence analysis through time-dependent factor graphs. aware data management toward ambient intelligence, knowledge-based
Advances in Social Networks Analysis and Mining (ASONAM), 2011 information systems, data mining and warehousing, and distributed
International Conference on, pages 239 – 246, 2011. object-oriented database management systems. She has published
[45] Xiaohui Wang, Jia Jia, Peiyun Hu, Sen Wu, Jie Tang, and Lianhong more than 150 scientific articles in high-quality international conferences
Cai. Understanding the emotional impact of images. In Proceedings or journals, and received the 2004 Innovational VIDI Award by the
of the 20th ACM International Conference on Multimedia, MM ’12, Netherlands Organization for Scientific Research, the 2006 Chinese
pages 1369–1370, 2012. ChangJiang Professorship Award by the Ministry of Education, and the
[46] Lexing Xie and Xuming He. Picture tags and world knowledge: 2006 Tsinghua Hundred-Talents Award.
learning tag relations from visual semantic sources. In ACM
Multimedia Conference, pages 967–976, 2013. Tat-Seng Chua joined the National University of Singapore, Singapore,
[47] Yuanyuan Xue and et al. Detecting adolescent psychological in 1983, and spent three years as a Research Staff Member with
pressures from micro-blog. Health Information Science. Springer the Institute of Systems Science, National University of Singapore.
International Publishing, pages 83–94, 2014. He was the Acting and Founding Dean of the School of Computing,
National University of Singapore, from 1998 to 2000. He is currently
[48] Jian Bo Yang, Minh Nhut Nguyen, Phyo Phyo San, Xiao Li Li,
the KITHCT Chair Professor with the School of Computing, National
and Shonali Krishnaswamy. Deep convolutional neural networks
University of Singapore. His research interests include multimedia
on multichannel time series for human activity recognition. In
information retrieval, multimedia question answering, and the analysis
Proceedings of International Joint Conference on Artificial Intelligence,
and structuring of user-generated contents.
pages 3995–4001, 2015.
Dr. Chua has organized and served as a Program Committee Mem-
[49] Yang Yang, Jia Jia, Shumei Zhang, Boya Wu, Qicong Chen, Juanzi
ber of numerous international conferences in the areas of computer
Li, Chunxiao Xing, and Jie Tang. How do your friends on social
graphics, multimedia, and text processing. He was the Conference
media disclose your emotions? In Proceedings of the 28th AAAI
Co-Chair of ACM Multimedia in 2005, the Conference on Image and
Conference on Artificial Intelligence (AAAI’14), pages 306–312, 2011.
Video Retrieval in 2005 and ACM SIGIR in 2008, and the Technical
[50] Shuo Zeng, Mingfeng Lin, and Hsinchun Chen. Dynamic user- PC Co-Chair of SIGIR in 2010. He serves on the editorial boards of
level affect analysis in social media: Modeling violence in the dark the ACM Transactions of Information Systems, Foundation and Trends
web. In IEEE International Conference on Intelligence and Security in Information Retrieval, The Visual Computer, and Multimedia Tools
Informatics (ISI), pages 1–6, 2011. and Applications. He is on the Steering Committees of the International
[51] Qian Zhang and Bruno Goncalves. Topical differences between Conference on Multimedia Retrieval, Computer Graphics International,
chinese language twitter and sina weibo. Computer Science, 2015. and Multimedia Modeling Conference Series. He serves as a member
[52] Yuan Zhang, Jie Tang, Jimeng Sun, Yiran Chen, and Jinghai Rao. of international review panels of two large-scale research projects in
Moodcast: Emotion prediction via dynamic continuous factor Europe. He is the Independent Director of two listed companies in
graph model. 2013 IEEE 13th International Conference on Data Singapore.
Mining, pages 1193–1198, 2010.
[53] Jichang Zhao, Li Dong, Junjie Wu, and Ke Xu. Moodlens: an
emoticon-based sentiment analysis system for chinese tweets. In
Proceedings of the 18th ACM SIGKDD international conference on
Knowledge discovery and data mining, pages 1528–1531, 2012.
1041-4347 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.