SMA - Module 2 (B)
SMA - Module 2 (B)
(SMA)
CSDC304
Subject In-charge
Email:
1
Social Media Analytics
Lecture
Module II:
Social Network Structure, Measures
& Visualization
Content:
✔ Basics of Social Network Structure - Nodes, Edges & Tie
✔ Describing the Networks Measures - Degree Distribution, Density,
Connectivity, Centralization, Tie Strength & Trust
✔ Network Visualization - Graph Layout, Visualizing Network
features, Scale Issues.
✔ Social Media Network Analytics - Common Network Terms,
Common Social Media Network Types, Types of Networks,
Common Network Terminologies, Network Analytics Tools.
2. https://onlinecourses.nptel.ac.in/noc20_cs78/preview
3. https://nptel.ac.in/courses/106106146
1. https://7layersanalytics.com/
❑ Examination Scheme
✔ IAT1 & IAT2 : 20 marks each (Average of the two)
✔ End Semester Exam : 80 marks
❑ Assignments
✔ Assignment Test 1 or 2 (20 marks each)
Degree Distribution
Density
Connectivity
Centralization
Tie Strength & Trust
Degree Distribution
Density
✔Calculating Density
Connectivity
Centralization
🡪 Network Visualization:
Graph Layout, Visualizing
Network features, Scale Issues
https://kateto.net/2014/04/facebook-data-collection-and-photo-network-
visualization-with-gephi-and-r/#
Graph Layout
• It is based on force-directed
layout algorithms but uses
optimizations in the underlying
code to make the algorithm
computationally efficient.
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
0
Visualizing Network features
❑ The network features, like edge weights, node properties, labels, and
clusters, can also be visualized.
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
1
Visualizing Network features
✔Labels
Labels are some of the more difficult attributes to show in a network,
both on nodes and on edges. The example graphs in the previous section
all have node labels, but the graphs are small and the labels are short.
Figure 4.10 shows a net-work with only 92 nodes, which is still relatively
small.
The nodes represent YouTube videos, and the edges indicate that
they were tagged with at least one similar term.
The node labels are the YouTube identifiers for each video.
Even in this small graph, the image becomes very cluttered with all the
labels shown. Similar problems happen with edge labels. Whether shown
on top of the edge with straight alignment or angled along the edge, the
graph tends to become cluttered and difficult to read.
Some techniques can improve on this a bit, either by putting boxes
around the text, by only showing a few labels of interest, or by relying
on interactive interfaces that only show labels on demand. The latter
allow the user to move the mouse over a node or edge and see the label
or other data on demand. That facilitates exploration of the graph without
the clutter. Still, there are no solutions to totally eliminate this problem
when producing fixed visualization images, so often labels are left off.
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
2
Visualizing Network features
✔ Size, Shape, and Color
Categorical or quantitative attributes are particularly easy to show by
adjustments in size, shape, or color Figures 4.3 - 4.8.
Figure 4.11 shows color encoding of node degree. Darker colors indicate
nodes with higher degrees, and not surprisingly, node a is the darkest.
Node color could also be used to indicate other attributes of a node.
For example, a person’s email indicate if each person is a friend, family
member, classmate, co-worker, and so forth.
Node size
anbeusedtoindicateotherattributes.Forexample,clusteringcoefficientisinteres
tinghere, since thereisatightclusterwhereall
thenodesareconnected,whileinthe rest ofthegraph,theclusteringcoefficient
isverylowforeachnode. Figure 4.12
showsagraphthatusescolorfordegreeandsizeforclustering coefficient.
Edgescanalsobetreatedwithcolororthicknesstoindicatetheirattributes.
Forexample, different types of relationships could each be coded in a
different color.
Edge weights are also commonly visualized.Thesecouldindicatethe
strengthofarelationship,thefrequencyofcommunication,orotherfactors.
Figure4.13 shows thesameexamplenetworkwithweightsaddedtotheedges.
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
3
Visualizing Network features
✔ Size, Shape, and Color
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
4
Visualizing Network features
Size, shape, and color:
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
5
Visualizing Network features
Size, shape, and color:
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
6
Visualizing Network features
✔Larger Graph Properties
Larger graph properties can also be encoded in visualizations.
For example, clusters are sometimes apparent on their own (like
the group to the upper right in Figure 4.11), but visual properties
to indicate them will often clarify a visualization further.
Figure 4.13 shows a new graph that has two main clusters.
This graph is a network of YouTube videos, where nodes
represent videos and edges connected videos that share a
common tag.
All of these videos were tagged with the word “cubs”.
Even without the color coding, the two groups would be relatively
easy to see.
But using a community detection algorithm that groups nodes
into clusters, and then color coding by those clusters, makes it
even more apparent.
This is shown in Figure 4.14.
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
7
Visualizing Network features
✔Larger Graph Properties
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
8
Visualizing Network features
✔Larger Graph Properties
10
Department of Computer Engineering Social Media Analytics –Ms. Annies
9
Network Visualization
Scale Issues
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
0
Scale Issues
The example networks shown so far have been
relatively small a few hundred nodes and a few
thousand edges.
Visualization is very useful for analyzing networks
of this size or smaller. When networks become
much larger, the quality of the visualization
diminishes. Figure 4.15 shows a network from a
peer-to-peer file sharing network. Nodes
represent hosts (computers participating in
the network), and edges represent
connections between them (usually one
computer downloading a file from another).
There are close to11,000nodesin thisnetwork with
roughly40,000 edges. Even with a very low
density (,0.001), there are still too many nodes
and edges to see much of anything.
Dependingonthestructureofthenetwork,itissometi
mespossibletogetuseful
visualizationswithuptoaround10,000nodes;
however,networksunder1,000
nodesaretypicallysafest
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
1
Scale Issues
✔Density
Density can also be a problem for
visualization, even if the number of
nodes is small. Figure 4.16 shows a
network of members of the U.S.
Senate. There are only 100 nodes but
over 4,100 edges. The edges indicate
that the senators have voted the same
way in at least one bill. The edges have
a weight, indicating the percentage of
bills on which the two senators have
voted in the same way. Figure 4.13 has
the edges filtered so that only those
with a weight of 40% or more are
visible. However, as this network
shows, there are no interesting pat-
terns visible with the threshold of 40%;
the network is simply too dense.
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
2
Scale Issues
✔Filtering for visual patterns
It is often difficult to see any patterns in very dense networks.
One way to compensate for this is to filter the networks when
possible.
For example, if we take the same network from Figure 4.13 and
filter the edges so that they only connect senators who have
voted the same way on at least two-thirds of the bills, the pat-
tern changes dramatically.
This is shown in Figure 4.17. In this figure, two clear clusters
emerge, representing the two major political parties.
Furthermore, five senators are pulled out from the major party
clusters along the center, indicating that they frequently vote
with members of both parties.
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
3
Scale Issues
✔Filtering for visual patterns
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
4
Scale Issues
✔Graph simplification
An active area of research in network visualization is graph
simplification.
Because large networks are very common when working with
social media, problems of scale are common.
Graph simplification techniques include grouping clusters of nodes
into a single node and representing the edges between clusters as
a single edge, representing structural patterns as representative
shapes, or showing only part of the graph at a time.
As an example, Figure 4.18 shows a tree-structured network
visualized with a tool called Space Tree (Plaisant et al., 2002).
The nodes and edges to the right of the first level are hidden but
are summarized with triangles.
The size, color, and angle of the triangles indicate the depth,
number of nodes, and width of the summarized structure.
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
5
Scale Issues
✔Graph simplification
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
6
Social Network Structure, Measures & Visualization
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
7
🡪 Social Media Network
❑Analytics
Social Media Network Analytics thus deals with constructing,
analyzing, and understanding social media networks.
❑ The purpose of network analysis is to:
✓Investigate multiplexity:
for example, analyzing comparisons between different link types,
such as friends vs. enemies.
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
8
🡪 Social Media Network
Analytics
11
Department of Computer Engineering Social Media Analytics –Ms. Annies
9
Common Network Terms
❑ The following are some of the social media networks
common terms:
o NETWORK
o SOCIAL NETWORKS
o SOCIAL NETWORK SITE
o SOCIAL NETWORKING
o SOCIAL NETWORK ANALYSIS
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
0
Common Network Terms
o NETWORK
✔ A network is a group of nodes that are connected with links.
✔ Nodes (also known as vertices) can represent anything, including
individuals, organizations, countries, computers, websites, or any other
entities.
✔ Links (also known as ties, edges, or arcs) represent the relationship
among the nodes in a network.
✔ Networks can also exist among animals.
o SOCIAL NETWORKS
✔ A social network is a group of nodes and links formed by social entities
where nodes can represent social entities such as people and
organizations.
✔ Links represent their relationships, such as friendship and trade relations.
✔ Social networks can exist both in the real and online worlds.
✔ A network among classmates is an example of real world social network
or a Twitter follow following network is an example of an online social
media network. In a Twitter follow-following network, nodes are the
Twitter users, and links among the nodes represents the follow-following
relationship (i.e., who is following whom) among the users.
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
1
Common Network Terms
o SOCIAL NETWORK SITE
A social network site is a special-purpose software (or social
media tool) designed to facilitate the creation and maintenance of
social relations.
Examples: Facebook, Google+, and LinkedIn.
o SOCIAL NETWORKING
The act of forming, expanding, and maintaining social relations is
called social networking.
Example, form, expand, and maintain online social ties with family,
friends, colleagues, and sometimes strangers.
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
3
Common Social Media Network Types
❑ The following are some everyday types of social media networks
that we come across and that can be subject to network analytics.
o FRIENDSHIP NETWORKS
o FOLLOW-FOLLOWING NETWORKS
o FAN NETWORK
o GROUP NETWORK
o PROFESSIONAL NETWORKS
o CONTENT NETWORKS
o DATING NETWORKS
o COAUTHORSHIP NETWORKS
o COCOMMENTER NETWORKS
o COLIKE
o COOCCURRENCE NETWORK
o GEO COEXISTENCE NETWORK
o HYPERLINK NETWORKS
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
4
Common Social Media Network Types
❑The following are some everyday types of social media networks
that we come across and that can be subject to network
analytics:
FRIENDSHIP NETWORKS
✔Are Facebook, Google+, and Cyword. Friendship networks let people
maintain social ties and share content with people they closely associate
with, such as family and friends.
✔Nodes in these networks are people, and links are social relationships.
✔Example: friendship, family, and activities
FOLLOW-FOLLOWING NETWORKS
✔Users follow (or keep track of) other users of interested.
✔Twitter is a good example of follow-following network where users
follow influential people, brands, and organizations.
✔Nodes in these networks are, for example, people, brands, and
organizations, and links represents follow following relations
✔Example: who is following whom
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
5
Common Social Media Network Types
o FAN NETWORK
✔ A fan network is formed by social media fans or supporters of someone
or something, such as a product, service, person, brand, business, or
other entity.
✔ The network formed by the social media users subscribed to your
Facebook fan page is an example of a fan network.
✔ Nodes in these networks are fans, and links represent colikes,
cocomments, and coshares.
GROUP NETWORK
✔ Group networks are formed by people who share common interests and
agendas.
✔Most social media platforms allow the creation of groups where member
can post, comment, and manage in-group activities.
Examples of social media groups are Twitter professional groups, Yahoo
Groups, and Facebook groups.
✔Nodes in these networks are group members, and links can represent
✔cocommenting, coliking, and coshares.
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
6
Common Social Media Network Types
PROFESSIONAL NETWORKS
LinkedIn is a good example of professional networks where people manage their
professional identify by creating a profile that lists their achievements,
education, work history, and interests. Nodes in these networks are, for example,
people, brands, and organizations, and links are professional relations (such as
coworker, employee, or collaborator). An important feature of professional
networks is the endorsement feature, where people who know you can endorse
your skills and qualification.
CONTENT NETWORKS
Content networks are formed by the content posted by social media users. A
network among YouTube videos is an example of a content network. In such a
network, nodes are social media content (such as videos, tags, and photos) and
links can represent, for example, similarity (content belonging to the same
categories that can be linked together).
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
7
Common Social Media Network Types
DATING NETWORKS
Dating networks (such as match.com and Tender) are focused on matching and
arranging a dating partner based on personal information (such as age, gender,
and location) provided by a user. Nodes in these networks are people, and links
represent social relations (such as romantic relation).
COAUTHORSHIP NETWORKS
Coauthorship networks are two or more people working together to collaborate
on a project. Wikipedia (an online encyclopedia) is a good example of a social
media-based coauthorship network created by millions of authors from around
the world(Biuk-Aghai 2006). A more explicit example of the coauthorship
network is the ResearchGate platform: a social networking site for researchers to
share articles, ask and respond to questions, and find collaborators. In these
networks, nodes are, for example, researchers, and links represent the
coauthorship relationship.
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
8
Common Social Media Network Types
COCOMMENTER NETWORKS
Cocommenter networks are formed when two or more people comment on social
media content (e.g., a Facebook status update, blog post, or YouTube video). A
cocommenter network can, for example, be constructed from the comments
posted by users in response to a video posted over YouTube or a Facebook fan
page. In these networks, nodes represent users, and link represents the
cocommenting relationship.
COLIKE
In a similar way, colike networks are formed when two or more people like the
same social media content. Using NodeXL (a social network analysis tool), one
can construct a network based on colikes (two or more people liking a similar
content) over Facebook fan page. In such network, nodes will be Facebook
users/fans and links will be the colikes relationship.
12
Department of Computer Engineering Social Media Analytics –Ms. Annies
9
Common Social Media Network Types
COOCCURRENCE NETWORK
Cooccurrence networks are formed when two more entities (e.g., keywords,
people, ideas, and brands) cooccur over social media outlets. For example, one
can construct a cooccurrence network of brand names (or people) to investigate
how often certain brands (or people) cooccur over social media outlets. In such
networks, nodes will be the brand names and the links will represent the
cooccurrence relationships among the brands.
GEO COEXISTENCE NETWORK
Geo coexistence networks are formed when two more entities (e.g., people,
devices, and addresses) coexist in a geographic location. In such a network
nodes represents entities (e.g., people), and links among them represent
coexistence.
HYPERLINK NETWORKS
Hyperlink is a mechanism to move among electronic documents
(such as websites). Hyperlinks can be referred to as being either in-links (i.e.,
hyperlinks originating in other websites (Björneborn and Ingwersen 2004), thus
bringing traffic/users to your website) or out-links (i.e., links originating in your
website and going out (Bjorneborn 2001), thus sending traffic to other websites).
Hyperlink also forms networks. Typically, in these network nodes are website,
and links represent referral relationships (in the form of in-links or out-links).
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
0
🡪 Social Media Network
Analytics
Types of Networks
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
1
Types of Networks
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
2
Types of Networks
1)BASED ON EXISTENCE
Based on the way the networks exist online or are constructed, they can be
classified as
1. Implicit Networks
Implicit networks do not exit by default (or are hidden) and need to be
intentionally constructed with the help of dedicated tools and techniques.
Examples of such networks include keyword cooccurrence networks, cocitation
networks, cocommenter networks, hyperlink networks, etc.
Constructing and studying implicit networks can provide valuable information
and insights.
2. Explicit Networks
Explicit social media networks exist by default; in other words, they are
explicitly designed for social media users to be part of. Most social media
networks are explicit in nature.
Examples of explicit social media networks include Facebook friendship network,
Twitter follow-following networks, LinkedIn professional networks, YouTube
subscribers’ network, and bloggers’ networks.
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
3
Types of Networks
Undirected Network
In undirected networks, the links among the nodes do not have any
direction. A Facebook friendship network is an example of undirected
network.
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
4
Types of Networks
3) BASED ON MODE
Based on the composition of nodes, networks can be classified as 1) one-mode
network, 2) two-mode networks, and 3) multimode networks.
One-Mode Networks
A one-mode network is formed among a single set of nodes of the same nature.
A Facebook friendship network is an example of a one-mode network where nodes
(people) form network ties (friendships).
Two-Mode Networks
Two-mode networks (also known as bipartite networks) are networks with two
sets of nodes of different classes. In these networks, network ties exist only
between nodes belonging to different sets.
For example, consider the two-mode network given , where one set of nodes
(circles) could be social media users and other set of nodes (squares) could be
participation in a series of events. Users are linked to the events they attended.
Multimode Network
A multimode network is also possible where multiple heterogeneous nodes are
connected together. It can be considered as an amalgam of one and two-mode
networks.
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
5
Types of Networks
4) BASED ON WEIGHTS
Networks can also be classified based the weight assigned to the links
among the nodes.
Mainly there are two types of weighted networks:
1)Weighted Networks
✔In weighted networks, the links among nodes bear certain weights to
indicate the strength of association among the nodes.
✔The link (relationship) between, for example, two Facebook friends
(nodes) will be thicker if they communicate more frequently.
✔Weighted networks can provide rich information, but are difficult to
construct.
2) Unweighted Networks
✔In unweighted networks, links among nodes does not bear weights.
✔The links only indicate the existence of a relationship and cannot provide
clues about the strength of relationship.
✔Unweighted networks are easy to construct, but may conceal useful
information.
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
6
Types of Networks
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
7
🡪 Social Media Network
Analytics
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
8
Common Network Terminologies
❑Network terminologies or properties can be divided into two
categories:
1. NODE-LEVEL PROPERTIES
Node-level properties focus on one node and its position in the
network. Some important node properties include:
o Degree Centrality
Degree centrality of a node in a network measures the number
of links a node has to other node.
Example: In a Facebook network,it will measure the number
friendship ties a user has.
o Betweenness Centrality
Betweenness centrality is related to the centrality (or position)
of a node in a network. The nodes with high betweenness
centrality have the ability to control or facilitate collaboration or
flow of information due to their central position in the network
Example: In a Facebook friendship network, for the users who occupy the
central position are better positioned tocontrol the flow of social media content.
13
Department of Computer Engineering Social Media Analytics –Ms. Annies
9
Common Network Terminologies
o Eigenvector Centrality
Eigenvector centrality measures the importance of a node
based on its connections with other important nodes in a
network.
It can provide an understanding of a node’s networking ability
relative to that of others.
o Structural Holes
The idea of structural holes was first put forward by Burt, who
suggest that in a network exists when a certain node has an
advantage or disadvantage of its location in a network.
A node that is connected to users who are themselves not
directly connected has the opportunity to mediate between
them and profit from this mediation.
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
0
Common Network Terminologies
2. NETWORK-LEVEL PROPERTIES
Network properties provide insight into the overall structure
and health of a network.
Important network-level properties include:
o Clustering Coefficient
The clustering coefficient of a network is the degree to which
nodes in a network tend to cluster or group together.
o Density
The density of a network deals with a number of links in a
network.
Density can be calculated as the number of links present in a
network divided by the number of all possible links between pairs
of nodes in a network (for an undirected network, the number of
all possible links can be calculated as n (n – 1)/2); where n is the
number of nodes in a network).
A fully connected network, in which each node is connected to
every other node, will have a density of 1.
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
1
Common Network Terminologies
o Components
Components of a network are the isolated sub-networks that
connect within, but are disconnected between, sub-
networks.
In a connected component, all nodes are connected and reachable,
but there is no path between a node in the component and any
node not in the component. The main or largest component of a
network is the component with the largest number of nodes.
o Diameter
The diameter of a network is the largest of all the calculated
shortest path between any pair of nodes in a network and it
can provide an idea of how long it would take for some
information /ideas/message to pass through the network.
o Average Degree
The average degree centrality measures the average number of
links among nodes in a network.
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
2
🡪 Social Media Network
Analytics
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
3
Network Analytics Tools
NodeXL: (an add-in for Microsoft Excel) is the free tool for
social network analysis and visualization.
It can help you construct and analyze Facebook networks (based on
colikes and cocomments), Twitter networks (followers, followings,
and tweets), and YouTube networks (user network and comments),
among others.
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
4
Network Analytics Tools
Netminer: (http://www.netminer.com/) is also a software
application for large social network analysis and visualization.
The application can used be for free for 28 days.
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
5
Social Media Analytics: An Overview
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
6
Thank you …
14
Department of Computer Engineering Social Media Analytics –Ms. Annies
7