Module VI_Mining Social Network Graph (2).pptx
Module VI_Mining Social Network Graph (2).pptx
12
1
33
49
Need to re-compute
betweenness at
every step
• Based on concept:
3-clique
Clique
• Clique: Complete graph
4-clique
Clique
• Clique: Complete graph
5-clique
k-Clique Communities
• Adjacent k-cliques
k=3
k-Clique Communities
• Adjacent k-cliques
k=3
Clique 1
k-Clique Communities
• Adjacent k-cliques
Clique 3
k=3
k-Clique Communities
• Adjacent k-cliques
Two k-cliques are adjacent when they share k-1 nodes
Clique 2
k=3
Clique 1
k-Clique Communities
• Adjacent k-cliques
Two k-cliques are adjacent when they share k-1 nodes
Clique 2 Clique 3
k=3
k-Clique Communities
• k-clique community
Union of all k-cliques that can be reached from each
other through a series of adjacent k-cliques
k-Clique Communities
• k-clique community
Union of all k-cliques that can be reached from each
other through a series of adjacent k-cliques
Clique 2
k=3
Clique 1
k-Clique Communities
• k-clique community
Union of all k-cliques that can be reached from each
other through a series of adjacent k-cliques
Community 1
k=3
k-Clique Communities
• k-clique community
Union of all k-cliques that can be reached from each
other through a series of adjacent k-cliques
Community 1 Clique 3
k=3
k-Clique Communities
• k-clique community
Union of all k-cliques that can be reached from each
other through a series of adjacent k-cliques
Community 1 Community 2
k=3
CPM Algorithm
• Input :- The social graph G , representing a network
and a clique size k.
• Output : Set of discovered Communities C
• Step1 : All k-clique present in G are extracted
• Step 2: A new graph , the clique graph ,Gc formed
where each node represented an identified clique
and two vertices in Gc are connected by an edge ,if
they have k-1 common vertices.
• Step 3: Connected components in Gc are identified
• Step 4: Each connected component in Gc represents a
community.
• Step 5: Set C be the set of communities formed for G.
CPM Algorithm in short
• Locate maximal cliques
– Largest possible clique size can be determined from
degrees of vertices
– Starting from this size, find all cliques, then reduce size by
1 and repeat
• Convert from cliques to k-clique communities
CPM
Example 1
• Find all cliques for the given graph using CPM
showing all steps.
Exercise1: MU COMP May2016
Exercise 2: MU IT May2016
SimRank: A Measure of
Structural-Context Similarity
Motivation
• Many applications require a measure of
“similarity” between objects.
– Web search
– Shopping Recommendations
– Search for “Related Works” among scientific
papers
• But “similarity” may be domain-dependent
• Can we define a generic model for similarity?
Problem Statement
• Given a Graph G = (V, E), for each pair of
vertices a,b ∈ V, compute a similarity
(ranking) score s(a,b) based on the concept of
structural-context similarity.
Basic Graph Model
• Directed Graph G = (V,E)
– V = set of objects
– E = set of unweighted edges
– Edge (u,v) exists if there is an relation u 🡺 v
– I(v) = set of in-neighbors of vertex v
– O(v) = set of out-neighbors of vertex v
SimRank Similarity
• Recursive Model
– “Two objects are similar if they are referenced by
similar objects”
– That is, a ~ b if
• c 🡺 a and d 🡺 b, and
• c~d
– An object is equivalent to itself (score = 1)
• Example
1. ProfA ~ ProfB because both are
referenced by Univ.
2. StudentA ~ StudentB because they
are referenced by similar nodes
{ProfA,ProfB}
SimRank: Example
SimRank
SimRank