SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Inferring Peer Centrality
in Socially-Informed P2P Systems
Nicolas Kourtellis, Adriana Iamnitchi
Department of Computer Science & Engineering
University of South Florida
Tampa, USA
11th IEEE International Conference on Peer-to-Peer Computing
Kyoto, Japan, 2011
Socially-aware Applications
 Applications collect and use social information:
 Location, collocation, history of interactions, etc.
 Build (implicit/explicit) social network of users
 Use: reduce spam, provide recommendations, etc.
 Wide range of system architectures
 How does the social network of users affect the load
in a P2P architecture?
2
Decentralization of user social data
• MobiClique
• Yarta
• ...
• PeerSoN
• LifeSocial.KOM
• Safebook
• Prometheus
• …
Social Graphs & P2P Networks
 Users connected with application-specific edges
 User-contributed peers form a P2P network
 User social graph is partitioned into subgraphs &
stored on peers
Questions:
 How do applications traverse a distributed social graph?
 What does it mean for the P2P routing? 3
 Invite user G’s 2-hop hiking contacts to a trip
 Social graph traversals => many P2P lookups
 Application performance affected by projection
of social graph on peers
Application Example
4
=> 1-hop={B, C, E} 2-hops={A, D, F, I}
 How do the properties of the projection graph compare with
the properties of the social graph projected?
Projection Graph
5
Projection
Graph (PG)
P2P Overlay
Social
Graph (SG)
Projection Graph Model
 Uses:
 Study properties of peers such as centrality
 Study how the social graph topology affects P2P
routing & system performance 6
Social Graph SG = (V,E)
V=set of users, E=set of social edges
Projection Graph PG = (VP
,EP
)
VP
=set of peers, EP
=set of P2P edges
PV
(i) = set of users mapped on peer Pi
, Pi
Î VP
(Pi
,Pj
) Î EP
iff $ a Î PV
(i), $ b Î PV
(j) s.t. (a,b) Î E
w(Pi
,Pj
) = (a,b) Î E |a Î PV
(i), b Î PV
(j){ }
7
Outline
 Motivation
 Projection Graph Model
 Social Network Centrality Metrics
 Degree Centrality
 Node Betweenness Centrality
 Edge Betweenness Centrality
 Centrality Calculation: Limitations
 Experimental Questions
 Experimental Methodology
 Experimental Results
 Impacts on Applications & Systems
A
B
C
D
EF
G
H
IJ
K
L
M
N
O
 Number of edges of a node
 High degree centrality peers: Network Hubs
 Can be targeted to directly influence many other
peers with a message broadcast or distribute a
search query
Degree Centrality
8
A
B
C
D
EF
G
H
IJ
K
M
N
O
Node Betweenness Centrality
 Measures the extent to which a node lies on the
shortest path between two other nodes
 High betweenness centrality peers: Control
communication between distant peers
 Can host data caches for reduced latency to locate
data
9
A
B
CD
EF
G
H
I
J
K
L
M
N
O
Edge Betweenness Centrality
 Measures the extent to which an edge lies on the
shortest path between two nodes
 High betweenness centrality edges: Connect
distant parts of P2P network
 Can be monitored to block malware traffic
10
Calculating Peer Centrality
 Challenging because of:
 Limited access to user data (e.g., privacy settings)
 P2P network scale
 Peer churn
 Through experimental analysis on the social and
projection graph, we investigate how to
circumvent these limitations
11
Experimental Questions
 Can we approximate the centrality of peers using
the centrality scores of their users?
 How does the number of users storing data per
peer affect the centrality scores of their peers?
 Social graph is less dynamic than the P2P network
 Calculate infrequently centrality score of users & use it
to estimate their peer’s centrality
Spoiler Alert!
 [1, ~150] users/peer: Can estimate degree &
betweenness centrality of peers with good
accuracy
 Above 150 users/peer: The projection graph
becomes highly connected => peers do not
differentiate in centrality 12
 Naturally-formed communities offer incentives for resource
sharing  1 community subgraph mapped per peer
 Projection graphs generated from 5 real social graphs
 Communities detected via recursive Louvain algorithm*
 Varied average community size: 5,10,20,…,1000 users/peer
 Calculate correlation of centralities of users and their peers
 Compare average centralities of users and their peers
 Identify top centrality peers from their users’ scores
Experimental Methodology
13
Social Network Users Edges
gnutella04 10,876 39,994
gnutella31 62,561 147,878
enron 33,696 180,811
epinions 75,877 405,739
slashdot 82,168 504,230
*V. D. Blondel et al, “Fast unfolding of communities in large networks”,
Journal of Statistical Mechanics: Theory and Experiment, vol. 10, 2008.
Correlation of Centrality Scores
 [1-150] users/peer:
 Projection graph resembles
closely social graph
 Highest correlation of social &
projection graph metrics
 Degree & node betweenness
estimated from local
information (cumulative scores)
14
0
0.2
0.4
0.6
0.8
1
1 10 100 1000
DegreeCentralityCorrelation
Users/Peer (a)
gnutella04
enron
gnutella31
epinions
slashdot
0
0.2
0.4
0.6
0.8
1
1 10 100 1000NodeBetweennessCentralityCorrelation
Users/Peer (b)
gnutella04
enron
gnutella31
epinions
slashdot
0
0.2
0.4
0.6
0.8
1
1 10 100 1000
EdgeBetweennessCentralityCorrelation
Users/Peer (c)
gnutella04
enron
gnutella31
epinions
slashdot
 After 150 users/peer:
 Projection graph topology
loses social properties
 Highly connected network
 Peers participate equally
in graph traversal
Users/Peer
vs.
Degree
Users/Peer
vs.
Node Betweenness
Users/Peer
vs.
Edge Betweenness
Comparison of Centrality Scores
 Increase number of users/peer  turning point in
projection graph
 More connections with other peers
 increase peer degree & betweenness to maximum
 More social edges within peers
 decrease edge betweenness to minimum
15
1e-05
0.0001
0.001
0.01
0.1
1
1 10 100 1000
DegreeCentrality
Users/Peer (a)
gnutella04_CDCU
gnutella04_DCP
enron_CDCU
enron_DCP
gnutella31_CDCU
gnutella31_DCP
epinions_CDCU
epinions_DCP
slashdot_CDCU
slashdot_DCP
1e-05
0.0001
0.001
0.01
0.1
1
1 10 100 1000
NodeBetweennessCentrality
Users/Peer (b)
gnutella04_CNBCU
gnutella04_NBCP
enron_CNBCU
enron_NBCP
gnutella31_CNBCU
gnutella31_NBCP
epinions_CNBCU
epinions_NBCP
slashdot_CNBCU
slashdot_NBCP
1e-11
1e-10
1e-09
1e-08
1e-07
1e-06
1e-05
0.0001
0.001
0.01
1 10 100 1000
EdgeBetweennessCentrality
Users/Peer (c)
gnutella04_CEBCU
gnutella04_EBCP
enron_CEBCU
enron_EBCP
gnutella31_CEBCU
gnutella31_EBCP
epinions_CEBCU
epinions_EBCP
slashdot_CEBCU
slashdot_EBCP
Users/Peer
vs.
Degree
Users/Peer
vs.
Node Betweenness
Users/Peer
Vs.
Edge Betweenness
Finding High Betweenness Peers
 Placing data caches on high betweenness peers
can reduce latency to locate data
 Can we identify such peers, knowing the top
betweenness users or communities?
 Top 5% betweenness centrality users => top betweenness
centrality peers with 80–90% accuracy 16
0
0.2
0.4
0.6
0.8
1
1 10 100 1000
PeerOverlap
Users/Peer (Method 1)
1%
5%
10%
1 10 100 1000
Users/Peer (Method 2)
1%
5%
10%
Users/Peer Users/Peer
With Top-N% users With Top-N% communities
Summary of Findings
 [1, ~150] users/peer:
 Projection graph resembles closely social graph
 Highest correlation of social & projection graph metrics
 Degree & node betweenness can be estimated from
local information (cumulative scores of users)
 Cannot estimate well edge betweenness
 Above 150 users/peer:
 Projection graph topology loses social properties
 A highly connected projection graph
 No differentiation in peer centrality
 Top betweenness centrality users can pinpoint the top
betweenness centrality peers with good accuracy
 Overall: Applications can calculate infrequently
centrality score of users to estimate peer centrality
 Social graph changes slowly compared to P2P network 17
Impact on Applications & Systems
 Target high degree peers to:
 Decrease search time
 Increase breadth of search and diversity of results
 Target high betweenness peers to:
 Monitor information flow and collect traces
 Place data caches and indexes of data location
 Quarantine malware outbursts
 Disseminate software patches
 Tackle P2P churn
 Predict centrality of peers to allocate resources
 Reduce overlay overhead
 Enhance routing tables with P2P edges for faster &
more secure peer discovery
18
19
Thank you!
This work was supported by NSF Grants:
CNS 0952420 and CNS 0831785
http://www.cse.usf.edu/dsg/
nkourtel@mail.usf.edu

Weitere ähnliche Inhalte

Was ist angesagt?

Node XL - features and demo
Node XL - features and demoNode XL - features and demo
Node XL - features and demoMayank Mohan
 
Presentation rutgers
Presentation rutgersPresentation rutgers
Presentation rutgersGuido Borà
 
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
Mining and analyzing social media   part 2 - hicss47 tutorial - dave kingMining and analyzing social media   part 2 - hicss47 tutorial - dave king
Mining and analyzing social media part 2 - hicss47 tutorial - dave kingDave King
 
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELINGDYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELINGAndry Alamsyah
 
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPINGIMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPINGijp2p
 
sos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networkssos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networksswathi78
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisFred Stutzman
 
Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Michael Steurer
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networkseSAT Publishing House
 
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSAPPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSIJwest
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social NetworksKent State University
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collectiondnac
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...syeda yasmeen
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisSujoy Bag
 
Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011guillaume ereteo
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...1crore projects
 

Was ist angesagt? (20)

Node XL - features and demo
Node XL - features and demoNode XL - features and demo
Node XL - features and demo
 
Presentation rutgers
Presentation rutgersPresentation rutgers
Presentation rutgers
 
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
Mining and analyzing social media   part 2 - hicss47 tutorial - dave kingMining and analyzing social media   part 2 - hicss47 tutorial - dave king
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
 
05 Whole Network Descriptive Stats
05 Whole Network Descriptive Stats05 Whole Network Descriptive Stats
05 Whole Network Descriptive Stats
 
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELINGDYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
 
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPINGIMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
 
presentation29
presentation29presentation29
presentation29
 
sos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networkssos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networks
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networks
 
Social network analysis
Social network analysisSocial network analysis
Social network analysis
 
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSAPPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social Networks
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collection
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
 
Social network analysis
Social network analysisSocial network analysis
Social network analysis
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
 

Ähnlich wie Inferring Peer Centrality in Socially-Informed Peer-to-Peer Systems

Studying user footprints in different online social networks
Studying user footprints in different online social networksStudying user footprints in different online social networks
Studying user footprints in different online social networksIIIT Hyderabad
 
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...reshma reshu
 
Friend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life StyleFriend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life Stylepaperpublications3
 
Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...CloudTechnologies
 
Community detection in complex social networks
Community detection in complex social networksCommunity detection in complex social networks
Community detection in complex social networksAboul Ella Hassanien
 
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDAFRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDAJournal For Research
 
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONSA MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONSIJCNC
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Editor IJAIEM
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfBalasundaramSr
 
Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Luis Fernandez Luque
 
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...Nicolas Kourtellis
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis Jari Jussila
 
Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...eSAT Publishing House
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018Arsalan Khan
 
ISDA 2011 Cordoba
ISDA 2011 CordobaISDA 2011 Cordoba
ISDA 2011 CordobaAndrea Zaza
 

Ähnlich wie Inferring Peer Centrality in Socially-Informed Peer-to-Peer Systems (20)

Ppt
PptPpt
Ppt
 
Studying user footprints in different online social networks
Studying user footprints in different online social networksStudying user footprints in different online social networks
Studying user footprints in different online social networks
 
Final Report
Final ReportFinal Report
Final Report
 
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
 
Friend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life StyleFriend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life Style
 
Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...
 
Community detection in complex social networks
Community detection in complex social networksCommunity detection in complex social networks
Community detection in complex social networks
 
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDAFRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
 
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONSA MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONS
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums.
 
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis
 
18 Diffusion Models and Peer Influence
18 Diffusion Models and Peer Influence18 Diffusion Models and Peer Influence
18 Diffusion Models and Peer Influence
 
09 Diffusion Models & Peer Influence
09 Diffusion Models & Peer Influence09 Diffusion Models & Peer Influence
09 Diffusion Models & Peer Influence
 
Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018
 
ISDA 2011 Cordoba
ISDA 2011 CordobaISDA 2011 Cordoba
ISDA 2011 Cordoba
 

Mehr von Nicolas Kourtellis

On managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and servicesOn managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and servicesNicolas Kourtellis
 
Scalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving GraphsScalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving GraphsNicolas Kourtellis
 
Prometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social DataPrometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social DataNicolas Kourtellis
 
Cultures in Community Question Answering
Cultures in Community Question AnsweringCultures in Community Question Answering
Cultures in Community Question AnsweringNicolas Kourtellis
 
Privacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question AnsweringPrivacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question AnsweringNicolas Kourtellis
 
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)Nicolas Kourtellis
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...Nicolas Kourtellis
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)Nicolas Kourtellis
 

Mehr von Nicolas Kourtellis (8)

On managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and servicesOn managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and services
 
Scalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving GraphsScalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving Graphs
 
Prometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social DataPrometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social Data
 
Cultures in Community Question Answering
Cultures in Community Question AnsweringCultures in Community Question Answering
Cultures in Community Question Answering
 
Privacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question AnsweringPrivacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question Answering
 
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
 

Kürzlich hochgeladen

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 

Kürzlich hochgeladen (20)

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 

Inferring Peer Centrality in Socially-Informed Peer-to-Peer Systems

  • 1. Inferring Peer Centrality in Socially-Informed P2P Systems Nicolas Kourtellis, Adriana Iamnitchi Department of Computer Science & Engineering University of South Florida Tampa, USA 11th IEEE International Conference on Peer-to-Peer Computing Kyoto, Japan, 2011
  • 2. Socially-aware Applications  Applications collect and use social information:  Location, collocation, history of interactions, etc.  Build (implicit/explicit) social network of users  Use: reduce spam, provide recommendations, etc.  Wide range of system architectures  How does the social network of users affect the load in a P2P architecture? 2 Decentralization of user social data • MobiClique • Yarta • ... • PeerSoN • LifeSocial.KOM • Safebook • Prometheus • …
  • 3. Social Graphs & P2P Networks  Users connected with application-specific edges  User-contributed peers form a P2P network  User social graph is partitioned into subgraphs & stored on peers Questions:  How do applications traverse a distributed social graph?  What does it mean for the P2P routing? 3
  • 4.  Invite user G’s 2-hop hiking contacts to a trip  Social graph traversals => many P2P lookups  Application performance affected by projection of social graph on peers Application Example 4 => 1-hop={B, C, E} 2-hops={A, D, F, I}
  • 5.  How do the properties of the projection graph compare with the properties of the social graph projected? Projection Graph 5 Projection Graph (PG) P2P Overlay Social Graph (SG)
  • 6. Projection Graph Model  Uses:  Study properties of peers such as centrality  Study how the social graph topology affects P2P routing & system performance 6 Social Graph SG = (V,E) V=set of users, E=set of social edges Projection Graph PG = (VP ,EP ) VP =set of peers, EP =set of P2P edges PV (i) = set of users mapped on peer Pi , Pi Î VP (Pi ,Pj ) Î EP iff $ a Î PV (i), $ b Î PV (j) s.t. (a,b) Î E w(Pi ,Pj ) = (a,b) Î E |a Î PV (i), b Î PV (j){ }
  • 7. 7 Outline  Motivation  Projection Graph Model  Social Network Centrality Metrics  Degree Centrality  Node Betweenness Centrality  Edge Betweenness Centrality  Centrality Calculation: Limitations  Experimental Questions  Experimental Methodology  Experimental Results  Impacts on Applications & Systems
  • 8. A B C D EF G H IJ K L M N O  Number of edges of a node  High degree centrality peers: Network Hubs  Can be targeted to directly influence many other peers with a message broadcast or distribute a search query Degree Centrality 8
  • 9. A B C D EF G H IJ K M N O Node Betweenness Centrality  Measures the extent to which a node lies on the shortest path between two other nodes  High betweenness centrality peers: Control communication between distant peers  Can host data caches for reduced latency to locate data 9
  • 10. A B CD EF G H I J K L M N O Edge Betweenness Centrality  Measures the extent to which an edge lies on the shortest path between two nodes  High betweenness centrality edges: Connect distant parts of P2P network  Can be monitored to block malware traffic 10
  • 11. Calculating Peer Centrality  Challenging because of:  Limited access to user data (e.g., privacy settings)  P2P network scale  Peer churn  Through experimental analysis on the social and projection graph, we investigate how to circumvent these limitations 11
  • 12. Experimental Questions  Can we approximate the centrality of peers using the centrality scores of their users?  How does the number of users storing data per peer affect the centrality scores of their peers?  Social graph is less dynamic than the P2P network  Calculate infrequently centrality score of users & use it to estimate their peer’s centrality Spoiler Alert!  [1, ~150] users/peer: Can estimate degree & betweenness centrality of peers with good accuracy  Above 150 users/peer: The projection graph becomes highly connected => peers do not differentiate in centrality 12
  • 13.  Naturally-formed communities offer incentives for resource sharing  1 community subgraph mapped per peer  Projection graphs generated from 5 real social graphs  Communities detected via recursive Louvain algorithm*  Varied average community size: 5,10,20,…,1000 users/peer  Calculate correlation of centralities of users and their peers  Compare average centralities of users and their peers  Identify top centrality peers from their users’ scores Experimental Methodology 13 Social Network Users Edges gnutella04 10,876 39,994 gnutella31 62,561 147,878 enron 33,696 180,811 epinions 75,877 405,739 slashdot 82,168 504,230 *V. D. Blondel et al, “Fast unfolding of communities in large networks”, Journal of Statistical Mechanics: Theory and Experiment, vol. 10, 2008.
  • 14. Correlation of Centrality Scores  [1-150] users/peer:  Projection graph resembles closely social graph  Highest correlation of social & projection graph metrics  Degree & node betweenness estimated from local information (cumulative scores) 14 0 0.2 0.4 0.6 0.8 1 1 10 100 1000 DegreeCentralityCorrelation Users/Peer (a) gnutella04 enron gnutella31 epinions slashdot 0 0.2 0.4 0.6 0.8 1 1 10 100 1000NodeBetweennessCentralityCorrelation Users/Peer (b) gnutella04 enron gnutella31 epinions slashdot 0 0.2 0.4 0.6 0.8 1 1 10 100 1000 EdgeBetweennessCentralityCorrelation Users/Peer (c) gnutella04 enron gnutella31 epinions slashdot  After 150 users/peer:  Projection graph topology loses social properties  Highly connected network  Peers participate equally in graph traversal Users/Peer vs. Degree Users/Peer vs. Node Betweenness Users/Peer vs. Edge Betweenness
  • 15. Comparison of Centrality Scores  Increase number of users/peer  turning point in projection graph  More connections with other peers  increase peer degree & betweenness to maximum  More social edges within peers  decrease edge betweenness to minimum 15 1e-05 0.0001 0.001 0.01 0.1 1 1 10 100 1000 DegreeCentrality Users/Peer (a) gnutella04_CDCU gnutella04_DCP enron_CDCU enron_DCP gnutella31_CDCU gnutella31_DCP epinions_CDCU epinions_DCP slashdot_CDCU slashdot_DCP 1e-05 0.0001 0.001 0.01 0.1 1 1 10 100 1000 NodeBetweennessCentrality Users/Peer (b) gnutella04_CNBCU gnutella04_NBCP enron_CNBCU enron_NBCP gnutella31_CNBCU gnutella31_NBCP epinions_CNBCU epinions_NBCP slashdot_CNBCU slashdot_NBCP 1e-11 1e-10 1e-09 1e-08 1e-07 1e-06 1e-05 0.0001 0.001 0.01 1 10 100 1000 EdgeBetweennessCentrality Users/Peer (c) gnutella04_CEBCU gnutella04_EBCP enron_CEBCU enron_EBCP gnutella31_CEBCU gnutella31_EBCP epinions_CEBCU epinions_EBCP slashdot_CEBCU slashdot_EBCP Users/Peer vs. Degree Users/Peer vs. Node Betweenness Users/Peer Vs. Edge Betweenness
  • 16. Finding High Betweenness Peers  Placing data caches on high betweenness peers can reduce latency to locate data  Can we identify such peers, knowing the top betweenness users or communities?  Top 5% betweenness centrality users => top betweenness centrality peers with 80–90% accuracy 16 0 0.2 0.4 0.6 0.8 1 1 10 100 1000 PeerOverlap Users/Peer (Method 1) 1% 5% 10% 1 10 100 1000 Users/Peer (Method 2) 1% 5% 10% Users/Peer Users/Peer With Top-N% users With Top-N% communities
  • 17. Summary of Findings  [1, ~150] users/peer:  Projection graph resembles closely social graph  Highest correlation of social & projection graph metrics  Degree & node betweenness can be estimated from local information (cumulative scores of users)  Cannot estimate well edge betweenness  Above 150 users/peer:  Projection graph topology loses social properties  A highly connected projection graph  No differentiation in peer centrality  Top betweenness centrality users can pinpoint the top betweenness centrality peers with good accuracy  Overall: Applications can calculate infrequently centrality score of users to estimate peer centrality  Social graph changes slowly compared to P2P network 17
  • 18. Impact on Applications & Systems  Target high degree peers to:  Decrease search time  Increase breadth of search and diversity of results  Target high betweenness peers to:  Monitor information flow and collect traces  Place data caches and indexes of data location  Quarantine malware outbursts  Disseminate software patches  Tackle P2P churn  Predict centrality of peers to allocate resources  Reduce overlay overhead  Enhance routing tables with P2P edges for faster & more secure peer discovery 18
  • 19. 19 Thank you! This work was supported by NSF Grants: CNS 0952420 and CNS 0831785 http://www.cse.usf.edu/dsg/ nkourtel@mail.usf.edu