Professional Documents
Culture Documents
Towards Segregation Aware Social
Towards Segregation Aware Social
Towards Segregation Aware Social
Recommendations
INTRODUCTION
•A community in a social network refers to a group of people who are
more tightly interconnected than the overall network.
•This dataset consists of 'circles' (or 'friends lists') from Facebook. The
data has been anonymized by replacing the Facebook-internal ids for
each user with a new value.
•It is possible to determine whether two users have the same political
affiliations, but not what their individual political affiliations represent.
Dataset Description
first name Alan position Cryptanalyst
last name
Fig 2. User profile tree representation
Turing company GC&CS
name Cambridge
work
type College
type College
Profile information in all of our datasets can be represented as a tree where each level encodes increasingly
specific information From Facebook we collect data from 26 categories, including hometowns, birthdays,
colleagues, political affiliations, etc.
Main objectives
We have also given users political labels in our dataset namely 0 or 1. So, we have
included all the users having political label 0 in group 0 and all the users having political
label 1 in group 1 and there is also some user who does not belongs to either groups.
User Arrival Process:
The unit of ri is the number of activities per unit time. We then create a Poisson
point process for each of the users using their corresponding activity rates.
We feed into this function, probability values from the continuous uniform
distribution Uniform(0,1)
following is the table of patient inter-arrival times in hours at the ER
for the first 10 patients. We have generated this date using the
above formula, with λ set to 5 patients per hour.
The user sends request to the jth ranked friend recommendation Ri[j] based on the
following Bernoulli sampling.
The user then chooses to accept or reject friend requests if any. If vj had earlier sent a
friend request, vi accepts or rejects it based on the following Bernoulli sampling.
The following is experiment screenshot of our friendship recommendation simulation for user 3195
Recommendation based on Network structure(Algorithm 1)
One of the important factors while recommending new friends to a user is the number of mutual or
common friends between them and all those nodes which is having less distance from other nodes .
We take graph of friends having their group label as input and output recommendation list of new
friends based on their distance.
The Algorithm can be easily underslood by using our practice graph. There are five nodes in the
graph namely A,B,C,D,E.F.
Where did the algorithm fails?
𝑢3 𝑢13
𝑢10
𝑢11 𝑢14
𝑢4
𝑢12 𝑢2 𝑢5 𝑢15
𝑢1
𝑢9
𝑢6
𝑢16
𝑢19
𝑢7
𝑢8 𝑢17
𝑢20 𝑢18
Here we are doing 20% reservation for different group user, in similar we
can also increase it to 30% reservation
Evaluation Metric.
This metric measures segregation between groups in
network.
•Here we can clearly see that groups segregation is increasing with the numbers of users
login.
•Now with this we can clearly say that similarity based friendship recommendation increases
social segregation.
•We have run avg cluster coefficient for group specific subnetwork and for the entrie
network.
•Here we can clearly see that there is increase of cluster coefficient of the nodes with the
increasing numbers of user login.
•Here users are clustering together more tightly because of the reason that our algorithm is
only recommending user who is close to the target user.
Recommendations based on Reservation (Algorithm 2)
•Here we can clearly see that groups segregation is decreasing with the numbers of users login
significantly as compared to algorithm 1.
•Here we can also see that with the increase in reservation percentage ,more new connection are
being made between users from different groups like from figure we can see that 30% reservation are
making our network less segregation as compared to 20% reservation.
•Here we can clearly see that algorithm 2 decreases of avg. cluster coefficient of the nodes
with the increasing numbers of user login as compared to algorithm 1 and with this we can
say that nodes are not getting clustered more tightly as compared to algorithm
•Here we can also see that with the increase in reservation percentage ,more new
connection are being made between users from different groups and with that more open
triangles are being made and it results less clustering of nodes.
Conclusion and Future works
In summary, we performed graph analysis and studied the various
properties of the social network. After properly understanding the
entire network, we run friendship recommendation simulation and
see how it recommending friends for a specific user on the
network with two different type of algorithms. Apart from this, we
analysis our network user dynamic by applying evaluation metric
and avg cluster coefficient at different login interval on the
network.In brief,what we found is algorithms which suggestion is
based on closest or nearest distance generally forms clusters. The
current dataset does not have specific features listed for each user.
If they were provided, we can use them to detect users with
similar features and recommend friends.
References
[1] Representing degree distributions, clustering, in social networks with latent cluster
random effects models. Social Networks, 2009 P. Krivitsky, M. Handcock.
[2] Modularity and community structure in social networks. PNAS, 2006 M. Newman.
[3] Uncovering the overlapping community structure of complex networks in nature and society.
Nature, 2005 I. Farkas, and T. Vicsek.
[4] Discovering social circles in ego networks. J. McAuley and J. Leskovec. 2012.
[5] The anatomy of the Facebook social graph. J. Ugander, B. Karre preprint, 2011.
[6] Towards discovering hidden communities based on user profiles. In ICDM , 2010 T.
Yoshida
THANK YOU