Social Media Analytics: Conversation Metrics
TERM 4
PART A
Kanika Mohan
PGP/24/038
Twitter Conversations on #MUSIC
Using the Twitter Search Network import to collect data about #Music, I obtained the in-degree,
out-degree, betweenness, closeness, and eigenvector centrality, as well as the clustering coefficient,
for every Twitter user that participated in the conversation.
The first graph for #Music, after merging duplicate edges, was obtained as below:
As the data was collected from Twitter, the network type is directed.
The data holds 1745 vertices, the number of unique accounts that participated in this conversation.
After merging duplicates, the data consists of 1984 unique edges covering Retweets, Tweets,
Mentions, and Mentions in Retweets.
There are 431 original tweets in the collected data.
The reciprocated vertex pair ratio is 0.019, a low value, so the majority of the conversations in
the collected data are one-way.
The reciprocated edge ratio is 0.0373, also low, so the majority of edges have no reciprocal edge.
There are 480 separate connected components in the graph.
There are 250 single-vertex connected components (isolates), i.e., vertices with no connection
to any other vertex.
The largest connected component contains 406 vertices.
The maximum number of edges in a connected component is also 406, the same as the maximum
number of vertices in a connected component.
The maximum geodesic distance (the longest of the shortest paths between connected vertices) is 15.
The average geodesic distance (the mean shortest-path length between connected vertices) is 2.201.
The graph density is only 0.00051, so the graph is not densely connected.
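The overall metrics listed above can be reproduced from a plain edge list. The sketch below is only an illustration on hypothetical toy data: the function name `overall_metrics`, the toy vertices, and the choice to leave self-loops (plain tweets) out of the ratios and the density are assumptions, not something the report confirms. One thing that does hold by definition: if r is the reciprocated vertex pair ratio, the reciprocated edge ratio equals 2r / (1 + r), and 2 × 0.019 / 1.019 rounds to the reported 0.0373.

```python
from collections import defaultdict


def overall_metrics(vertices, edges):
    """Graph-level metrics for a directed edge list (duplicates allowed).

    Assumes at least one edge between two distinct vertices.
    """
    unique = set(edges)                              # merge duplicate edges
    loops = {(u, v) for u, v in unique if u == v}    # assumption: plain tweets
    links = unique - loops                           # edges between two accounts
    vertices = set(vertices) | {x for e in unique for x in e}

    # Unordered pairs of accounts joined by at least one edge.
    pairs = {frozenset(e) for e in links}
    # Edges whose reverse edge is also present.
    recip_edges = [(u, v) for u, v in links if (v, u) in links]

    n = len(vertices)
    metrics = {
        "vertices": n,
        "unique_edges": len(unique),
        "reciprocated_vertex_pair_ratio": (len(recip_edges) / 2) / len(pairs),
        "reciprocated_edge_ratio": len(recip_edges) / len(links),
        "density": len(links) / (n * (n - 1)),
    }

    # Connected components, ignoring edge direction.
    neighbours = defaultdict(set)
    for u, v in links:
        neighbours[u].add(v)
        neighbours[v].add(u)
    seen, sizes = set(), []
    for start in vertices:
        if start in seen:
            continue
        seen.add(start)
        stack, size = [start], 0
        while stack:
            node = stack.pop()
            size += 1
            for nxt in neighbours[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        sizes.append(size)
    metrics["connected_components"] = len(sizes)
    metrics["single_vertex_components"] = sizes.count(1)
    metrics["max_component_vertices"] = max(sizes)
    return metrics


# Hypothetical toy conversation: a and b talk to each other, c mentions a,
# d only tweets (self-loop), e is an isolate.
toy = overall_metrics(
    ["a", "b", "c", "d", "e"],
    [("a", "b"), ("b", "a"), ("c", "a"), ("a", "b"), ("d", "d")],
)

# Consistency check on the reported ratios: edge ratio = 2r / (1 + r).
edge_ratio_from_pair_ratio = 2 * 0.019 / (1 + 0.019)
```

On the toy data, one of the two connected pairs is reciprocated, so the vertex pair ratio is 0.5 while the edge ratio is 2/3, which shows why the edge ratio is always the larger of the two.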
The top 10 values for each metric, together with the corresponding Twitter handles, were extracted
from the data as below:
In-Degree
Vertex             In-Degree
rockstardogebsc    406
broadcastking_     85
youtube            41
hicetnunc2000      19
whoiszid_nft       16
devteaml           13
allexbenedith      11
blackettmusic      11
benbarnes          10
thejazzsoul        9
-The highest in-degree was observed for rockstardogebsc, a Berlin-based artist and cryptocurrency
enthusiast with over 7k followers, showing that more tweets in this conversation are directed at
this music account than at any other.
Out-Degree
Vertex             Out-Degree
edjlazar           16
wireradiouk        16
radiozitymusic     14
souseman           12
stream_caster      12
lontermbelieber    12
artistrtweeters    11
mwubbaphat         11
theasmrzone        11
mistaseeforce      11
-The handle edjlazar has the highest out-degree (16, tied with wireradiouk), showing high activity
and extraversion: it sends the most outbound ties to other accounts.
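In-degree and out-degree as used above are simple counts over the merged edge list: in-degree counts unique edges pointing at an account, out-degree counts unique edges leaving it. A minimal sketch on hypothetical toy data follows; the function name `degree_tables`, the handles, and the default of skipping self-loops are assumptions made for illustration.

```python
from collections import Counter


def degree_tables(edges, top=10, count_self_loops=False):
    """Return (in-degree, out-degree) rankings from a directed edge list."""
    in_deg, out_deg = Counter(), Counter()
    for source, target in set(edges):            # merge duplicate edges
        if source == target and not count_self_loops:
            continue                             # assumption: skip self-loops
        out_deg[source] += 1
        in_deg[target] += 1

    def ranked(counter):
        # Highest degree first; ties broken alphabetically for a stable table.
        return sorted(counter.items(), key=lambda kv: (-kv[1], kv[0]))[:top]

    return ranked(in_deg), ranked(out_deg)


# Hypothetical toy data: three fans and a radio account mention one artist;
# the radio account also retweets fan1 twice (a duplicate edge).
toy_edges = [
    ("fan1", "artist"), ("fan2", "artist"), ("fan3", "artist"),
    ("radio", "artist"), ("radio", "fan1"), ("radio", "fan1"),
]
top_in, top_out = degree_tables(toy_edges)
```

Here the artist leads the in-degree ranking (many accounts point at it), while the radio account leads the out-degree ranking (it points at many accounts), which mirrors the split between the two tables.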
Betweenness Centrality
Vertex             Betweenness Centrality
rockstardogebsc    163620.000
artistrtweeters    7357.205
broadcastking_     6972.000
youtube            4096.667
blackettmusic      3396.695
icdreamz_          2626.214
retweelgend        2623.548
technobotnl        2319.495
4dapeoplemusic     2182.295
usaruckus          2025.467
-The highest betweenness centrality again belongs to rockstardogebsc, which means this account most
often lies on the shortest path between two other participants in the conversation. It is followed by
artistrtweeters, an account that retweets tweets of famous artists. The YouTube account also appears,
which makes sense as many people would engage in the conversation through YouTube directly.
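-For closeness centrality, 230 vertices had a value of 1, meaning each of them is one step away from
every vertex it can reach. Since the graph has 480 connected components, many of these vertices
probably sit in very small components, so a value of 1 does not by itself indicate reach across the
whole network.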
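To make the two path-based measures above concrete, here is a minimal sketch of betweenness (Brandes' accumulation over breadth-first searches) and closeness. It rests on several assumptions that the report does not state: edges are treated as undirected, betweenness counts ordered pairs of endpoints, and closeness is taken as the inverse of the summed distances within a vertex's own component. The function names and the toy handles are hypothetical. As a rough check, a hub with 405 spokes lies between 405 × 404 = 163620 ordered pairs, which matches the value reported for rockstardogebsc, though that alone does not confirm how the tool computes it.

```python
from collections import deque


def undirected_adjacency(edges):
    """Adjacency sets that ignore direction, duplicates, and self-loops."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set())
        adj.setdefault(v, set())
        if u != v:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def betweenness(adj):
    """Brandes' algorithm; every ordered (source, target) pair is counted."""
    score = dict.fromkeys(adj, 0.0)
    for s in adj:
        order = []                          # vertices in non-decreasing distance
        preds = {v: [] for v in adj}        # predecessors on shortest paths
        sigma = dict.fromkeys(adj, 0)       # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = dict.fromkeys(adj, 0.0)
        while order:                        # walk back from the farthest vertex
            w = order.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    return score


def closeness(adj):
    """Inverse of the summed distances to all reachable vertices (assumption)."""
    result = {}
    for s in adj:
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[s] = 1 / total if total else 0.0
    return result


# Hypothetical toy data: one hub with four spokes, plus a separate pair.
toy_adj = undirected_adjacency([
    ("a", "hub"), ("b", "hub"), ("c", "hub"), ("d", "hub"), ("x", "y"),
])
toy_betweenness = betweenness(toy_adj)
toy_closeness = closeness(toy_adj)
```

In the toy graph the hub gets a betweenness of 4 × 3 = 12 and every spoke gets 0, while the two accounts in the isolated pair both get a closeness of 1 under this definition even though they reach only each other.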
Eigenvector Centrality
Vertex             Eigenvector Centrality
rockstardogebsc    0.048
erichoussein       0.002
tom_87indo         0.002
ctyzzen            0.002
wndrchive          0.002
chocovanillawin    0.002
elle_ack           0.002
livsyeji           0.002
jaemnavs           0.002
mamamuda202099     0.002
-The highest eigenvector centrality was observed for rockstardogebsc, which also had the highest
in-degree and betweenness centrality. The graph showcasing its eigenvector centrality indicates that
it is well connected to other well-connected accounts.
-The handles of other independent artists who wish to gain traction in this industry are also well
connected.
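Eigenvector centrality rewards a vertex for being connected to vertices that are themselves well connected. A minimal power-iteration sketch is shown below under stated assumptions: edges are treated as undirected, scores are normalised to sum to 1, and each step adds the vertex's own score to its neighbours' scores (iterating with A + I) so the iteration also settles on hub-and-spoke graphs. None of this is confirmed by the report as the method behind the table, and the function names and toy handles are hypothetical.

```python
def undirected_adjacency(edges):
    """Adjacency sets that ignore direction, duplicates, and self-loops."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set())
        adj.setdefault(v, set())
        if u != v:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def eigenvector_centrality(adj, max_iter=1000, tol=1e-10):
    """Power iteration on (A + I), normalised so the scores sum to 1."""
    x = {v: 1.0 / len(adj) for v in adj}
    for _ in range(max_iter):
        # Each vertex keeps its own score and collects its neighbours' scores.
        new = {v: x[v] + sum(x[w] for w in adj[v]) for v in adj}
        total = sum(new.values())
        new = {v: value / total for v, value in new.items()}
        if sum(abs(new[v] - x[v]) for v in adj) < tol:
            return new
        x = new
    return x


# Hypothetical toy data: one hub with four spokes, plus a separate pair.
toy_adj = undirected_adjacency([
    ("a", "hub"), ("b", "hub"), ("c", "hub"), ("d", "hub"), ("x", "y"),
])
toy_scores = eigenvector_centrality(toy_adj)
```

On the toy graph the hub ends up with about 1/3 of the total score, each spoke with about 1/6, and the detached pair with practically nothing, which is the same pattern as one dominant account followed by a long tail of equal small values.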
Clustering Coefficient
Vertex             Clustering Coefficient
stormhour          1.000
bandcamp           1.000
soundcloud         1.000
stomp_city         1.000
pablolarosaxo      1.000
timeoutart         1.000
timeoutlondon      1.000
arboretumlondon    1.000
tonykofi           1.000
canadianmusicbl    1.000
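A clustering coefficient of 1.000, as in the table above, means that every pair of a vertex's neighbours is itself connected. The sketch below computes the local coefficient on hypothetical toy data, assuming edges are treated as undirected; whether the values in the table were computed on the directed or the undirected graph is not stated, and the function names and handles are illustrative only.

```python
from itertools import combinations


def undirected_adjacency(edges):
    """Adjacency sets that ignore direction, duplicates, and self-loops."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set())
        adj.setdefault(v, set())
        if u != v:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def clustering_coefficient(adj, vertex):
    """Share of neighbour pairs of `vertex` that are connected to each other."""
    neighbours = adj[vertex]
    k = len(neighbours)
    if k < 2:
        return 0.0                      # fewer than two neighbours: no pairs
    linked = sum(1 for a, b in combinations(neighbours, 2) if b in adj[a])
    return linked / (k * (k - 1) / 2)


# Hypothetical toy data: a closed triangle and a hub with three spokes.
toy_adj = undirected_adjacency([
    ("a", "b"), ("b", "c"), ("a", "c"),
    ("hub", "s1"), ("hub", "s2"), ("hub", "s3"),
])
toy_cc = {v: clustering_coefficient(toy_adj, v) for v in toy_adj}
```

Every member of the triangle scores 1.0 with only two neighbours, while the hub scores 0.0 despite having more ties. A perfect score is therefore easy to reach for low-degree accounts and does not by itself signal influence.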
Graph Partitioning
When the vertices are grouped by connected components, the graph has a lower modularity score than
when the groups are generated by clustering.
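The modularity comparison can be illustrated with a small calculation. The sketch below uses the standard undirected modularity, Q = sum over groups of (internal edges / m) minus (group degree / 2m) squared, on hypothetical toy data; the function name `modularity` and the toy groups are assumptions, and the modularity figures behind the report itself are not reproduced here.

```python
def modularity(edges, groups):
    """Undirected modularity of a partition given as a list of vertex sets."""
    group_of = {v: i for i, members in enumerate(groups) for v in members}
    m = len(edges)
    internal = [0] * len(groups)    # edges with both ends inside the group
    degree = [0] * len(groups)      # summed degree of the group's vertices
    for u, v in edges:
        degree[group_of[u]] += 1
        degree[group_of[v]] += 1
        if group_of[u] == group_of[v]:
            internal[group_of[u]] += 1
    return sum(l / m - (d / (2 * m)) ** 2 for l, d in zip(internal, degree))


# Hypothetical toy data: two triangles joined by one bridge, plus a pair.
toy_edges = [
    ("a", "b"), ("b", "c"), ("a", "c"),
    ("d", "e"), ("e", "f"), ("d", "f"),
    ("c", "d"),
    ("x", "y"),
]
by_component = [{"a", "b", "c", "d", "e", "f"}, {"x", "y"}]
by_cluster = [{"a", "b", "c"}, {"d", "e", "f"}, {"x", "y"}]

q_component = modularity(toy_edges, by_component)
q_cluster = modularity(toy_edges, by_cluster)
```

Grouping by connected components lumps both triangles together and gives Q = 0.21875, while splitting them into separate clusters gives Q = 0.4765625. Grouping by component cannot separate dense sub-groups inside one large component, which is one plausible reason clustering scores higher.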
B. Grouping by Clustering
1. Clauset-Newman-Moore
-The modularity scores of the two clustering algorithms do not differ much, and both are higher
than when the data is grouped by connected components.
-Artists and celebrities have an in-hub-and-spoke network structure, while media pages have an
out-hub-and-spoke network structure.
-YouTube is one of the central nodes in G1, which has a broadcast network structure. The same holds
for G3, where the central node is a journalist with the handle Hafiz Miakhel, and for G4, where the
central node is the Takayama handle.