Professional Documents
Culture Documents
Twitch Report
Twitch Report
1
Ahmedabad University ‘22, Ahmedabad, Gujarat
Table 1: Data details for the 6 different datasets of 6 Once the data was preprocessed and uploaded on Gephi, we
different languages. were able to analyze different data sets in different formats,
where the node size and color could be altered according to
need, and the degree range for the visualized graph could be
3 Process edited. The most prerequisite analysis was based on the
modularity class distribution. Using Louvain Algorithms which
The first thing which has to be done with a dataset that is not is inbuilt in Gephi, we are able to segregate data into modularity
particularly big but spread is to sectionalize it for different classes. The algorithm was originally used as a fast community
fields. The dataset requires the implementation of the fields unfolding algorithm for large networks where the approach
such as days_compare and views_compare so as to would be based on modularity. This approach tries to maximize
categorically assign different nodes into different sets so as to the expected number of edges and the actual number of edges
better differentiate between them. Now to differentiate a node in a community. After actually visualizing the graph and
into three different categories of “New”, “Mid” and “Old”, a analyzing it, we aimed at drawing conclusions about the
metric has to be set such that we aren't particularly making a different cases and at the same time plotting the distribution
group niche. So the metric used was presuming that the for cases like degree distribution for different nodes around
distribution given would be normal in nature. So, the mid different age classes (based on days_compare).
consisted of 1 standard deviation from the mean on both sides.
That is nearly equal to 68% of the total data. The remaining Along with analyzing the traits of the nodes, we also analyzed
data on both sides were given the category of New and Old the assortativity of all the graphs and tier reciprocities using
respectively, New for the case that the days are lesser than python and networkx. Assortativity gave us that not a single
(mean – standard deviation) and old for the case that the days graph amongst all had nodes going back to similar nodes but
for the nodes are more than (mean + standard deviation). rather all the nodes wanted to connect with dis-similar nodes.
The reciprocity for all the graphs turned out to be 0 which gave
For the case of views, we simply divided it into two parts, the the hint that the edges don't traverse back to the source node
first part including the nodes that had views lesser than of the first edge at all, throughout the whole graph.
average and the second part consisting of nodes that had views
more than average. Once we received 2 new fields
days_compare and views_compare, we were able to start the
basic analysis. Then using networkx, we were able to append
the degree for each node to the nodes table for each of the
datasets. Table 2: Assortativity coefficient and Reciprocity of the
graphs formed by the datasets.
2
predominantly streamers from some communities particularly
have a lot of viewers.
3
Ahmedabad University ‘22, Ahmedabad, Gujarat
4
Also for the two outliers, we think that this anomaly could mean No of the views had the total of the views which we compared
that maybe these two languages might be new to the platform with the days_cmpre to get an exact idea of how many views
itself and it is still budding there thus the new streamers are were associated with which age category of the streamers. We
getting more attention because the platform not being more did this to again understand which age category is more
popular than the older streamers might have been inactive as dominant in accordance with the views that they are getting.
well.
5 Hypothesis
Hypothesis 1: - Based on how the human mind thinks, we
Figure 10: Mature vs Non-Mature content for the different hypothesized that old streamers must have a high connection
datasets and a more significant degree associated with them. These
4.8.1 Results higher degrees of connection must be leading to a more
The result of this analysis here is that we can pinpoint the significant number of views than any other age category of
numbers of the content creators according to which age section streamers. The result of which has been discussed in the
they belong to. It is very much evident that the mid-aged conclusion provided.
streamers were the most for all the languages and again not Hypothesis 2: If a more significant number of streamers
much difference could be seen between the new and old belong to a particular age group, the age group with the highest
streamers. counts of streamers would overall have the highest cumulative
views. The results of this hypothesis are described in the
4.9 No of views/days_compare conclusion stated below.
5
Ahmedabad University ‘22, Ahmedabad, Gujarat
REFERENCES
[1] Meier, F., 2020. Social Network Analysis as a Tool for Data Analysis and
Visualization in Information Behaviour and Interactive Information
Retrieval Research | Proceedings of the 2020 Conference on Human
Information Interaction and Retrieval. [online] ACM Conferences. Available
at: <https://dl.acm.org/doi/10.1145/3343413.3378018>
[2] Meier, F., 2020. Social Network Analysis as a Tool for Data Analysis and
Visualization in Information Behaviour and Interactive Information
Retrieval Research | Proceedings of the 2020 Conference on Human
Information Interaction and Retrieval. [online] ACM Conferences. Available
at: <https://dl.acm.org/doi/10.1145/3343413.3378018> [Accessed 3 May
2022].
[3] Communications of the ACM. 2022. The power of social media analytics |
Communications of the ACM. [online] Available at:
<https://dl.acm.org/doi/10.1145/2602574>