Professional Documents
Culture Documents
02. Manuscript (5)
02. Manuscript (5)
1
Chapter I
Introduction
was always a costly and challenging undertaking. The Internet makes things
significantly simpler, quicker, and less expensive (Blank, 2013). However, the
computer systems, had made media streaming feasible. When data is sent across
continue while new data is being received, the technique is known as streaming
(Fecheyr-Lippens, 2010).
Platforms that allow individuals to stream video content over the internet
are often referred to as user-generated live streaming systems (Pires & Simon,
In recent years, live video streaming has become a global business and
Live, have been established and have experienced remarkable global expansion.
However, not enough attention has been given by studies to comprehending the
al., 2017).
these comments.
linguistic structure, the context of words used in the text, and both positive and
content creators. Viewer sentiment can profoundly impact the success of a live
gain valuable insights into audience preferences, reactions, and interests. This
information enables streamers to tailor their content to better align with viewer
streamers, revolves around handling the large volume of comments their videos
attract. It's impractical for them to manually sift through hundreds or thousands
of comments. While likes and dislikes offer some insight into viewer feedback,
1. What are the requirements to produce a web system that utilize sentiment
analysis?
The main objective of the study, titled "Viewer Sentiment Analysis in Live
system, that utilizes sentiment analysis. Specifically, this study aims to:
preferences.
reputation management.
responses. This insight allows them to tailor their content more effectively to
Viewers: Viewers stand to gain from the enhanced content quality that
attuned to their audience's preferences and emotional responses, they can create
content that better aligns with what viewers enjoy and find engaging.
social media can expand their understanding of the dynamics within online
develop more sophisticated models and tools for sentiment analysis, contributing
audio, and multimedia content that is created and shared across digital platforms.
digital content for online distribution and consumption. Digital content creators
creative professionals who produce content for websites, social media, streaming
the internet, allowing viewers to watch events as they happen. Livestreams often
features.
over the internet in real-time. Live streaming allows content creators to engage
feedback.
Streamer. Someone who broadcasts live video content over the internet,
YouTube, or Facebook Gaming. They cover diverse activities such as gaming, art
Video on Demand (VOD). Videos that viewers can access at any time,
rather than at a scheduled broadcast time. VOD allows users to watch videos
viewers, which can include actions such as liking, commenting, sharing, and
comments through lexicon-based sentiment analysis and data collection via the
The system is designed to work only with YouTube videos, and the
findings may not be applicable to other platforms. Data is gathered only from
algorithm is limited to text data and may struggle to interpret sarcasm, slang,
Sentiment Analysis
The study aimed to provide insights into the strengths and limitations of each
lexicon-based techniques, along with related fields like emotion detection. The
in text and highlighted the need for further research in sentiment analysis.
people's sentiments in online comments. They suggest that considering both text
and emojis is crucial, especially for Arabic comments. The study also notes that
Overall, the study highlights the potential of emojis for sentiment analysis and
YouTube; users often express their opinions towards the content that they
consume by leaving comments. The study used three class attributes which are:
positive, neutral, and negative; TextBlob library is used to process textual data
in python.
A study by Li, M., Chen, Zhao, and Li, Q. (2021) proposed a sentiment
analysis model based on BERT model for Chinese stock reviews. This avoids
building a new dictionary, and extract its features manually. The researchers
found out that BERT+FC model via fine-tuning performs the best among the
different variants they designed. The proposed model includes BERT model and
a classifier layer. This classifier layer is stacked on top of BERT and jointly
fine-tuned with the model. The dataset used to be trained on the Chinese
Wikipedia corpus is from Github website with a total of 9204 labeled reviews,
and 88.09% accuracy. To verify the efficacy of the proposed BERT model, it
feature selection and data gathering techniques while delving into several
Tan and Chia (2022) stated that increased usage of social media has
analysis, the sentiment on the topic of interest must be identified, also known as
topic modeling. Topic modeling was not recognized to be reuse-ready for broad
for Sentiment Reasoning (VADER). Using the Reuse Readiness Levels (RRL),
it was identified that LDA for Topic Modeling is at RRL 4, where it might be
reused by most users with some effort, additional cost, and calculated risk.
behavior has made it the most popular medium for sharing videos in society.
YouTube allows anyone to create an account in any category and upload videos
various tactics to make their videos popular. Evaluate user comments, determine
content, which influences their decision to subscribe to such channels (Danda &
Talarczyk, 2021).
Singh and Tiwari (2021) have used sentiment analysis to explore the
(ML) and natural language processing (NLP). In a similar case, they analyzed
sentiments in user feedback, which can provide valuable insights into public
Using this system, they examined comments on popular YouTube channels and
found that Sadness, Surprise, and Joy were the most common emotions
expressed.
based method for retrieving relevant and popular YouTube videos by analyzing
sentiment in user comments. Their approach involves four key steps: collecting
in user comments.
items.
Networks
people make videos to attract audiences and earn money through views,
reputation. If the number of likes exceeds the number of dislikes, the material is
good; on the other hand, a high number of dislikes compared to likes usually
indicates poor content. Although the amount of likes a video receives provides a
summary of its success, it does not explain the fundamental causes of its success
or failure.
see how people feel about the material. Prior to AI and machine learning,
manual analysis could only handle a limited number of comments per video.
Most popular YouTube channels, on the other hand, receive 1,000 or more
comments per video and post at least five videos every week. As a result, the
In a study by Kobs et al. (2020), they found that sentiment analysis helps
Despite challenges with Twitch's unique language, the study showed that these
Twitch.tv as the platform. In this study, researchers scraped livestream chats and
clips for data. Since the data was unlabeled, they had to find a solution. They
had two people label each message as Positive, Neutral, or Negative. They used
evaluate than other methods. They compared two models: convolutional neural
time (Chouhan et al., 2021). The study utilized a range of models, including
Random Forest Classifier, and Multinomial Naïve Bayes. The Support Vector
Twitch.tv.
Strauss, 2019). These YouTuber gamers were selected based on their common
traits and high popularity. The study employed judgment sampling followed by
Netvizz (version 1.45), researchers extracted posts from each YouTuber's page,
considering the unique language used in the gaming community. The study's
findings were divided into Facebook metric analysis, sentiment analysis, and a
feedback and preferences. The study classified sentiments into positive and
negative categories, using lexical classification when labeled data was scarce.
decision-making.
cyberbullying within Instagram user comments was explored. The study focuses
on discussions related to the 2017 Jakarta Governor Election and examines how
social media enables individuals to freely express opinions, both positive and
processes.
promoting good health. Using Twitter data logs and the MooM dataset, they
categorized tweets based on trending topics and described their alignment and
user requirements and Twitter data analysis. Overall, their study provided
trends.
StarCraft 2 chat. They use SO-CAL for sentiment and toxicity detection, finding
it effective. Their study emphasizes the need for tailored dictionaries in gaming
They also note the overlap between sentiment classification and toxicity
detection tasks.
found negative reviews are longer and written sooner than positive ones. Each
genre shows different emotional patterns, with action and adventure games
receiving mixed reviews and role-playing and strategy games getting mostly
positive feedback.
align closely with their ratings. They analyzed reviews for Dragon Age and
Mass Effect games, focusing on aspects like combat, story, and character.
sentiment and opinion. The Stanford Deep Learning for Sentiment Analysis
model is noted for its success. Challenges in sentiment analysis include the
prices.
analysis to figure out how people felt about products. Then, they used another
in the future. Based on this prediction, they made sure that the best reviews of
products were shown first to customers. This helps customers find great
Book Series" by Vyas and Uma (2019), Chapter 2 discusses how to understand
feelings in product reviews. It explains that sentiment analysis helps figure out if
a review is positive, negative, or neutral. This helps businesses see what they're
doing well and what needs improvement in their products. Sentiment analysis
also helps marketing teams target their ads better. The chapter explains how
sentiment analysis works and mentions that it usually uses supervised learning
from text, across different languages, due to ambiguities and nuances like
can gauge people's feelings. They found that using machine learning was better
than other methods. Looking at whole documents, not just sentences, improved
accuracy by 7%. Using more data helped traditional methods. They found that
product reviews.
Conducted by Anton Borg & Martin Boldt (2020), titled, “Using VADER
sentiment and SVM for predicting customer response sentiment.” The researchers
provide labeling of the emails. Two support vector machine models are being
trained by email content and sentiment labels. The outcomes has demonstrated that
According to Elbagir and Yang (2019), Social media technologies take many
enterprise social networks, video sharing networks, and social networks. As the
number of social media technologies has grown, so has the popularity of online
social networking sites like Facebook, YouTube, and Twitter, which allow people
to express and share their ideas and opinions about life events. Currently, numerous
programs, such as Linguistic Inquiry and Word Count (LIWC), can extract
complex aspects from texts. However, many of these tools require some
sentiment analysis.
Amin, Hossain, Akther, and Alam (2019) modified the VADER model to
analysis of the English language, the Bengali language is far behind but the polarity
lexicon based works can learn from SentiWordNet, VADER, etc. The researchers
of the study developed a model that identifies Bengali text sentiments. They created
two dictionaries of negation and booster words. The negation list consists of
negative common words in Bengali that affects the polarity of the text to either
positive or negative. The booster dictionary includes Bengali words that boosts the
valence of the text. The Bengali lexicon was constructed by translating VADER
lexicon with bilingual dictionary and gave the corresponding polarity scores. The
stemming. Identification of the position of booster words was also done by using
three processes: Bigram, Trigram, and Negation. The final step in developing a
comparing unmodified VADER and Bengali VADER. First, the Bengali text was
the text into English by Google Translate and Python Translator. Unmodified
VADER, using its three translators, labeled the text with a positive sentiment,
where in Bengali context, it should be negative. The Bengali VADER model that
also evaluated the models with a positive sentiment Bengali text. As expected,
during the Era of COVID-19" was the title of a study done in 2020 by Pano and
Kashef. The researchers investigated with both full-length and truncated tweets to
find out how different text preparation methods affects VADER sentiment scores.
Texts were deemed positive if their VADER emotion score was above 0.05 and
negative if it was below -0.05. Additionally, the researchers found that phrases
work better with bigger datasets. Furthermore, cleaning the data before employing
strategy for establishing a positive association between Bitcoin prices and tweet
sentiment analysis and how they might increase the association between sentiment
Garay et al. (2019) conducted a study examining the opinions and emotions
expressed within the anti-vaccine movements on social media. The data, consisting
of tweets and excerpts, has been processed to remove irrelevant information and
background noise. This information is then grouped using the k-means clustering
sentiment tool. The prevalent sentiments within each cluster help to characterize the
elite, reinforcement of the right to not vaccinate, the presence of toxic components,
the profit motives of big pharmaceutical companies, purported links to autism, and
reported health issues following vaccination. To assess the results of the k-means
points to other nearby clusters. The average silhouette score that is derived is
0.013540022, indicating that the data points are close to the decision boundaries.
wherein it is based on the transformer model and deep learning models to improve
sentiment classification tasks. BERT (RoBERTA) was chosen for vectors of the
input sentences as well as the Long Short-Term Memory (LSTM) model along with
the Convolutional Neural Networks (CNN) model to understand the context and
sentiment between each sentence. The results show that the suggested hybrid model
Lioa et al. (2020) said that, Aspect-category sentiment analysis can provide
within the same text. The primary issue of aspect-category sentiment analysis is
that multiple aspect categories can display distinct polarity in the same text.
text and aspect tokens, and the cross-attention mechanism directs the model to
focus on the most relevant characteristics for the aspect category. According to the
Lee, and Lim (2023) to tackle challenges in this field like imbalanced datasets. The
removing of stop-words, punctuations, hashtags, and URLs in the raw text data.
Data augmentation technique was applied to give more attention to the minority
Substitution, Text Generation, and Word Embedding. Global Vectors for Word
Representation (GloVe) was chosen as the word embedding technique for data
augmentation. The model was evaluated by using three datasets: the Internet Movie
datasets. RoBERTa-GRU model had accuracy scores of 94.63% for IMDb dataset,
89.59% for Sentiment140 dataset, and 91.52% for Twitter US Airlines dataset. This
study states that the combination of RoBERTa and GRU is a promising solution for
Bolem Sai Chandana's study, "Aspect-Based Sentiment & Emotion Analysis with
ROBERTa and LSTM" (2022), used LSTM (Long Short-Term Memory networks)
models for aspect-based analysis, this work aimed to increase the precision of
sentiment and emotion detection in textual data. They discovered that the
linguistic tasks is effective because it helps them learn features that aid in
only learn to represent linguistic features but also to prioritize the use of those
diagnostic set called MSGS (the Mixed Signals Generalization Set) has been
tuning. RoBERTa models were pre-trained from scratch on datasets ranging from
available RoBERTa-base. The study found that models can learn to represent
linguistic features with relatively small amounts of pretraining data, but they
surface ones. Eventually, with around 30B words of pretraining data, RoBERTa-
base did demonstrate a linguistic bias consistently. The conclusion drawn is that
there is still potential to improve the speed at which models learn to recognize
(AES), Twitter, and Google Play. It involves fine-tuning using pre-trained BERT in
feeding it to the BERT model. The outcomes showed that BERT exceeded
A study by Li, M., Chen, Zhao, and Li, Q. (2021) proposed a sentiment
analysis model based on BERT model for Chinese stock reviews. This avoids
building a new dictionary, and extract its features manually. The researchers found
variants they designed. The proposed model includes BERT model and a classifier
layer. This classifier layer is stacked on top of BERT and jointly fine-tuned with
the model. The dataset used to be trained on the Chinese Wikipedia corpus is from
Github website with a total of 9204 labeled reviews, and 88.09% accuracy. To
verify the efficacy of the proposed BERT model, it was compared with TextCNN,
approximately 2.14% to 3% better than the rest of the methods in terms of accuracy
tuning.
Singh et al. (2019) claimed that sentiment analysis, also known as opinion
In another study conducted by Sousa and Sakiyama (2019), titled "BERT for
Stock Market Sentiment Analysis", the researchers based the prediction for stock
prices from the moods in social media and financial news. After fine-tuning the
BERT model, it was able to accurately identify nuanced emotions from the text and
sentiment model might greatly increase the accuracy of stock market predictions by
Ashir (2021) stated that E-commerce reviews are increasingly valued by both
climate, which drives its high demand. The goal of this review article is to
investigate and assess the applicability of the BERT model, a Natural Language
processing cleans up the data and converts it to numbers before it is sent into
BERT, which builds vectors for each input character. We discovered that BERT
reviews in the dataset are two issues that affect the model's accuracy. Training is
also sluggish due to its size and the large number of weights to update. Additional
fake review classification model and providing more training to the model in
After reviewing the current literature, it has been found that employing
can provide valuable insights to a streamer, allowing them to adjust based on the
words and nuances such as sarcasm may pose as a barrier for computational
analysis. The study done by Kobs et al. (2020), established various methods in
comparison, the researchers utilize a web browser system for sentiment analysis by
sentiment.
C.J. Hutto and Eric Gilbert launched Valence Aware Dictionary and
tool based on language and rules that is specifically built for social media
performance quickly set a new standard for NLP tasks such as language
The sentiment analysis for "Behind the Screens" is built using these
Chapter III
Research Methodology
the overall sentiment. In this study, the researchers would like to emphasize that
live stream VOD is different from other YouTube videos as VODs are recorded
during a live stream and viewers can interact with the streamer real-time. Also,
VOD serves as the archived recording of the stream. While, YouTube videos are
recorded and edited beforehand. Comments under the VOD were extracted and
comment section is disabled, and with comments, a viewer can see the opinions of
others. YouTubers, specifically in this study, the streamers also check the
comments under their videos to see the reactions, opinions, or insights of their
viewers. Sometimes, there are too many comments to read and it is a hassle to do
it one by one. The researchers will develop a website that will get the overall
sentiment of the comments under a YouTube video and give the summary of the
intent.
to conduct the study. Quantitative research takes numerical data and measurement
into account and supposes that circumstances in the study can be measured
data that describe the current state of a phenomenon (Koh & Owen, 2000).
defined into numerical scores or labels. To know the best performing Natural
each comment, the researchers compared three (3) language models for sentiment
Hutto (2014), is used for general sentiment analysis, particularly for social media
texts or comments found online (Wu et al., 2024). Researchers from Google
designed BERT model to improve the fine-tuning approaches and get the context
(Barbieri et al., 2020). Descriptive research comes into play when labeling or
In the study, purposive sampling was used to choose three (3) YouTube
characteristics by being variety streamers, playing the same games, and mainly
streaming on YouTube under a contract. They are also part of the same circle
called OfflineTV and Friends, and were nominated for or won an award on The
platform, Twitch.
comparisons between the sampled streamers. The same number of VODs (n=200)
with the same sampling time scale end point were chosen. Lilypichu’s latest VOD
in the playlist was streamed on April 28, 2024, while Valkyrae and Sykkuno’s
latest livestreams in the playlists were on April 29, 2024. All comments under the
The researchers created three (3) YouTube playlists and named them
according to the sampled streamers, which consist of 200 VODs from each of the
streamers. Python program (version 3.11.7) was used to retrieve all the comments
under each VOD in a playlist. In this study, the YouTube Data API was also used
for the developer or API key to get the comments. Then, the output was saved as a
CSV file. Comments, along with their timestamps, username, video ID, and date,
were retrieved. A total of 31,054 comments were extracted from 600 livestream
ascending order.
Data Preprocessing
comments retrieved for classification. Texts from the internet usually contains
noise such as HTML tags, scripts, special characters, etc. Natural Language
Toolkit (NLTK) was used for language processing, especially the VADER model.
spellings were done to get the sentiment of each comment. These preprocessing
techniques were done for classifying each comment sentiment. For the overall
sentiment of all the comments, English stop words, punctuations, and special
relating to each streamer and content of the study. Finally, the data were
As what was mentioned, the datasets were analyzed by using three (3)
models to compare which model would perform best for the study. Capitalization
and punctuations were kept as they give more subtle understanding of the
emotions behind the comments. First, VADER model was used to classify the
datasets. Each comment extracted from the VODs was analyzed for its sentiment.
Next, BERT model classified the datasets. Unlike the two (2) models, the polarity
scores of this model are zero (0) to five (5): 0 being the most negative, 3 being
neutral, and 5 being the most positive. Lastly, the datasets were classified by
using RoBERTa model. The polarity score of each comment was done in a for-
The rest of the preprocessing techniques were applied to the dataset. The
sentiment of the comments were counted to determine the emotions of the viewers
towards the streamer. The first five (5) most frequent words that appeared in the
comments per streamer based on sentiment were extracted. These words from the
streamers were also compared from one another. Word clouds were used to
visualize the sentiment of the viewers per streamer. Lastly, all the comments per
Iterative model will be used for developing the system. This software
development life cycle has elements from waterfall model in iterative form.
Initially, iterative model implements parts of the total system and adds
The main functions of the system were designed. The main function should be
accepting any YouTube video URL and analyze the comment section’s overall
sentiment. No backend programming was added yet. The developers of the study
designed the initial user interface and the basic requirements of the system. The
system was not ready to accept any user input in this stage. The developers
suggestions into account. The frontend will be improved and the basic functions
of the system will be programmed. The user should be able to input YouTube
video URL and analyze for the overall sentiment. The results of the analysis
should be returned. The system will be tested, implemented, and reviewed for
changes. This stage will continue until the application achieves user-satisfaction.
Iteration 3. This stage will still include all the changes, improvements,
and feedbacks from previous iterations. In this stage, the system will undergo user
testing and be implemented. The system will be reviewed to check for further
improvements or errors. Then, the system will be deployed and be available for
the users to utilize. The web application should be maintained by the developers
Work Plan
techniques were applied to the data collected. Then, the data were analyzed and
main feature for the web application. To develop a web application, Python Flask
will be used as a framework for backend, Tailwind CSS for frontend, and MySQL
for database.
frontend. Initially, the UI/UX designers of the study used Figma to design the
frontend.
Backend. The researchers will use Flask framework for developing the
system that is used for managing structured data. The web application will be
using this database management system as the user should be able to access their
Viewers and Streamers (New and Registered) and admin are the actors in
this diagram. YouTubers who either streams or uploads edited videos, or viewers
are the users of the website. New users can register an account to start using the
web application. Once a user has an existing account, they can log in and utilize
URL of the desired YouTube video, and view interpretations of past requests. The
interpretation includes the overall sentiment of the comment section, five (5) most
frequently used words, and a word cloud. The user can also view how many
comments are negative, positive, or neutral with the sentiment counter. The admin
1. Users: YouTube URL is provided by the user and results are received.
2. Admin: The system is managed by the admin and is also able to view the
interpretations.
Processes
category.
4. Most Frequently Used Words: Identifies the five (5) most frequently used
words, how many times the words were used, and provide their
sentiments.
interpretation.
Data Stores
are stored.
2. Labeled Comments: Where the comments that were labeled are stored.
stored.
The database has tables called: loginReq for login requirements that also
serves as an audit trail; admin and user that consists of basic user and admin
comments extracted from the YouTube video are stored; labeledComments where
stored; FrequentWords where words from the comments that frequently appeared
were counted and labeled are stored; Lastly, summarizedComments where the
summary of all the comments from the YouTube video are stored.
This section contains the system flowchart of the web system “Behind the
Screens." It is divided into 3 parts: login and registration, admin, and user.
analysis was used to assess the overall sentiment of the viewers towards the
streamer.
viewer comments and the content. The researchers watched ongoing live streams,
and live stream VODs on YouTube. The researchers observed the chat during live
streams, and analyzed the recorded chat during VODs to identify the behavior of
the viewers.
the stream?
streamed?
5. How does the streamer interact with the chat and how often do they do
it?
given to those who were not able to attend the interview due to conflicts of
allow the streamers and viewers more freedom to express their thoughts and
insights. The researchers designed different set of guide questions to ask the
streamers and viewers: one set for the streamers, and another for the viewers.
2. What type of content do you usually watch? Can you give an example of a
4. What do you think about the comments? Do they influence your opinion
on the video?
comment?
8. What types of comments do you find most helpful when deciding whether
to watch a video?
10. If you do leave a comment, what type/s of video content motivate you to
11. If the content creator saw a comment of yours, what would you feel if they
13. In what ways do you think viewer comments contribute to the success or
4. What do you usually feel when you read the chat? How does it affect you?
comment?
9. Have you ever timed out or banned a viewer based on their chat? If so, can
10. Do you think leaving a chat (on a livestream) or a comment (if it's a pre-
Project Manager
she manages, organizes, and plans for the researchers. He or she is also
the thesis adviser and other personnel involved with the study.
Frontend Developer
They are responsible for designing and developing the user interface of the
website.
Backend Developer
Manuscript Writer
Writers are assigned to write and document the processes that took place
Data Collector
They are responsible for collecting the data necessary for the study. In this
study, sentiment analysis usually involves text analysis. The data collector
Alrumaih, A., Al-Sabbagh, A., Alsabah, R., Kharrufa, H., & Baldwin, J. (2020).
and Computer Engineering, 10(6), 5917. Retrieved May 18, 2024, from
https://doi.org/10.11591/ijece.v10i6.pp5917-5922
Bird, Steven, Edward Loper and Ewan Klein (2009), Natural Language
https://doi.org/10.1016/j.knosys.2021.107134
Chouhan, A., Halgekar, A., Rao, A., Khankhoje, D., & Narvekar, M. (2021).
DOI. https://doi.org/10.48550/arXiv.1810.04805
Identification-of-YouTube-Gaming-Gajanayake-Sandanayake/
7c90f945e738487209c1194f53e9260c60ec3e52
Hartmann, J., Heitmann, M., Siebert, C., & Schamp, C. (2023). More than a
from https://doi.org/10.1016/j.ijresmar.2022.05.005
Kobs, K., Zehe, A., Bernstetter, A., Chibane, J., Pfister, J., Tritscher, J., & Hotho,
7_7
https://ijarcce.com/papers/sentiment-analysis-on-youtube-using-lexicon-
based-approach/
Naf’an, M. Z., Bimantara, A. A., Larasati, A., Risondang, E. M., & Setya
of-Cyberbullying-on-Instagram-Naf%E2%80%99an-Bimantara/
5ed9294f98c53e8c8b4d5b06ff5c54091f0e1054?p2df
Poecze, F., Ebster, C., & Strauss, C. (2019). Let’s play on Facebook: using
019-01361-7
Qian, K., & Jain, S. (2024). Digital Content Creation: An analysis of the impact of
from https://doi.org/10.1287/mnsc.2022.03655
Sainath Pichad, Sunit Kamble, Rohan Kalamb, & Chavan, S. (2023, May 10).
https://www.ijraset.com/research-paper/analysing-sentiments-for-youtube-
comments
IANFIS. Journal of Big Data, 7(1). Retrieved May 19, 2024, from
https://doi.org/10.1186/s40537-020-00308-7
https://www.researchgate.net/publication/351351202_YOUTUBE_COM
MENTS_SENTIMENT_ANALYSIS
Tibor, G., & Szűcs, J. (2023). Length and sentiment analysis of reviews about
top-level video game genres on the steam platform. Retrieved May 19,
https://doi.org/10.4018/978-1-5225-4999-4.ch002
Wu, Y., Lin, M., & Yao, W. (2024, April 19). The Influence of Titles on YouTube
Yang, Z., MA. (2020, December 11). Text and sentiment analysis of YouTube
https://repositories.lib.utexas.edu/items/ebee793a-80ef-420d-bc19-
4e9a9841faae
https://arno.uvt.nl/show.cgi?fid=161872.
Borg, Anton & Boldt, Martin. (2020). Using VADER sentiment and SVM for
Semary, Noura & Ahmed, Wesam & Amin, Khalid & Pławiak, Paweł &
10.3389/fnhum.2023.1292010.
Sayeed, Md Shohel & Roji, Varsha & Anbananthen, Kalaiarasi. (2023). BERT: A
study.
Elbagir, S., & Yang, J. (2019). Twitter Sentiment Analysis using Natural
https://www.iaeng.org/publication/IMECS2019/IMECS2019_pp12-16.pdf
Singh, M., Jakhar, A. K., & Pandey, S. (2021). Sentiment analysis on the impact
020-01964-1
Cunha, Alexandre & Costa, Melissa & Pacheco, Marco. (2019). Sentiment
10.1007/978-3-030-20912-4_51.
Amin, A., Hossain, I., Akther, A., & Alam, K. M. (2019). Bengali VADER: A
https://doi.org/10.1109/ecace.2019.8679144
Li, M., Chen, L., Zhao, J., & Li, Q. (2021). Sentiment analysis of Chinese stock
https://doi.org/10.1007/s10489-020-02101-8
Tan, K. L., Lee, C. P., & Lim, K. M. (2023). RoBERTa-GRU: A Hybrid Deep
Alzahrani, Mohammad & Aldhyani, Theyazn & Alsubari, Saleh & Althobaiti,
Maha & Fahad, Adil. (2022). Developing an Intelligent System with Deep
10.1155/2022/3840071.
of Bitcoin (BTC) Tweets during the Era of COVID-19. Big Data and
Stock-Market-Sentiment-Analysis-Sousa-Sakiyama/
e03d32c04c6bb4d2383ac4df25f954dd941152c3
Sirisha, U., & Chandana, B. S. (2022). Aspect based Sentiment & Emotion
https://doi.org/10.14569/ijacsa.2022.0131189
Sayeed, Md Shohel & Roji, Varsha & Anbananthen, Kalaiarasi. (2023). BERT: A
Tan, Jih & Chia, Wai. (2022). Research Output to Industry Use: A Readiness
8515-6_2.
the insights of the anti-vaccine movement from social media posts using
10.1088/1757-899X/482/1/012043.
Warstadt, Alex & Zhang, Yian & Li, Haau-Sing & Liu, Haokun & Bowman,