Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/329937817

Deciphering West Philippine Sea: A Plutchik and VADER Algorithm Sentiment


Analysis

Article  in  Indian Journal of Science and Technology · December 2017


DOI: 10.17485/ijst/2018/v11i47/130980

CITATIONS READS
4 204

1 author:

Las Johansen Balios Caluza


Leyte Normal University
45 PUBLICATIONS   38 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

COVID-19 Pandemic Impact Analyses in the Philippines View project

LNU IT Researches View project

All content following this page was uploaded by Las Johansen Balios Caluza on 29 October 2019.

The user has requested enhancement of the downloaded file.


ISSN (Print) : 0974-6846
Indian Journal of Science and Technology, Vol 11(47), DOI: 10.17485/ijst/2018/v11i47/130980, December 2018 ISSN (Online) : 0974-5645

Deciphering West Philippine Sea: A Plutchik and


VADER Algorithm Sentiment Analysis
Las Johansen B. Caluza*
Information Technology and Computer Education Department, Leyte Normal University, Philippines;
lasjohansencaluza@lnu.edu.ph

Abstract
Objectives: Territorial disputes over the West Philippine Sea are an emerging issue in the Philippines and other ASEAN
Members with China. The only source of information is coming from the media like the television, online news portals, and
social media. In line with this, this study aimed to unveil Netizens emotions and common issues found from the different
news articles. Methods/Statistical Analysis: This study scraped the different conversations from Twitter and online
news articles published from January 2018 to April 2018 using a Pluchik algorithm for emotions, Valence Awareness
Dictionary sEntiment Reasoner (VADER) algorithm for sentiment analysis, and Latent Dirichlet Allocation (LDA) using
Gibbs Sampling – Monte Carlo Markov Chain (MCMC) for topic modeling. 2500 times of simulation were performed and
validated before developing the hidden themes that are prevalent in the different news articles and supported by Netizens
emotions. Findings: Results revealed five (5) underlying themes found in the different news articles such as The assertion
of Philippine Sovereignty over Wes Philippine Sea Claim, Philippine-China Co-Ownership and Joint Exploitation of Natural
Resources in the Disputed Islands, Intensive Military Presence, Freedom of Navigation, and Military Facilities Buildup
in the Disputed Island, China-Philippine Relations through Bilateral Agreement, Trade, Military Support, and Security,
and One China Policy through Territorial and Defense Power. Also, Netizens Sentiments showed a majority of them who
tweeted were neutral (95%) and only few who were negative (2.67%) and positive (2.33%) where the majority of their
emotions through tweets were classified as Joy, Surprise, and Sadness. Only a few tweets were classified as anger, disgust,
anticipation, and fear. The findings were unique since it focuses on the soft evidence and the actual opinions, views,
and emotions relative to the issues on the disputed islands. Moreover, this result could give insights to the appropriate
authorities in dealing with the situations in the West Philippine Sea. Application/Improvements: Further studies maybe
conducted by using other social media like Facebook in understanding the emotions and views of the Netizens.

Keywords: Asia, Content Analysis, Data Mining, Deciphering, Gibbs Sampling, Latent Dirichlet Allocation, Plutchik Wheel
of Emotion, Simulation, Topic Modeling, VADER Sentiment Analysis

Indonesia, Lao PDR, Malaysia, Myanmar, Philippines,


1. Introduction
Singapore, Thailand, and Viet Nam. The Philippines is
National territory is considered to be the most sacred as one of the claimants where it submitted a case against
it defines the sense of ownership and belongingness of a China known as the South China Sea Arbitration under
country and its populace. The 1987 Constitution of the Annex VII to the United Nations Convention on the Law
Republic of the Philippines defined national territory in of the Sea (UNCLOS) where it ruled and showed “no his-
Article 1. Anent to this, the disputed islands in the West torical rights” of the China1,2. After this decision made
Philippine Sea has been an issue between China, and by UNCLOS, News agencies around the world includ-
the Member State of the Association of Southeast Asian ing in the Philippines announced through television and
Nations (ASEAN) such as Brunei Darussalam, Cambodia, published both print and online about the ruling on the

*Author for correspondence


Deciphering West Philippine Sea: A Plutchik and VADER Algorithm Sentiment Analysis

disputed islands. China and other claimant countries 2. The Research Questions
had shown its stand over the disputed islands3 where
Chinese Navy expanded operational areas into the high This study aimed to unveil and described prevailing top-
seas towards the “second island chain” running along the ics and emotions about the West Philippine Sea. The
Bonins and Marianas4. Also, China’s economic muscle result of the study could be a basis in coming up with a
has translated into an enhancement of its military capaci- better and humanely solutions about the disputed island.
ties, especially its maritime capabilities propelled by its Specifically, this study seeks the following questions:
determination to safeguard its economic and core inter-
est. China has declared the South China Sea as her core • What underlying themes be generated based on
concern which implies that China is prepared even to go patterns?
to war to ensure this declared sovereignty5. In the same • What is the probability of the theme per articles?
article, Marwah5 articulated the ASEAN’s diverse opin- • Based on the post and comments made by the
ions on the South China Sea where the issue usually not Netizens in Twitter, what are their sentiments
been discussed based on the recent years of ASEAN sum- about the West Philippine Sea?
mit, in fact in July 2012 ASEAN Summit in Cambodia,
Cambodia showed its stiff resistance. While during the 3. Materials and methods
Malaysian Chairmanship of ASEAN Summit 2015, the
Philippine President asked leaders of the association to
issue a collective statement condemning China’s reclama- 3.1 Research Design
tion of the disputed waters. Among the ASEAN member This study applied Exploratory Sequential Mixed Methods
countries, the Philippines and Vietnam had shown its where the initial phase started with data collection using
resistance over China’s reclamation steps in the islands. qualitative information and analysis, then followed by
Another action that made more tension in the disputed quantitative data generation and analysis with a conclud-
island was when China started building a military outpost. ingstage of data analysis combining the two results8,9. To
A few years back, the military buildup in the conflicted support the intention, this study used content analysis
islands does not necessarily indicate that Beijing will use using relational analysis. Relational analysis, like con-
force to occupy more islands, but rather that Beijing seeks ceptual analysis, it started with data or text classification
to enhance its military presence to augment its bargaining present in the text corpus. However, it seeks to go beyond
leverage in future negotiations6. In UNCLOS, the Aquino by exploring the concepts identified. Relational analysis
government asked the panel to comment on: has also been termed semantic analysis10. This research
Whether China can lawfully make any maritime claim has six (6) steps focusing on qualitative data collection
based on its nine-dash line, either to sovereignty over the from the different sources, qualitative data processing,
waters or sovereign rights to the natural resources within and themes development. Also, it focused on themes
the waters; whether the “islands” occupied by China can validation before exhaustive discussion of the potential
claim more than twelve-nm territorial sea, or are even information hidden in the text corpus. Finally, the themes
legitimately subject of sovereignty claims; and whether were synthesized in four (4) stages such as descriptive,
China can be commanded to refrain from preventing diagnostic, predictive, and prescriptive analytics.
Philippine vessels from exploiting the living resources in
the disputed waters7. 3.2 Research Process and Data Analysis
With all of this issues about the South China Sea or the
West Philippines Sea, no study was made about under-
(Figure 1)
standing the common topics of the different news articles
published online andthe people’s feelings about the dis- Step 1: Data Selection and Scraping
puted island issues. Therefore, it is empirically essential In data selection for sentiment analysis and modeling, it
to understand and unveil prevalent topics and emotions is essential to consider the credibility of the information
of the Netizens about the conflicted islands in the West taken from the internet. The internet today is treated as
Philippine Sea to come-up with a better and humanely a rich source of literature and secondary data in social
solutions on the disputed islands. science research11. In line with this, Internet data collec-

2 Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology
Las Johansen B. Caluza

tion could indeed be problematic from source credibility In the preprocessing stage, information extraction from
– an essential issue in persuasive communication12,13,14. the text corpus is performed to identify keywords and
With this issue, when using internet-based information relationships within the text17 also known as pattern
in research, the researcher must evaluate the data to be matching. This technology is very advantageous when
collected. Documentary Sources in Social Research15 dealing with large volumes of text17,19. The preprocessing
introduce four criteria in assessing secondary sources that and cleaning methods performed in this research are:
to include published materials, computerized databases,
and syndicated services16. The four criteria for evaluat- • Stopwords Removal Methods: A stop word
ing secondary sources are Authenticity, Credibility, may be identified as a word that has the same
Representativeness, and Meaning. likelihood of occurring in those documents
not relevant to a query as in those documents
relevant to the query20. This study used the Term-
Based Random Sampling (TBRS).
• Stemming Method: This method is used to iden-
tify the root/stem of a word. For example, the words
connect, connected, connecting, and connections,
are all can be stemmed to the word “connect”21. The
idea of stemming is to reduce the total number of
distinct terms in a document or a query which in
Figure 1. Research process for sentiment analysis. turn will decrease the processing time of the final
output. In this research, N-Gram Stemmer was uti-
In this case, the researcher follows the four criteria in lized as it is language independent and hence very
choosing News Articles online. The articles were taken useful in many applications17,22. An N-gram is an
from reputed and known News Publishing Companies N-character slice of a longer string23.
in the Philippines such as Inquirer, Manila Bulletin, • Term Frequency-Inverse Document Frequency:
Manila Standard, Manila Times, and Philippine Star Term Frequency-Inverse Document Frequency
from January 2018 to April 2018. The News Articles were (TF-IDF) is a numerical statistic which reveals
scraped using a standard scraping method and removing that a word is how vital to a document in a col-
unnecessary and unrelated information. There were 25 lection. The TF- IDF is often used as a weighting
News Articles that were scraped and named after the new factor in information retrieval and text mining.
publishing company and the date of publication. Finally, The value of TF-IDF increases proportionally to
the researcher mined the conversations from Twitter with the number of times a word appears in the text
a hashtag west philippine sea or anything that talks about file but affected by the frequency of the word
the West Philippines Sea with 1292 tweets to unveil the in the corpus. This can help to control the fact
sentiments of the Netizens relative to the issue on the dis- that some words are generally more common
puted islands in the West Philippine Sea. than others17. Term Frequency (TF) is specified
as the number of times a term occurs in a docu-
Step 2: Text Preprocessing and Cleaning ment24,25. The algorithm used in this research for
Text mining is the process of extracting vital information Term Frequency was:
from big data17; others call it Text Data Mining (TDM)
and Knowledge Discovery in Textual Databases (KDT). By using R-programming language, the result is
When performing text mining, preprocessing and clean- expected to constructs or coerces a term-document
ing text play an important role.Preprocessing in text matrix or a document-term matrix26.
mining, Natural Language Processing (NLP), and infor-
mation retrieval is an important task and critical step Step 3: Topic Modeling using Unsupervised
since this is the stage where data preprocessing is used Learning
for extracting interesting and non-trivial and knowledge Deeper understanding the sentiment does not end on
from unstructured text data also known as text corpus18. identifying positivity, negativity, and neutrality. It is also

Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology 3
Deciphering West Philippine Sea: A Plutchik and VADER Algorithm Sentiment Analysis

to unfold hidden topics or themes in the articles, conver- 2. Choose , where


sations, post, comments, or anything that is written by 3. For each of the word position i, j, where , and
looking into the pattern of words found in the text corpus. a. Choose a topic
Unfolding the topics or themes can be done by using unsu- b. Choose a word
pervised machine learning in data mining called Topic
Modeling. Topic modeling algorithms are widely applied The sizes are treated as independent of all the other
to examine the thematic composition of text corpora but data generating variables (w and z). The subscript is the
remain difficult to interpret and adjust27. Topic modeling often released, as in the plate diagram shown here.
finds human-readable structures in unstructured textual However, the exact parameter inference of the LDA
data, and the most widely used technique is the Latent model is intractable, and thus, approximate estimation
Dirichlet Allocation (LDA) algorithm28. By taking into methods are needed. An approximate algorithm named
consideration the technical requirement of text analysis, Gibbs sampling31 is widely used for parameter estimation
this research utilized Latent Dirichlet Allocation (LDA)29 in topic models due to its simplicity under Dirichlet priors.
to analyze the topics of news articles online and explore
the different issues about the West Philippine Sea. Inline
with the assumption of “bag of words,” LDA represents a
document as a mixture of potential topics in which a topic
has a multinomial distribution over words. Every docu-
ment will have its mixing proportion of topics, and each
topic has its word distribution30. Based on an unsuper-
vised Bayesian learning algorithm, LDA can capture the
potential topics that represent the voices of the writers/
Figure 2. Gibbs sampler general algorithm.
authors from unstructured and enormous online news
about the West Philippines Sea.
LDA enables the discovery of potential topics from
our large unstructured review data. With the method, we
can identify an optimum number of topics, label the top-
ics, and analyze the differences and relative importance of
the topics for different text corpus30. A topic is a unique
distribution over a vocabulary of words that authors use to
describe a certain issue where in this study, it is the West
Philippine Sea. LDA is a three-layer Bayesian probability
model and a complete generative model of a document.
Following Blei, Ng, and Jordan29, Figure 2 shows the Figure 3. Plate notation representing LDA.
graphical model representation of LDA. The basic idea
performed was that documentsas represented by random Step 4: Apply Gibbs Sampling Method- an
concoctions over possible topics, where categorization of MCMC Approach for Validation
each topic by a distribution of words29.
With plane notation (Figure 3), the dependencies Gibbs sampling is a method appropriate for the job. The
among the many variables bagged concisely. The boxes idea in Gibbs sampling is to generate subsequent samples
are “plates” representing duplicates. The external plate by sweeping through each variable (or block of variables)
represents documents, while the internal plate represents to sample from its conditional distribution with the
the frequent choice of topics and words in a document. remaining variables fixed to their current values as shown
in Figure 232. The underlying logic of MCMC sampling is
that we can estimate any desired expectation by ergodic
The generative process (algorithm):
averages. That is, we can compute any statistic of a poste-
rior distribution as long as we have N simulated samples
1. Choose , where and is the Dirichlet distribution for
from that distribution:
parameter

4 Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology
Las Johansen B. Caluza

This procedure keeps on until “convergence” (the well on text originating in social media where it was
sample values have the same distribution as if they were able to include sentiment from emoticons, sentiment-
ample from the right posterior joint distribution33. related acronyms, and slang38. As presented in the work
of Gilbert and Hutto39, VADER was a mixture of quali-
Step 5: Sentiment Analysis Processes tative and quantitative approach to produce and validate
Sentiment analysis is the contextual mining of text which a standard sentiment lexicon attuned to microblog-like
identifies and extracts subjective information from the contexts. A sentiment lexicon is a collection of lexical fea-
source material and helped to see the social persuasion of tures (e.g., words) which are commonly labeled according
a particular event. However, restriction on the analysis of to their semantic orientation as either positive or nega-
social media streams to fundamental sentiment analysis tive40. In this approach, each of the words in the lexicon is
and count based metricsoccurs33. This is paralleltoscratch rated as to whether it is positive or negative, and in many
the surface and to miss out on those high-valuable insights cases,  how  positive or negative. When  VADER  analyses
waiting to be discovered. Sentiment analysis is the cog- a piece of text, it checks to see if any of the words in
nitive operation of shaping if a statement is negative, the text are present in the lexicon. VADER  produces
positive, or neutral andsome call it opinion mining.A typ- four sentiment metrics from these word ratings. The
ical use case for this technology is to discover how people first three, positive, neutral and negative, represent the
feel about a particular topic34. In this study, to unveil the proportion of the text that falls into those categories. The
sentiments of the Netizens on Twitter, the researcher uti- final metric, the compound score, is the sum of all of the
lized two unique sentiment algorithms such as Plutchik lexicon ratings (1.9 and 1.8 in this case) and standardized
Model and Valence Awareness Dictionary sEntiment to range between -1 and 141.
Reasoner (VADER).
• The Plutchik Wheel of Emotions Model Step 6: Interpretation and Exhaustive
In everyday human existence conceive of emotion – Discussion
anger, despair, joy, grief – as a feeling, an inner state35.
In interpreting the results, the researcher should clearly
Today’s generation, sentiment analysis is rarely used
describe the findings in details. Followed by an exhaustive
to unveil untold story and emotions of Internet Users
discussion about the phenomenon that revealed and must
towards specific issues in politics, education, business
be supported by the literature. The discussion should
and marketing, and other human experiences written in
answer the gaps stated in the introduction. Depth articu-
books, magazines, news articles, social media commen-
lation of the findings by answering how did it happen, why
taries, and another type of conversations. In line with this,
did it happen, and what should be doneis just some idea in
Plutchik introduces eight (8) primary emotions: joy, and
presenting the analysis in the discussion part.
sadness, anger and fear, trust and disgust, and surprise
and anticipation36. Moreover, Plutchik’s model shows
connectivity between the ideas of an emotion circle with
3.3 Tools and Software Used
a color wheel as shown in Figure 4. These eight emo- This study utilized Python programming language in
tions are assumed to be complete in the sense that any interpreting mathematical foundations and algorithms,
expressed emotion is related or subsumed by one of the R-programming for data visualization and processing
eight. In his work, Plutchik states that these emotions are through the help of python programming, Orange 3 soft-
culturally independent. Given this assumption, we can ware, and Web scraper.
apply this model to any given language, which we con-
sider to be a strong point37. 3.4 Ethical Consideration and Reflexivity
• Valence Awareness Dictionary sEntiment Ethical standards in conducting research must strictly fol-
Reasoner (VADER) Model low. That is why in this research, the researcher opted not
In support of Plutchik Wheel of Emotion model, this to reveal the origin of the news articles published online
study utilized the VADER Model to enhance the meaning nor either to reveal the authors of the news to avoid future
of the sentiments about a particular issue like the West predicaments and legal issues. While in obtaining tweets
Philippine Sea News Articles online. VADER is a lexicon in twitter, the researcher applies for API access key and
and rule-based sentiment analysis library. It performs security where terms and conditions in using tweet infor-

Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology 5
Deciphering West Philippine Sea: A Plutchik and VADER Algorithm Sentiment Analysis

mation are stipulated. The names of the Netizens were underlying themes came from the patterns of words in
removed, and only the content or the tweets remained every topic. Based on the result, topic1 theme talked about
and used in the study. After the analysis and interpreta- Assertion of Philippine Sovereignty over West Philippine
tion, the tweet databank was automatically destroyed Sea Claim, topic twothemestalked about Philippine-China
to prevent any mishandling and violations in the terms Co-Ownership and Joint Exploration of Natural Resources
and conditions where the researcher agreed. Finally, the in the Disputed Islands, and topicthreethemestalked about
position of the researcher in this research wasunequivo- Intensive Military Presence, Freedom of Navigation, and
cal; his intention was only to reveal underlying themes Military Facilities Buildup in the Disputed Island. Also,
in the different articles and Netizens reactions by study- topic 4 talked about China-Philippine Relations through
ing its emotions towards the conflicted islands in the Bilateral Agreement, Trade, Military Support, and Security
West Philippines Sea. The interpretation made by the and topicfivethemestalked about One China Policy
researcher maybe affected since he is a Filipino teacher through Territorial and Defense Power.
in a state university in the Philippines. His interpretations
are based on his capacity and readings about the subject. Table 1. Articles to topic assignment
Articles Topics
4. Results and Discussion Art1.txt 3
Art2.txt 1
Art3.txt 4
4.1 Latent Themes of News Articles about
Art4.txt 2
the West Philippine Sea
Art5.txt 3
Articles taken from the different leading news online pro-
Art6.txt 5
viders in the Philippines were assigned to topics using the
LDA algorithm. As shown in Table 1, articles are labeled Art7.txt 4
with the same topic number indicates common story or Art8.txt 3
underlying themes such as Art1.txt, Art5.txt, Art8.txt, Art9.txt 3
Art9.txt, Art10.txt, Art13.txt, Art19.txt, and Art25.txt Art10.txt 3
were auto-assigned to topic three (3). While articles Art2. Art11.txt 1
txt, Art11.txt, Art12.txt, Art16.txt, Art18.txt, and Art23.
Art12.txt 1
txt were auto-assigned to topic one (1). Also, topic four (4)
was auto-assigned to articles Art3.txt, Art7.txt, Art15.txt, Art13.txt 3
and Art21.txt. Moreover, topic two (2) was auto-assigned Art14.txt 2
to articles Art4.txt, Art14.txt, Art20.txt, Art22.txt, and Art15.txt 4
Art24.txt. Finally, articles Art6.txt and Art17.txt were Art16.txt 1
auto-assigned to topic five (5). Art17.txt 5
Based on the result, it shows that topic three (3) were
Art18.txt 1
the most common topics among the different articles from
the different online news provider. This was followed by Art19.txt 3
topic one (1),then topic two (2), followed by topic four Art20.txt 2
(4), and finally topic five (5). The result shows that there is Art21.txt 4
an emerging theme among the different articles based on Art22.txt 2
topic assignment. This issue could be of high impact and Art23.txt 1
public concern relative to the disputed islands in the West
Art24.txt 2
Philippine Sea.
In understanding further, the topics assigned to every Art25.txt 3
article, LDA generateda list of topicswith terms. As shown
in Table 2, it presented only the top 20 words from over a Theme 1: Assertion of Philippine Sovereignty over West
thousand words assigned to every topic. Meanwhile, the Philippine Sea Claim

6 Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology
Las Johansen B. Caluza

Table 2. Topic to terms and generation of latent themes


Top 20 Words Topic 1 Topic 2 Topic 3 Topic 4 Topic 5
1 sea exploration china china china
2 west joint sea forces one
3 china international south foreign chinese
4 duterte philippines military think policy
5 philippines gas islands navy world
6 government law area maritime time
7 rise resources disputed sovereignty war
8 president agreement freedom development country
9 rights percent naval areas power
10 filipino disputed aircraft countries asia
11 statement part navigation relations national
12 claim energy reef philippines secretary
13 sovereign ruling continue agreements conduct
14 now tribunal artificial bilateral dispute
15 within oil united issues administration
16 exclusive president used trade defense
17 natural court protest security nation
18 must marine facilities support interest
19 country legal defense develop foreign
20 manila coownership building ministry territorial
Underlying The assertion Philippine-China Intensive Military China-Philippine One China
Themes of Philippine Co-Ownership and Presence, Freedom Relations Policy through
Sovereignty over Joint Exploration of of Navigation, and through Bilateral Territorial and
West Philippine Natural Resources Military Facilities Agreement, Trade, Defense Power
Sea Claim in the Disputed Buildup in the Military Support,
Islands Disputed Island and Security

The Philippine Government take a stand on the rights advantages than the Philippines, from economic to mili-
over conflicted islands in the West Philippine Sea when tary and the capability to occupy the West Philippine Sea.
UNCLOS in favor the ruling to the Philippines and
emphasizing China’s “no historical rights”1,2. In a report Theme 2: Philippine-China Co-Ownership and Joint
conducted by42, the Philippine has emphasizedrights to Exploration of Natural Resources in the Disputed Islands
Scarborough Shoal as well as 50 [Spratly island] feature The West Philippine Sea is rich in natural resources
which is known collectively as the Kalayaan Island Group like marine life and oil and gas. As reported by Forbes,
(KIG). In the same report, the Philippines claims to sov- state-owned oil major China National Offshore Oil
ereignty over the Scarborough Shoal and the KIG both Company (CNOOC), a significant amount of oil and
legally and historically. It implies that the Philippines has gas reserved from unexposed areas, although the num-
all the rights to claim the conflicted islands in the West bers were unconfirmed through research43. With this, the
Philippine Sea as the tribunal itself had made its ruling Philippine Government action is to avoid confrontation
relative to the case filed by the Philippine Government. against China and considering sealing a pact with China
However, what remains unclear is as to who will imple- for joint exploration West Philippine Sea44. Moreover, in
ment such ruling. Considering that China has enormous a recent development, it was reported that the Philippine

Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology 7
Deciphering West Philippine Sea: A Plutchik and VADER Algorithm Sentiment Analysis

Government is considering at least two (2) areas in the the relations with China. Bilateral cooperation where
South China Sea for possible joint exploration with made such as tourism, economy, military, and security in
China45. The Philippine President and Chinese President the region49. According to50, Duterte’s deeper engagement
have agreed in principle on jointexploration in the South and constructive approach towards China has already
China Sea46. As a result, the Philippine Government is started to yield economic dividends: bilaterally, multilat-
taking its risk to have a joint venture with China in oil and erally, and institutionally. Bilaterally, as a consequence of
gas exploration in the West Philippine Sea. This could be President Duterte’s state visit to China, the Philippines
an excellent action by the Philippine Government since it were able to procure $24 billion worth of financial help
does not have the capability and machinery in exploring and investment pledges ($15 billion in business deals and
and mining oil and gas. Moreover, if this will prosper and $9 billion in official development aid). Part of this is a $3
with favorable outcome, the Philippines will surely avail billion poverty relief credit access that will be made avail-
massive advantage in oil and gas as this could also propel able to micro, small, and medium enterprises (MSMEs).
the economy of the country and the living condition of
its citizens. Theme 5: One China Policy through Territorial and
Defense Power
Theme 3: Intensive Military Presence, Freedom of The One China Policy givesthe boundaries of claimants
Navigation, and Military Facilities Buildup in the Disputed on the different disputed islands, including in the South
Island China Sea showing the nine-dash line including Taiwan48.
The intensive military presence in the conflicted islands The military presence in the different islands and building
in the West Philippine Sea hasincreased in the past few outpost where among its activities47. It is the unification
years. As reported in Kyodo News, the commander of of all territories of China based on its historical rights
the U.S. Pacific Command has warned of China’s grow- showing the nine-dash line in the West Philippine Sea.
ing military might, saying Beijing has unilaterally built However, with this foreign policy that China imposes,
“seven new military bases” in the South China Sea. China member countries of the ASEAN organization and
is nearly done building air and naval facilities on seven Taiwan reacted and apprehensive due to unacceptable
on islands it claims in the South China Sea, recent sur- move or activities in the disputed islands where military
veillance photos obtained by a Philippine newspaper and infrastructures and presence are visible.
published47. Following this events, China’s first-ever land- The probabilities of having the same topics in the different
ing of the heavy bomber on a disputed island in the South articles were shown in Table 3. For example, regard-
China Sea punctuates a steady build up of military assets ing topic 3 with a theme Intensive Military Presence,
that has solidified Beijing’s claims to one of the world’s
Freedom of Navigation and Military Facilities Buildup
busiest shipping lanes48. Anent to this, the Philippine did
in the Disputed Island, article1 has 58.19% probability,
not do anything about this events, many critics are very
article5 has 30.77%, article8 has 58.75% probabilities and
vocal suggesting to protest and ask to stop building a mili-
so on as so forth. While topic1 with a theme Assertion
tary outpost in the disputed islands in the West Philippine
of Philippine Sovereignty over West Philippine Sea Claim,
Sea. If this will continue, most islands in the South China
article2 has 52.19%, article11 has 46.92%, and article12
Sea or West Philippine Sea will be occupied and con-
trolled by China. As a result, the Philippine Government has 32.76% probabilities and so on as so forth. Also, topic2
will have no control over these islands including the natu- with a theme Philippine-China Co-Ownership and Joint
ral resources it has. Exploration of Natural Resources in the Disputed Islands,
article4 has 43.21%, article14 has 32.38%, and article20
Theme 4: China-Philippine Relations through Bilateral has 50.76% probabilities and so on as so forth. Moreover,
Agreement, Trade, Military Support, and Security topic4 with a theme China-Philippine Relations through
When President Duterte became President, the Bilateral Agreement, Trade, Military Support, and Security,
Philippine-China relation improved. According to article3 has 33.85%, article7 has 50.17, and article15 has
Foreign Minister Wang Yi in one of the press meeting, 53.47% probabilities. Finally, topic5 with a theme One
since President Duterte came into office, he appropriately China Policy through Territorial and Defense Power, arti-
handled the South China Sea issue and actively improved cle6 has 38.93%, and article17 has 77.74% probabilities.

8 Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology
Las Johansen B. Caluza

Table 3. Article-topic probability of latent themes.


Articles Topic1 Topic2 Topic3 Topic4 Topic5
1 0.0819 0.0621 0.5819 0.2203 0.0537
2 0.5219 0.1594 0.1355 0.1036 0.0797
3 0.2154 0.159 0.1282 0.3385 0.159
4 0.2815 0.4321 0.121 0.0963 0.0691
5 0.1612 0.1465 0.3077 0.2454 0.1392
6 0.1557 0.2254 0.123 0.1066 0.3893
7 0.1463 0.216 0.0383 0.5017 0.0976
8 0.0791 0.06 0.5875 0.1583 0.1151
9 0.1131 0.1237 0.4841 0.1943 0.0848
10 0.069 0.0668 0.5234 0.2539 0.0869
11 0.4692 0.1475 0.193 0.0697 0.1206
12 0.3276 0.084 0.1578 0.3019 0.1286
13 0.235 0.068 0.501 0.1068 0.0893
14 0.249 0.3238 0.1571 0.1877 0.0824
15 0.1748 0.1388 0.0617 0.5347 0.09
16 0.304 0.1498 0.2775 0.1366 0.1322
17 0.0776 0.0537 0.0479 0.0434 0.7774
18 0.5854 0.0941 0.1324 0.1167 0.0714
19 0.2418 0.087 0.3913 0.1087 0.1712
20 0.2985 0.5076 0.085 0.0523 0.0566
21 0.2125 0.1 0.1875 0.3563 0.1438
22 0.1836 0.457 0.1172 0.1563 0.0859
23 0.5392 0.1054 0.0784 0.1814 0.0956
24 0.321 0.4233 0.0741 0.0741 0.1076
25 0.2426 0.0721 0.4754 0.082 0.1279
Underlying The assertion of Philippine-China Intensive Military China-Philippine One China
Themes Philippine Sovereignty Co-Ownership Presence, Freedom Relations Policy through
over West Philippine and Joint of Navigation, and through Bilateral Territorial
Sea Claim Exploration of Military Facilities Agreement, and Defense
Natural Resources Buildup in the Trade, Military Power
in the Disputed Disputed Island Support, and
Islands Security

4.2 Sentiments on the West Philippine Sea arbitration terms were obvious. Finally, 95% tweets were
neutral sentiments with Philippine, assert, want, right,
The sentiments of the Netizens towards the disputed
China, and red terms were apparent. In understanding
islands in the West Philippine Sea shown in Figure 5 is
sentiments, Netizens tweets were processed showing the
very evident. 2.67% tweets belong to negative sentiments
different clusters of the Plutchick wheel of emotions. As
with a pro, mean, gained, duterte, spooning, big, wonder,
shown in Figure 6, Joy (457 tweets) has the most predom-
antagonize, and expense terms were the most obvious.
inant emotion, followed by surprise (289 tweets), then
While 2.33% tweets that belong to positive sentiments
sadness (271 tweets), and trust (184 tweets). Lastly, fear
with assert, candle, down, sit, general, intelligent, and

Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology 9
Deciphering West Philippine Sea: A Plutchik and VADER Algorithm Sentiment Analysis

(84 tweets), anticipation (3 tweets), and anger and disgust


(2 tweets).

Figure 6. Netizens emotions on the West Philippine Sea.

This sentiments and emotions should not be neglected


as these are the actual feelings of the Netizens about the
disputed islands. The people should be consulted and be
heard about their personal feelings and emotion before
making decisions.

Figure 4. Plutchik wheel of emotion model. 5. Conclusion


In this study, it reveals unexpected prevailing themes
hidden in the different news articles online were articu-
lately explored and exhaustively presented and discussed.
Among those themes were Assertion of Philippine
Sovereignty over West Philippine Sea Claim, Philippine-
China Co-Ownership and Joint Exploration in the
Disputed Islands, Intensive Military Presence, Freedom
of Navigation, and Military Facilities Buildup in the
Disputed Island, China-Philippine Relations through
Bilateral Agreement, Trade, Military Support, and
Security, and One China Policy through Territorial and
Defense Power. At the same time, the opinions, emotions,
and feelings of the Netizens about the current events
through news online was revealed through understanding
and exploring tweets. It revealed varied emotions where
most of the tweet clustered as joy, surprise, trust, and sad-
Figure 5. Netizens sentiments on the West Philippine Sea. ness. Some emotions were fear, anticipation, anger, and
disgust. Finally, the process made in this study on senti-
The result implies that, despite the low percentage of ment analysis and topic modelings give a clear view and
positive sentiments, many Netizens were happy, surprised, directions for future researchers.
and trust in the recent actions made by the Philippine
Government relative to the disputed island. In contrary,
many were sad, feared, anger and disgust, and anticipated
6. References
on some activities made by China and the Philippines in 1. Perlez J. Tribunal Rejects Beijing’s Claims in South China
the West Philippine Sea. Sea. The New York Times; 2016.

10 Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology
Las Johansen B. Caluza

2. Press Release - The South China Sea Arbitration. Available 15. Scott JA. A matter of record: Documentary sources in social
from: https://pca-cpa.org/en/news/pca-press-release-the- research. UK: John Wiley & Sons; 2014. p. 1–200.
south-china-sea-arbitration-the-republic-of-the-philip- 16. FEP Market Research, LGE 508. Exploratory Research
pines-v-the-peoples-republic-of-china/ Design: Secondary Data; 2018. p. 1–14.
3. Kaplan RD. Asia’s Cauldron: The South China Sea and the 17. Vijayarani S, Ilamathi J, Nithya M. Preprocessing Techniques
End of a Stable Pacific. New York: Random House; 2014. for Text Mining - An Overview. International Journal of
p. 1–256. Computer Science and Communication Networks. 2015;
4. Kotani T. Geopolitical change in Asia and legal warfare at 5(1):7–16.
sea: Maritime Asia. In Peace and Stability in Asia Pacific 18. Preprocessing Techniques for Text Mining. Available from:
Region: Assessment of the Security Architecture. New https://www.researchgate.net/publication/273127322_
Delhi: Vij Books India Pvt. Ltd; 2012 Preprocessing_Techniques_for_Text_Mining
5. Marwah R. Interrogating competing claims in South East 19. Gupta V, Lehal GS. A Survey of Text Mining Techniques
Asia: The South China Sea or the West Philippine Sea. and Applications. Journal of Emerging Technologies in Web
China’s Global Rebalancing and the New Silk Road; 2018. Intelligence. 2009; 1(1):60–76. https://doi.org/10.4304/
p. 191–204. jetwi.1.1.60-76
6. Hyer E. The South China Sea disputes: implications of 20. Wilbur JW, Sirotkin K. The Automatic Identification of Stop
China’s earlier territorial settlements. Pacific Affairs. 1995; Words. Journal of Information Science. 1992; 18(1):45–55.
68(1):34–54. https://doi.org/10.2307/2759767 https://doi.org/10.1177/016555159201800106
7. Beckman RC, Townsend-Gault I, Schofield C, Davenport T, 21. Ramasubramanian C, Ramya R. Effective Pre Processing
Bernard L. Beyond territorial disputes in the South China Activities in Text Mining using Improved Porter»s
Sea: Legal frameworks for the joint development of hydro- Stemming Algorithm. International Journal of Advanced
carbon resources. NUS Centre for International Law series; Research in Computer and Communication Engineering.
2013. p. 1–384. 2013; 2(12):4536–8.
8. Creswell JW, Clark VL. Designing and conducting mixed 22. Cavnar WB, Trenkle JM. N-gram-based text categorization.
methods research. 2nd ed. Thousand Oaks, CA: SAGE Ann Arbor Michigan.1994; 48113(2):161–75.
Publications; 2017. p. 1–520. PMCid:PMC5240849 23. Jivani AG. A comparative study of stemming algorithms.
9. Berman E. An exploratory sequential mixed methods International Journal of Computer Technology Application.
approach to understanding researcher’s data management 2011; 2(6):1930–8.
practices at UVM: Integrated findings to develop research 24. Charanyaa S, Sangeetha K. Term frequency based sequence
data services. Journal of eScience Librarianship. 2017; generation algorithm for graph based data anonymization.
6(1):1–24. https://doi.org/10.7191/jeslib.2017.1104 International Journal of Innovative Research in Computer
10. Palmquist ME, Carley KM, Dale TA. Applications of and Communication Engineering. 2014; 2(2):3033–40.
Computer-Aided Text analysis: Analysing literary and 25. Menaka S, Radha N. Text classification using keyword
nonliterary texts. Text Analysis for the Social Sciences: extraction technique. International Journal of Advanced
Methods for Drawing Statistical Inferences from Texts and Research in Computer Science and Software Engineering.
Transcripts. 1997. p. 131–45. 2013; 3(12):734–40.
11. Befield JA, Szlemko WJ. Internet-based data collection: 26. Feinerer I, Hornik K, Feinerer MI. Text mining infrastruc-
Promises and realities. Journal of Research Practice. 2006; ture in R. Journal of Statistical Software. 2008; 25:1–54.
2(2):1–15. https://doi.org/10.18637/jss.v025.i05
12. Hong T. The influence of structural and message features 27. El-Assady M, Sevastjanova R, Sperrie F, Keim D, Collins C.
on web site credibility. Journal of the American Society for Progressive learning of topic modeling parameters: a visual
Information Science and Technology. 2006; 57(1): 114–27. analytics framework. IEEE transactions on visualization
https://doi.org/10.1002/asi.20258 and computer graphics. 2018; 24(1):382–91. https://doi.
13. Hovland CI, Weiss W. The influence of source credibility on org/10.1109/TVCG.2017.2745080 PMid:28866566
communication effectiveness. Public Opinion Quarterly. 28. Agrawal A, Fu W, Menzies T. What is wrong with topic
1951;15(4):635–50. https://doi.org/10.1086/266350 modeling? And how to fix it using search-based software
14. Olson JM, Cal A. Source credibility, attitudes, and the engineering. Information and Software Technology. 2018;
recall of past behaviours. European Journal of Social 98:74–88. https://doi.org/10.1016/j.infsof.2018.02.005
Psychology. 1984; 14(2):203–10. https://doi.org/10.1002/ 29. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation.
ejsp.2420140207 Journal of Machine Learning Research. 2003; 3:993–1022.

Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology 11
Deciphering West Philippine Sea: A Plutchik and VADER Algorithm Sentiment Analysis

30. Wang W, Feng Y, Dai W. Topic analysis of online reviews for 41. Using VADER to handle sentiment analysis with
two competitive products using latent Dirichlet allocation. social media text. Available from: http://t-redactyl.io/
Electronic Commerce Research and Applications. 2018; blog/2017/04/using-vader-to-handle-sentiment-analysis-
29:142–56. https://doi.org/10.1016/j.elerap.2018.04.003 with-social-media-text.html
31. Griffiths TL, Steyvers M. Finding Scientific Topics. 42. Rosen ME. Philippine claims in the South China Sea: A
Proceedings of the National Academy of Sciences of the legal analysis. A CNA occasional paper. Arlington, VA,
United States of America. 2004; 101(1):5228–35. PNAS. USA; 2014. p. 1–66.
https://doi.org/10.1073/pnas.0307752101 PMid:14872004 43. Why the South China Sea has More Oil Than you
PMCid:PMC387300 Think? Available from: https://www.forbes.com/sites/
32. Bayesian Inference: Gibbs Sampling. Rochester, New York, timdaiss/2016/05/22/why-the-south-china-sea-has-more-
USA. Available from: http://www2.bcs.rochester.edu/sites/ oil-than-you-think
jacobslab/cheat_sheet/GibbsSampling.pdf. 44. Philippine Eyes Joint Exploration Deal with China in South
33. Sentiment analysis: Concept, analysis and applications. China Sea within Months. Available from: https://www.reuters.
Available from: https://towardsdatascience.com/sentiment- com/article/us-philippines-china-southchinasea-energ/
analysis-concept-analysis-and-applications-6c94d6f58c17 philippines-eyes-joint-exploration-deal-with-china-in-
34. Sharma N, Pabreja R, Yaqub U, Atluri V, Chun SA, Vaidya J. south-china-sea-within-months-idUSKBN1HG10G
We-based application for sentiment analysis of live tweets. 45. Palace: 2 S. China Sea areas considered for joint explora-
Proceedings of the 19thAnnual International Conference tion with China. Available from: http://news.abs-cbn.com/
on Digital Government Research: Governance in the Data news/03/02/18/palace-2-s-china-sea-areas-considered-for-
Age. Delft, The Netherlands: ACM; 2018. joint-exploration-with-china
35. Plutchik R. The nature of emotions: Human emotions 46. Duterte and Xi Endorse Joint Exploration in South China
have deep evolutionary roots, a fact that may explain Sea. Nikkei Asian Review. Available from: https://asia.
their complexity and provide tools for clinical prac- nikkei.com/Politics/International-Relations/Duterte-and-
tice. American scientist. 2001; 89(4):344–50. https://doi. Xi-endorse-joint-exploration-in-South-China-Sea
org/10.1511/2001.4.344 47. China’s Military Facilities in South China Sea ‘Almost Ready.
36. Chafale D, Pimpalkar A. Review on developing corpora for Available from: https://www.straitstimes.com/asia/se-asia/
sentiment analysis using plutchik’s wheel of emotions with chinas-military-facilities-in-south-china-sea-almost-ready
fuzzy logic. International Journal of Computer Sciences 48. Bombers in the South China Sea: Beijing Grows Its Military
and Engineering (IJCSE). 2014; 2(10):14–8. Presence on Disputed Islands. Available from: https://www.
37. Tromp E, Pechenizkiy M. Rule-based emotion detection on wsj.com/articles/bombers-in-the-south-china-sea-beijing-
social media: putting tweets on Plutchik’s wheel.arXiv pre- grows-military-presence-on-disputed-islands-1526804539
print arXiv; 2014. p. 1–6. 49. Philippines - China Relations. Global Security. Available
38. Quick Social Media Sentiment Analysis with from: https://www.globalsecurity.org/military/world/phil-
VADER. Available from: https://medium.com/@ ippines/forrel-prc.htm
aneesha/quick-social-media-sentiment-analysis-with- 50. Philippines-China Relations: A deepening partner-
vader-da44951e4116 ship. Asian Pacific Pathways to Progress Foundation,
39. Gilbert CJ, Hutto E. Vader: A parsimonious rule-based Inc. Available from: http://appfi.ph/publications/
model for sentiment analysis of social media text. 8th commentaries/1311-philippines-china-relations-a-deep-
International Conference on Weblogs and Social Media ening-partnership
(ICWSM-14); 2014. p. 1–10.
40. Liu B. Sentiment Analysis and Subjectivity. In Handbook
of natural language processing. 2nd ed. CRC Press - Tylor
& Francis Group: A Chapman & Hall Book; 2010. p. 1–38.

12 Vol 11 (47) | December 2018 | www.indjst.org Indian Journal of Science and Technology

View publication stats

You might also like