Professional Documents
Culture Documents
Keyness of The British Online News in 2022
Keyness of The British Online News in 2022
Department of
Applied Linguistic
Presented by:
The student of the group FL-31
Teliatynska Nataliia
Supervised by:
Dilai Marianna
Lviv 2022
2
Abstract
The primary purpose of the research is to examine specific modifiers using the
corpus-based form of study. Keyword analysis is one of the most widely used
methods in corpus linguistics. The method is used to generate keywords which
provide an indication of concepts in texts or a corpus. Keyword analysis tools
commonly produce resulting keywords presented as a list which rather poorly
indicates what the corpus is about since it typically requires analysts’ knowledge on
conceptual associations between keywords. Therefore, common follow-up methods
of keyword analysis are to examine concordances, collocational patterns, and some
other patterns of associations
between keywords and contexts. This study focuses on the association within a group
of keywords by constructing a representation of a keyword list as keyword clusters.
This work summarizes the previous studies and also conducts research using the
News on the web corpus. The corpus helps us to receive diverse pieces of information
in one place, including British news in a given year of publication and also
automatically identified keywords.
The keywords for an analysis were generated from one corpora; the target
corpus was collected from the articles in British online news in 2022. The selection
with the key words includes a noun, adjective, adverb, verb. Next, it is a semantic
analysis of all keywords
Table of contents
Abstract..........................................................................................................................4
Table of contents...........................................................................................................5
Introduction...................................................................................................................6
Chapter 1: Theoretical and methodological background...............................................8
1.1 An introductory survey.........................................................................................8
1.2 Corpus linguistics.................................................................................................9
1.3 Establishment in corpus linguistics....................................................................10
1.4 Method: How Are Key Words Calculated?.......................................................11
1.5 Problems, Limitations, and Extensions of the Technique..................................12
1.6 The NOW corpus...............................................................................................13
1.7 What is semantic analysis in linguistics?...........................................................14
1.8 Elements of Semantic Analysis..........................................................................14
Chapter 2. Presentation and analysis of data...............................................................16
2.2. The general information about the words, PEOPLE, FEEL, HUGE and
TODAY....................................................................................................................19
2.3 Semantic analysis of the noun PEOPLE............................................................21
2.4 Semantic analysis of the verb to FEEL..............................................................23
1. The first example...............................................................................................23
2. The second example..........................................................................................24
2.4 Semantic analysis of the adjective HUGE.........................................................25
1. The first example...............................................................................................25
2.4 Semantic analysis of the adverb TODAY.............................................................27
Conclusions.................................................................................................................28
References...................................................................................................................31
4
Introduction
The target issue is quite pertinent nowadays. 2022 was an eventful year.
Therefore, the press of any country, including Great Britain, wrote many articles on
topics that worried many people. Keywords will help people to understand what the
UK media wrote about and in what context they used those words. In addition, it will
be useful for foreigners to compare the main keywords from British news with news
from their country. English is the language of communication not only of the British,
but also of the whole world.
5
The subject of this research is the semantic analys of the keywords in british
online news in 2022.
The paper used the mixed methods approach. It involves gathering and
analyzing qualitative and quantitative data to comprehend a problem better and
answer the research questions.
The material used in the work was received from various websites, books,
databases, online dictionaries, etc. The main source is the data from the Corpus
NOW.
Semantic analysis is the process of relating syntactic structures, from the levels
of phrases, clauses, sentences and paragraphs to the level of the writing as a whole, to
their language-independent meanings.
Outside of corpus linguistics, the term "keyword" is widely used, but it is often
interpreted differently. Keywords by Raymond Williams was not the first book to
discuss keywords. Despite this, it has helped popularise the notion that certain words
- such as democracy, private, and industry - capture the essence of particular social,
cultural, or political ideas, viewpoints, or discourses. Williams' work has inspired the
creation of a journal: A Journal of Cultural Materialism keywords. Such words are
seen as essential based on "readings" of their symbolic and discursive structuring
functions. A keyword in corpus linguistics is a term for a word statistically indicative
of a text or a collection of texts[5, p.1-2]. This base has two distinct benefits. First
off, it is less susceptible to the whims of arbitrary assessments of cultural
significance. Second, it can disclose elements that researchers were unaware were
significant, as opposed to relying on researchers choosing items that could be
important and then confirming their value.
But we wouldn't want to contest that the two different kinds of keywords,
qualitatively-defined and quantitatively-defined, are entirely separate. Quantitatively-
defined keywords can indeed be of social, cultural or political significance if they are
characteristics of social, cultural or political texts by doing qualitative assessments on
the keywords discovered through initial. Most research using the quantitative concept
of keywords to analyse specific texts across various genres has used this strategy.
Several examples are Warren on engineering texts, Gerbig on trip writing, and
Jeffries and Walker on newspaper representations of New Labour. Studies of literary
texts include those by Archer and Bousfield, Culpeper, and Fischer-Starcke on
Shakespearean characters, Tribble on romantic literature, and Bednarek and McIntyre
on dialogue in contemporary theatre. In addition to Fraysse-Kim's study of Korean
8
school textbooks, Philip's study of Italian political speeches and news releases is
another study conducted in a language other than English[6, p. 109-110].
The nature of keywords has been the subject of much research; moving
forward, the phrase "keyword (s)" will always refer to quantitatively-defined things
unless otherwise mentioned. In addition to Fraysse- Kim's study of Korean school
textbooks, Philip's study of Italian political speeches and news releases is another
study conducted in a language other than English[6, p.112-113]. However, the
proliferation of studies like those in the previous paragraph did not begin until
computer tools were developed, particularly Mike Scott's WordSmith Tools. Scott's
study is noteworthy, resulting in Scott and Tribble. The discipline of keyword studies
had matured by the middle of the 2000s, with researchers debating the idea of a
keyword[18].
Computers and text that can be read by machines have made it feasible to
obtain data quickly and easily, as well as to have it displayed in a way that is
acceptable for study.
known as corpus linguistics. Finding the data is not the corpus linguist's primary
responsibility; rather, it is analysis. Computers are an important and occasionally
necessary instrument in this process.
In the history of linguistics, the use of actual texts as examples while studying
language is not a brand-new problem. However, because of the fantastic opportunities
provided by the computer processing of natural language, corpus linguistics has
advanced significantly over the past few decades. What precisely is corpus
linguistics? Through corpora, corpus linguistics approaches the study of language in
use (singular: corpus). A corpus is a sizable, logical collection of electronically stored
instances of language that occur spontaneously[6, p.119-120]. Corpus linguistics is
unable to offer contradictory proof. This means that a corpus can only tell us what is
present or absent in the corpus, not what is feasible or accurate or not possible or
what is improper in language. Many teachers are under the impression that if a corpus
does not include every possible way to represent a certain topic, it is inherently
flawed. Instead, teachers should think that if a corpus doesn't provide a certain way to
convey a given notion, perhaps that way isn't particularly prevalent in the register the
corpus is meant to reflect. Corpus linguistics can only tell us what is, not why
something is the way it is. We utilize our intuition as language users to determine
why. There are many scholars that work in this sphere. For instance, Eniko Csomay
from Georgia State Universities, or Paul Rickman and Juhani Rudanko, authors of a
book “Corpus-Based Studies on Non-Finite Complements in Recent English”.
Scientists from Ukraine that used such an approach to the language studying are
Levchenko O. and Dilai M. from Lviv Polytechnic University.
The concept of keywords and the method of keyword analysis were established
and made famous in corpus linguistics. Popularised most notably by Mike Scott
through WordSmith Tools' KeyWords feature, a program that makes it relatively
quick and easy to calculate the frequency of each word in the target data, and a
comparison dataset is a chore for a researcher. Compare the frequency of the same
10
terms statistically to identify significant changes, and then rank the keywords in the
findings according to the degree of relevance.
provide some complexity, which we shall go into more detail about below.
Conceptually, the comparison is once more quite simple. We use a metric or formula
to compare the relative frequencies of each word in the two texts (i.e., percentages of
occurrence in the word-frequency lists). The ratio of the disparity in relative
frequencies to the statistic's value, or "keyness," In other words, the value of the
statistic or "keyness" is greater the more significant the difference in relative
frequencies. Sorting the words according to their keyness is the third and last step of
the procedure[16, p.2-4]. The most intriguing terms with the highest keyness values
will thus appear at the top of the list, all other things being equal. The least exciting
words, whose relative frequency is similar in the two texts, are listed toward the
bottom of the keywords list. We can further distinguish between positive and
negative keywords. Positive keywords are those which are "overused" in the first
text: their relative frequency is higher in this text compared to the second or reference
text. Negative keywords are said to be "underused" in the first text relative to the
second since their relative frequency is lower in this text compared to the reference
corpus. In most cases, it is the positive keywords that are most interesting since they
tell us about what occurs more often in the first text [16, p.4].
In addition to the caveats already expressed above, there have been some
criticisms of the keywords approach. Berber Sardinha [2] highlighted one drawback:
there usually are far too many keywords for the researcher to analyse. Baker (2004a)
noted three points about the technique. First, "a keyword analysis will focus only on
lexical differences, not lexical similarities" [1 p. 349]. Comparing, as Baker did, two
corpora of gay and lesbian erotic narratives to general corpora such as Brown or LOB
would produce different keywords than when comparing the erotic narratives to each
other. This reinforces the careful choice of a reference corpus as of prime importance.
Second, a word may be key because it frequently occurs in one part of a corpus.
Hence, examining the range or dispersion of a keyword is important. Third,
"keywords only focus on lexical differences, rather than semantic, grammatical, or
functional differences" [1 p. 354]. It is possible to find cases where a word is key
12
when it appears with several different meanings in the text and, in contrast, cases
where a word does not get marked as key because counting all of its senses together
hides the fact that one of the senses is key when counted separately. Gries [3 p.116]
also stated one obvious limitation of keyword variability studies "they have little or
nothing of interest to offer a linguist who is primarily interested in grammatical or
other phenomena." Recent research has pointed out the limitations of using one
frequency count to represent the within-group variation of occurrences in a corpus
problems of the assumption of independence and has proposed the use of a new
effect size metric to complement existing significance measures In terms of revisions
to the technique, Scott [19], right from the start, extended the procedure to find "key
keywords."[20, p.4-5] These are words that are shown to be key in a number of files
within a corpus. Mahlberg showed that the keyness procedure applied to lists of
clusters (recurrent phrases or n-grams) could produce useful results for studying local
textual functions in literary stylistics. In order to address the criticisms of the
keyword procedure discussed above, Rayson [4] proposed an extension to include
key parts of speech and key semantic domains as implemented in the Wmatrix
software. This exploited automatic corpus-annotation tools that assign a grammatical
label and semantic field tag to every word or phrase in a corpus. The frequencies of
these tags were counted to produce tag-frequency lists and then the keyness
calculation was applied to those lists in addition to the word-level lists. As a result, at
the semantic level, words are grouped into semantic fields that do not emerge from
the word-level analysis, thus allowing a richer and deeper set of key items to emerge.
The practical problem of too many words to examine is also partially solved because
of the smaller number of semantic groups that need to be consulted.
The NOW corpus (News on the Web) contains 16.2 billion words from
newspapers and magazines published on the web between 2010 and the present (the
most recent day is 2022-11-09). More importantly, the corpus grows about 180-200
million words of new data each month (about 300,000 new articles per month) and
about two billion words each year.
13
While other resources like Google Trends show you what people are searching
for, the NOW Corpus is the only structured corpus that shows you what is happening
in the language -- virtually right up to the present time. For example, see the
frequency of words since 2010 and new words and phrases from the last few years. In
this sense, NOW is the most robust monitor corpus of English[11].
It may be defined as the words having same spelling or same form but having
different and unrelated meaning. For example, the word “Bat” is a homonymy word
because bat can be an implement to hit a ball or bat is a nocturnal flying mammal
also.
Polysemy. The Greek term "polysemy" implies "many signals." This word or
phrase has a distinct but connected sense. In other words, polysemy has a similar
14
spelling but a distinct meaning connected to it. For instance, "bank" has several
meanings and is a polysemy.
We created a corpus with the text of British news articles from March 3, 2022 to
November 10, 2022. This is a sample that contains 1000 texts. Additionally, we took
the news of Great Britain as a resource [29]
Perhaps the best use of Virtual Corpora (at least for language learners) is the
ability to quickly and easily generate lists of “keywords” for a given topic. Examples
might be words related to biology in Wikipedia, words from the magazine Astronomy
in COCA, websites in iWeb dealing with endocrinology or solar power, or articles
referring to refugees in NOW.
NOV corpus suggests finding key nouns, adjectives, adverbs and verbs in the
corpus. There is also a key combination search function.
We will choose words based on their frequency. That is, we will choose the
Figure 2.2 BON (British online news)
word that came across most often.
16
Among the verbs in the top 10, all famous words are in high positions. But
The keyverb in British news is the word "feel" It appears 811 times. Also, this
word was in 353 texts out of 1000.
Next come the adjectives. We have added more selection criteria, so adjectives
that are common in all articles will not be included in the top.
The most used adverb in British news is the word "today" It appears 1368
times. Also, this word was in 207 texts out of 1000.
Summarizing, at the end we got the following words: «People», «Feel», «Huge»
and «Today».
18
2.2. The general information about the words, PEOPLE, FEEL, HUGE and
TODAY.
In this part, four words PEOPLE, FEEL, HUGE and TODAY will be studied
in terms of corpus-based analysis. . All of them are used as words of elementary and
intermediet levels of English.
1) Acording to Cambridge Dictionary, the noun PEOPLE have several
meanings[7]:
men, women, and children;
the large number of ordinary men and women who do not have positions
of power in society.
We have taken some sentences from British News 2022 articles that contain
the noun PEOPLE. For instance, the sentence in the Figure 2.8
" It is not future historians but the people of Ukraine who will be our judge. "
PronounsVerbsAdjectivesNouns
The meaning of the statement is, that ukrainians will be judge of all world, not
the future historians. To begin with, this statement is about the war in Ukraine. And
the attitude of the people (mainly the British in this sentence) to this situation.
Boris Johnson, who served as Prime Minister of the United Kingdom, talk
about a situation where it is necessary to provide Ukraine with all possible help. After
all, they will be judged for their indifference not by future historians, but by the
Ukrainian people.
We tried to find out the relationship between the members of the sentence.
[place]
Ukraine
from
Will not be
People Will be
[action] Judge [action]
Historians
[agent] [agent]
Whose?
Our
(... of people from all world)
[Demonstrative]
nation
residents public
citizens persons
inhabitants
Figure 2.10 The synonims of the word «people» in the sense of nation
the public
the crowd the crowd
Figure 2.11 The synonims of the word «people» in the sense of the public
After looking at figures 2.10 and 2.11, we can see that depending on the
context, the word "people" has different synonyms. We cannot say that the words
"crowd" and "inhabitants" have the same meaning. Nevertheless, these two words can
generally be replaced by the noun "people." So, the word "people" is a hypernym.
Moreover, its hyponyms are "residents", "crowd", "nation", and so on.
The word "people" itself has no negative or positive meaning. It all depends on
the word that is next to it. Most often, adjectives and numerals stand next to this
noun. Also, prepositions often stay with the "people". However, the most used is the
tandem noun + verb.
On Figure 2.12 we can see examples of collocations with the word «people».
Melissa thinks more men have felt confident wearing make-up within the last
five years.
AdverbsVerbsAdjectivesNouns
We tried to find out the relationship between the members of the sentence in
more
Figure 2.14 How much?
Of what?
[action]
Thinks that
men
Hfve felt
[agent]
what?
Melissa
When?
wearing make-up
[Person]
confident
[feeling]
In this case, the verb "to feel" means to experience something emotional.
«More men feel confident when wearing makeup». It is the first example of using the
word «to feel» with a different meaning. We found out that this verb can have
23
meaning: experience something physical or emotional. In the first example, the verb
"to feel" is paired with the adjective "confident".
Also, in the picture, we illustrated several examples of the use of this verb together
with other adjectives.
Figure 2.15 Some examples of using the word «people»
I feel like they were on the same level, but one couple was a little bit better for me
Verbs Adjectives Nouns
This sentence is about one of the dance competitions in Great Britain.
Journalists expressed their opinion about one of the pairs of the competition. In
addition to bad news, such as the war in Ukraine or bullying, Britain's mass media
also discuss local events, such as football, beauty contests, or, in this case, a dance
tournament.
On the Figure 2.16 we tried to find out the relationships between the members
of statements.
In this case, the verb "to feel" means the thought or feeling of the author of the
sentence. Instead of saying "In my opinion" or "I think", the journalist used the word
"feel". In his opinion, one of the two pairs of dancers performed better.
24
We considered 2 different cases in which the same word was used in different
meanings. The meaning of the first one is experience something physical or
emotional. The second example means to have a particular opinion about or attitude
towards something
To feel
The verb "to feel" can form a conjunction with any part of speech. The most
To grope To touch
common are nouns, adjectives and adverbs
To experience To perceive
The Figure 2.17 shows several synonyms for the verb "to feel"[27]
Depending on the context of the sentence, the word "to feel" will have a
different meaning. So, the synonyms of this word will also differ; it depends on what
meaning we choose for the word "to feel".
For example, on the one hand, if we mean physical contact when we feel
something, it is better to use the word "to touch". On the other hand, if we mean some
emotional experience or some kind of experience, the correct replacement would be
"to perceive."
We have taken some sentences from British News 2022 articles that contain
the adjective HUGE. For instance, the sentence in the Figure 2.18
honor
TV shows in childhood, dreamed of becoming a professional, and finally, he won.
What?
huge
On the Figure 2.18 we tried to find out the relationships between the members
of statements.
There was a huge amount of communication, and I was with people all the time
Figure 2.19 The sentance
Verbs Adjectives Nouns
The topic that has never disappeared from British news since 2019 is the
coronavirus pandemic. British journalists conducted a survey, and one of the
questions of which was about changing the atmosphere at the workplace. The
respondent answered that there were many people with whom it was possible to talk
26
and consult about work issues. Because of Covid-19, everything has changed, and
everyone is trying to switch to online work. In the sentence above, the word "huge" is
Also, in the Figure 2.20, we illustrated several examples of the use of this
adjective together with nouns.
large
The word "huge" has one meaning - enormous. It does not matter what word
this adjective goes with; it retains its single meaning.
The word "today" is used for one purpose - to indicate time. In articles in
British news, people use this word widely because if some event happened on a
particular day, then journalists usually start the conversation with the date, and the
It's a misnomer from the early days of telescopes when they didn't have anything like
the resolution they have today.
The word "today" has one meaning – a part of time. It does not matter what
word this adverb goes with; it retains its single meaning.
In general, we have learned that PEOPLE, FEEL, HUGE and TODAY appear
in many British news articles. Without these words, the life of the average English
speaker would be difficult. Although there are many synonyms for these words, they
cannot be replaced. The data provided by the corpus shows that every researched
word might be used in particular situations where the other words are not appropriate.
Their positions in a sentence vary, and each of them collocates with absolutely
different terms.
28
Conclusions
In this research paper, we studied the keywords of British online news. The
object of the research was the keywords PEOPLE, FEEL, HUGE and TODAY in
semantic, whereas the subject was the corpus-based semantic study of the used
keywords. Concepts, usages, functions, and meanings of them are examined. The
research explores the usage of the terms at one period of time, namely, the period of
2022 year, and in particular situations so that the problem of misunderstanding of any
language constructions could be resolved, at least partly.
people, then the synonym will be "crowd". Furthermore, if we want to use the word
"people" in the sense of some group with the same interests, "Community" is better.
We investigated with which parts of speech this noun often stands. Most often, the
word "people" is found next to adjectives, verbs, and also often with prepositions.
The second word is the verb "to feel". Our research has given examples of this
word and common word combinations. They also illustrated examples of synonyms
and explored the verb's meaning depending on the sentence and context. The word
"feel" has many meanings and is not difficult to learn, so people often use it
differently.
The word "today" is used very often in the British news. This word is perfect
for describing time. Mostly, the word is at the sentence's beginning or end. In work,
we also gave examples of synonyms. We analyzed the meaning of the adverb in
several sentences depending on the context.
Each word helped to clarify some important events in the life of the British.
The word "people" was often used in the context of refugees and victims of covid,
which explains the significant events in the world. British news focuses on people,
their thoughts, achievements, and experiences. The word "feel" comes up a lot in
articles because people are interested in what others feel. The word "today" indicates
the time in which the events occur.
The corpora made the job a lot easier. In it, we can create a corpus with data
from a specific year and a particular country. In the table with keywords, there is the
number of times using the word. Additionally, there is also the number of text with
30
this word. In addition, it was possible to see the sentences in which this word is used
immediately.
Besides, it became clear that the corpus-based study method may be applied to
English language learning. The notion of collocation might be reinforced in the
learning process as well. By employing this method, English language learners will
be able to operate all words more accurately, improving the quality of their
communication.
References
доступу: https://eprints.lancs.ac.uk/id/eprint/140803/1/Rayson_2019_CorpusAnalysi
sofKeyWords_Submitted.pdf (дата звернення: 16.11.2022). – Назва з екрана.
17. Semantic analysis [Електронний ресурс] // United Nations Economic
and Social Commission for Western Asia. – Режим
доступу: https://archive.unescwa.org/semantic-analysis (дата звернення:
17.11.2022). – Назва з екрана.
18. Scott M. Mapping key words to problem and solution / M. Scott, G.
Thompson. – 2001.
19. Scott M. PC analysis of key words - and key key words / M. Scott //
System. – 1997.
20. Scott M. Stroud: lexical analysis software / M. Scott // WordSmith Tools
version 7. – 2016.
21. Scott M. WordSmith Tools manual / M. Scott // Oxford: oxford
university press. – 1996.
22. Scott M. WordSmith tools manual, version 3.0 / M. Scott // Oxford:
oxford university press – 1998.
23. Scott M. Textual Patterns: key words and corpus analysis in language
education / M. Scott, S. Tribble. – 2000.
24. Scott M. : key words and corpus analysis in language education
[Електронний ресурс] / M. Scott, S. Tribble. – 2006. – Режим доступу: http://tesl-
ej.org/ej42/r4.pdf (дата звернення: 17.11.2022). – Назва з екрана.
25. Semantic analysis in linguistics [Електронний ресурс] // Факультет
іноземних мов. – Режим
доступу: http://foreign-languages.karazin.ua/resources/a6d42430569ca6df5d941cacf
9551100.pdf (дата звернення: 17.11.2022). – Назва з екрана.
26. Synonyms of people | thesaurus.com [Електронний ресурс] //
www.thesaurus.com. – Режим
доступу: https://www.thesaurus.com/browse/people (дата звернення: 17.11.2022). –
Назва з екрана.
34