Professional Documents
Culture Documents
Microcelebrity Around The Globe
Microcelebrity Around The Globe
Abstract
The YouTube affordance of auto-generated textual closed captions (CC) is
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
valued by the YouTube algorithm, and therefore spoken words in vlogs can be
strategically used to optimize and orient videos and channels for search. In their
blog, YouTube suggests that complicity with their desire for rich and accurate
CC is rewarded with algorithmic visibility (YouTube, 2017a). CC metadata are
therefore an example of the significant degree of pressure for vloggers (video
bloggers) on YouTube to optimize their content down to minutia of self-
presentations. In this chapter I analyze the practice of highly visible beauty
vloggers to conceptualize vlogging practices that contribute to algorithmically
readable CC text. I term this labor vlogging parlance. Vlogging parlance
includes keyword stuffing, defined as inserting often-searched-for keywords into
speech. It also encompasses the strategic verbal expressions, language choice,
speech pace, enunciation, and minimization of background noise by vloggers.
Vlogging parlance can be thought of as a microcelebrity (Senft, 2008) technique,
deployed to attract attention and visibility in an information-saturated online
“attention economy,” a system of value in which often “money now flows along
with attention” (Goldhaber, 1997). The call to optimize speech ultimately places
responsibility onto creators to ensure their videos can become visible, while
assisting YouTube in developing search accuracy for their viewers. Further-
more, the Western-centric language affordances of CC, and the high valuation
of English on the YouTube platform, are used as examples of how social media
platforms can underserve differently abled and non-English speaking audiences.
Introduction
To be a successful vlogger (video blogger) on YouTube, one must make oneself
legible to the site’s algorithms. In addition to making visual content appealing,
this necessitates optimizing video metadata. Metadata are defined as “data which
are used in organizing video to facilitate content-based retrieval” (Jain &
Hampapur, 1994: 2). On YouTube this includes textual keyword tags, video titles,
and closed captioning (CC) text. Since 2006, a proportion of videos on YouTube
have been afforded CC, which appear to the viewer as textual subtitles on videos
(Google Video Blog, 2006). CC were developed as auto-generated in 2009, uti-
lizing the same technology as their parent company, Google’s voice recognition
software (Official Google Blog, 2009). CC were launched, in part, to increase
accessibility for deaf and hard of hearing viewers. In each announcement of CC
updates, Google has worked to highlight the potential to improve accessibility,
with each rollout being publicized in blog posts written by a deaf engineer, citing
how they have benefitted personally from this capability. In 2006, the original
rollout was announced in a blog post by deaf Google engineer Ken Harrington,
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
who likened the move to the advent of CC on television. While he lauded the
potential for accessibility in the post, he also noted “the potential applications
here for search quality, automatic translation, and speech recognition should also
become more obvious” (Google Video Blog, 2006). In other words, while
recognizing the accessibility function, Harrington suggests here that auto-
generated CC text is valued as meaningful data and used in increasing accuracy
of video searches on the platform. As of 2017, one billion videos have CC; these
generated transcripts are also utilized to assign relevancy and visibility to You-
Tube videos (Official YouTube Blog, 2017). However, CC capability only
stretches to 10 languages, predominantly spoken in the Global North: English,
Dutch, French, German, Italian, Japanese, Korean, Portuguese, Russian, and
Spanish. This sidelines audiences speaking some of the world’s most widely
spoken languages, such as Arabic, Hindi, and Mandarin Chinese.
On their creator-facing help blog, YouTube suggests that “translated metadata
may increase a video’s reach and discoverability” (YouTube, 2017a). Translated
text is indexed for search, thus vloggers producing videos with translated CC
metadata containing more keywords will become visible for those keywords.
Visibility, or as YouTube puts it, discoverability, takes the form of promotion
through high billing within search rankings, inclusion in automatically generated
rolling playlists, and promotion via personalized “recommended for you” links
embedded within the platform’s interface. In short, visibility means being chan-
neled toward the eyeballs of YouTube’s users. Searching for a “smoky eye”
tutorial will return with videos titled and tagged with “smoky eye,” but also vlogs
that feature beauty YouTubers repeatedly stating the words “smoky eye” aloud,
clearly and in a translatable manner, and in a CC recognized language. This
chapter will look to how this platform affordance can be seen to influence the
speech of popular vloggers, and will consider the wider implications for
inequalities on YouTube.
the same words or phrases so often that it sounds unnatural”: it involves the
determination of popular keywords using analytics software, and packing
them, traditionally into textual website copy, multiple times to be read easily
by search engines (Google, 2017). In addition to keyword stuffing, I argue
speech designed to be readable can be termed vlogging parlance. Vlogging
parlance is broadly defined as the strategic verbal expressions, language choice,
speech pace, enunciation, and minimization of background noise by vloggers
that are informed by a desire to optimise platform visibility, in part through
generating accurate auto-translated CC metadata. The deliberate and consid-
ered uses of textual tags for visibility have been considered by Zappavigna
(2015), who outlines the practice of “searchable talk” among Twitter users.
They demonstrate that Twitter hashtags are used to catalogue tweets
(i.e., #breakingbad), to add interpersonal metacommentary (i.e., #sad), and as
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
in gossip tabloids (Abidin, 2015b: 3). These A List vloggers could also be accu-
rately conceptualized using Abidin’s definition of influencers:
when a video may ostensibly be on another topic. Beauty and lifestyle tags are
applied, no matter the video topic or genre, including videos about Easter
cupcake baking (Zoella, 2017a) or teenage friendships and periods (Zoella,
2017b). In addition to aggregating her content, this protects Zoella’s own personal
brand and trademarks by continuously reinforcing the relationship between her
band name and beauty and lifestyle content.
The labor and strategy behind tagging practices should be understood in the
context of the desire for, and performance of, visibility. Vloggers disclosed to me
in interviews they ran their vlog ideas through optimization tools such as Google
AdWords to determine how many times keywords relating to their topic idea
were searched for, and to determine busy search periods. The data they found
then informed which videos they decided to make, which keywords to use, and
which of these will likely be successful during which seasons. Similarly, the
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
this video genre is perhaps due to the capability to purchase many pieces of on-
trend clothing cheaply in the store, meaning this is a low-cost and seasonally
replicable genre. Primark has even embraced this YouTube-centered attention
by partnering with beauty vlogger GabriellaRose to release a homeware
collection in 2015 (Barns, 2015). The video “Huge Disastrous Primark Haul,”
produced by Zoella, illustrates the practice of keyword stuffing by “A List”
beauty vloggers (Zoella, 2016). In this video, Zoella holds up each item and
slows down her speech, carefully and crisply pronouncing keywords. She
includes a description of the seasonal change: “autumn transitional dress,” “the
true spirit of autumn,” and “more of a winter item,” keywords that are asso-
ciated with seasonal visibility. Zoella ensures she describes the style, length, and
color of each and every one of her purchases in search engine–ready discursive
patterns. She holds up a “navy midi dress,” a “striped maxi dress,” “black, flat
ankle boots” and a “burgundy corduroy pinafore.” She organizes her video
carefully, beginning with dresses and methodically moving on to various trou-
sers, shoes, and accessories, readying and neatly grouping together generically
related keywords.
The turn toward the valuation of CC metadata illustrates the significant
degree of pressure for vloggers to optimize their videos down to the seemingly
microperformances of the self. Through published creator resources, YouTube
suggests that complicity with their desire for rich and accurate metadata is
rewarded with algorithmic visibility. This ultimately places responsibility onto
creators on the platform to ensure their videos are optimized, while assisting
YouTube in developing search accuracy for their viewers. The vlogger is caught
in a double bind: they must both make themselves algorithmically visible, while
also appearing as authentic and genuine. Authenticity is highly valued by fans
and industry in influencer markets, including vlogging (see Abidin & Ots 2016;
Duffy, 2017; Marwick, 2013; Tolson, 2010). Indeed, one of the anchoring
components of influencer self-brands is their distinctiveness from mainstream
or traditional fashion and beauty houses. Despite the calculated style of
pronunciation performed throughout her Primark Haul, it is clear that
Zoella labors to imbue this video with an undercurrent of this performed
authenticity. In this context, a performance of authenticity can be considered
28 Sophie Bishop
video with an amateur quality that is consistent with the home-grown beauty
vlogging genre. Zoella uses “strategic intimacy” in an address to her more devoted
fans (Marwick, 2015). Due to the closely cultivated personal relationships
through her vlogs, and other social media platforms, viewers are both aware of
and somewhat invested in her relationship with vlogger Alfie Deyes. They also
know that one of the dogs in the background is not hers, and belongs to his sister.
In a chatty, familiar address she asks her audience to remind her that she
shouldn’t buy another pug “if you could all just remind me, after this, what it
would be like to have two dogs… or let Alfie know cos he’s the one that wants a
second dog already.” Speaking to her audience, as she would do a friend, rein-
forces the perceived accessibility and performance of “ordinariness” of the
YouTube star. It is important to recognize that the transgressions in the blooper
reel are cut together and set aside from the deluge of crisply pronounced keywords
in the video body. Despite the video title hinting that in the “Primark haul,” the
so-termed “disastrous” interruptions of pug fights and inconsistent sunlight are
confined to the video opening, the rotation of clearly pronounced products runs
smoothly. Ultimately, the video disasters do not hinder, overpower, or mask the
keywords or the crisp pronunciation on “Primark autumnal patterned shirt”
(Zoella, 2016).
production of metadata for search engine optimization, and they do not clearly
serve the needs of deaf and hard-of-hearing audiences” (Ellcessor, 2012: 333).
Auto-generated CC rarely encompass what would traditionally be considered CC.
In my sample, these translations were overwhelmingly evocative of textual sub-
titles, rather than accessibility-orientated CC. The predominate distinction is CC
are designed for deaf or hard-of-hearing viewers and include textual descriptions
of music and nondiagetic and diagetic sounds. The platform’s continuation to
term this textual translation affordance CC is therefore misleading and under-
serves differently-abled audiences. Furthermore, as highlighted earlier, CC
capability only stretches to 10 languages, predominantly spoken in the Global
North: English, Dutch, French, German, Italian, Japanese, Korean, Portuguese,
Russian, and Spanish. This sidelines audiences speaking three of the world’s most
spoken languages, including Arabic, Hindi, and Mandarin Chinese, both for
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
Conclusion
YouTube’s CC and algorithmic affordance are intended, at least in part, for a
generation of textual metadata. Textual subtitles auto-generated by YouTube are
not traditional CC, which identify and translate noise, music, and sounds.
Speaking to the valuation of translated metadata, this chapter has offered a
definition of the spectacular, slow, and careful pronunciations of popular and
desirable search keywords that I have defined as vlogging parlance. I argue
vlogging parlance is an identifiable and visible trend in recent high-profile beauty
vlogging output, which is shaped by assumptions and realities of how commercial
algorithms read and translate audio data. Building on the strategic use of textual
tags as a visibility strategy, and taking into account vloggers’ affective relation-
ships with the YouTube platform, the trend toward “keyword stuffing” can be
identified as an awareness of, and a response to, the high value attached to
commercial and advertising relevancy on the platform. The case study of Zoella’s
30 Sophie Bishop
Disastrous Primark Haul makes clear the tensions between producing content
that is algorithmically legible and the maintenance of valuable authentic micro-
celebrity self-branding. Furthermore, the Western-centric language affordances of
CC and the high valuation of English on the YouTube platform are examples of
how differently-abled and non-English speaking audiences and content creators
are underserved on YouTube.
References
Abidin, C. (2015a). Communicative ❤ Intimacies: Influencers and perceived inter-
connectedness. Ada: A Journal of Gender, New Media, & Technology, 8. https://
www.academia.edu/16366449/Abidin_Crystal._2015._Communicative_Intimacies_
Influencers_and_Perceived_Interconnectedness._Ada_A_Journal_of_Gender_New_
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
Duffy, B. (2016). The romance of work: Gender and aspirational labour in the digital
culture industries. International Journal of Cultural Studies, 19(4), 441–457.
Duffy, B. E. (2017). (Not) Getting paid to do what you love: Gender, social media, and
aspirational work. New Haven, CT: Yale University Press.
Ellcessor, E. (2012). Captions on, off, on TV, online: Accessibility and search engine
optimization in online closed captioning. Television & New Media, 13(4),
329–352.
Garcı́a-Rapp, F. (2016). The digital media phenomenon of YouTube beauty gurus:
The case of bubzbeauty. International Journal of Web Based Communities, 12(4),
360.
Glamour. (2016). How to vlog your way to a million pounds. Glamour UK. http://
www.glamourmagazine.co.uk/article/how-to-be-a-youtube-millionaire. Accessed
March 24, 2017.
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
Goldhaber, M. H. (1997). The attention economy and the net. First Monday, 2(4).
http://firstmonday.org/ojs/index.php/fm/article/view/519. Accessed October 3,
2017.
Google. (2017). Irrelevant keywords—Search console help. https://support.google.com/
webmasters/answer/66358?hl5en. Accessed May 23, 2017.
Google Video Blog. (2006). Finally, caption playback. Google Video Blog. http://
googlevideo.blogspot.com/2006/09/finally-caption-playback.html. Accessed May
15, 2017.
Jain, R., & Hampapur, A. (1994). Metadata in video databases. ACM Sigmod Record,
23(4), 27–33.
Jerslev, A. (2016). Media times. In the time of the microcelebrity: Celebrification and
the YouTuber zoella. International Journal of Communication, 10, 19.
Marwick, A. (2013). “They”re really profound women, they’re entrepreneurs’: Con-
ceptions of authenticity in fashion blogging. In 7th international AIII conference on
weblogs and social media (ICWSM), July. http://www.tiara.org/blog/wp-content/
uploads/2016/04/amarwick_fashionblogs_ICWSM_2013.pdf. Accessed October 6,
2017.
Marwick, A. E. (2015). Instafame: Luxury selfies in the attention economy. Public
Culture, 27(1 75), 137–160.
Marwick, A. E., & boyd, d. (2011). I tweet honestly, I tweet passionately: Twitter
users, context collapse, and the imagined audience. New Media & Society, 13(1),
114–133.
Official Google Blog. (2009). Automatic captions in YouTube. Official Google Blog.
https://googleblog.blogspot.com/2009/11/automatic-captions-in-youtube.html.
Accessed October 6, 2017.
Official YouTube Blog. (2017). One billion captioned videos. Official YouTube Blog.
https://youtube.googleblog.com/2017/02/one-billion-captioned-videos.html.
Accessed October 6, 2017.
Page, R. (2012). The linguistics of self-branding and micro-celebrity in Twitter: The
role of hashtags. Discourse & Communication, 6(2), 181–201.
Senft, T. M. (2008). Camgirls: Celebrity and community in the age of social networks.
Digital Formations (Vol. 4). New York: Lang.
Tate, B. (2017). Weekly vlog #117 | New Hair! | Brogan Tate. https://www.youtu-
be.com/watch?v5Kzthps2n9gE.
32 Sophie Bishop
Telegraph. (2014). Meet the 21 year old YouTuber who made millions playing video
games. The Telegraph, October 16. http://www.telegraph.co.uk/men/the-filter/
meet-the-21-year-old-youtuber-who-made-millions-playing-video-games/. Accessed
February 24, 2017.
Tolson, A. (2010). A new authenticity? Communicative practices on YouTube.
Critical Discourse Studies, 7(4), 277–289.
YouTube. (2017a). YouTube tools to translate your content—YouTube help. https://
support.google.com/youtube/answer/4792576?hl5en. Accessed May 22, 2017.
YouTube. (2017b). YouTube partner programme overview—YouTube help. https://
support.google.com/youtube/answer/72851?hl5en-GB. Accessed May 23, 2017.
Yuya. (2017). 5 Peinados Para Escuela/Trabajo♥ (Fácil)—Yuya. https://
www.youtube.com/watch?v5lBexzuT4GLc&t549s.
Zappavigna, M. (2015). Searchable talk: The linguistic functions of hashtags. Social
Semiotics, 25(3), 274–291.
Downloaded by University of Otago At 22:48 17 December 2018 (PT)
1. Alice E. Marwick. The Algorithmic Celebrity: The Future of Internet Fame and
Microcelebrity Studies 161-169. [Citation] [Full Text] [PDF] [PDF]
2. . Microcelebrity Around the Globe . [Abstract] [PDF] [PDF]
Downloaded by University of Otago At 22:48 17 December 2018 (PT)