Professional Documents
Culture Documents
Jones 2008 The Role of Text in Televideo Cybersex
Jones 2008 The Role of Text in Televideo Cybersex
Jones 2008 The Role of Text in Televideo Cybersex
RODNEY H. JONES
Abstract
1. Introduction
This paper focuses on the ways verbal messages are combined with bodily
displays in televideo cybersex between gay men—what is referred to by
my participants as ‘cam sex’ or ‘cam fun’. Although what will be de-
scribed here might seem rather removed from the experience of many,
looking at this kind of interaction, I will argue, can bring us closer to un-
derstanding how, in everyday life, the body functions as both an acting
subject and an objective ‘display’, and how the subjective and objective
qualities of the body change when it is combined with other modes such
as spoken and written language. As more and more human interaction is
displays one finds in strip shows, peep shows, and the masturbatory dis-
plays gay men engage in in public sex venues such as saunas and public
toilets, the exchange of verbal messages is not always evident and some-
times prohibited. In televideo cybersex, however, the regular exchange of
verbal messages is practically obligatory.
Research into the role verbal messages play in other forms of multimo-
dal interaction has resulted in a number of observations applicable to this
study. The first is the notion that di¤erent modes function di¤erently in
texts and interactions partly because these modes themselves embody cer-
tain ‘a¤ordances’ and ‘constraints’ regarding the kinds of communication
for which they can be employed. According to Kress and Van Leeuwen
(1996, 2001), di¤erent modes work to structure and constrain meaning-
making practices and construct participants’ orientations toward reality
and each other. Among the chief di¤erences between verbal and visual
modes, they say, is that verbal texts work within the logic of time, orient-
ing readers (or hearers) toward causality, and images operate within the
logic of space, orienting viewers toward spatial analytic perspectives.
One useful way of approaching the di¤erent meaning-making potential
of di¤erent modes is through the lens of Halliday’s three metafunctions of
language: the ideational, in which language is used to depict a state of
a¤airs, the interpersonal, in which language is used to construct the rela-
tionship between sender and receiver, and the textual, in which language
contributes to the organization and structure of messages. Stöckl (2004)
suggests that while all modes are theoretically capable of performing any
of these metafunctions, in di¤erent communicative events they may be
distributed across modes in an unequal way depending on how they can
be realized most e‰ciently.
One central point of such work is that modes do not function inde-
pendently, that verbal messages work together with other modes in texts
and interactions in an integrated way in which the meanings they create
together are more than just the sum of the meanings they create sepa-
rately (Kress and Van Leeuwen 1996). For Barthes (1977), for example,
one of the key ways that words interact with images is by ‘anchoring’ or
fixing certain meanings in them, defining their terms of reference or point
of view from which they are to be interpreted.
Whereas in printed texts, words are more often seen as functioning to
‘anchor’ or contextualize images, in face-to-face interaction, gestures,
posture, gaze, and other nonverbal communication are more often seen
as resources with which speakers contextualize words, and so such cues
are considered by Gumperz (1982) as part of a broader class of signals
he refers to as contextualization cues. Such cues a¤ect interaction in a
variety of ways. One way involves the ideational or referential plane of the
Televideo cybersex 457
That is not to say that participants never show their faces, but when they
do this display is often brief and usually preceded by careful negotia-
tion (see below), and it often signals a change in framing and footing
(Go¤man 1974, 1981), a shift from cybersex to some other activity such
as casual chatting, for example. This relative absence of the face as a
communicative tool, I will argue, is an important factor in understanding
the role of written text in these interactions.
4.1. Presence
For any interaction to proceed successfully, interactants must continually
communicate and monitor mutual attention toward each other in order
to maintain a sense of co-presence, which is central to what Go¤man
(1981) calls a ‘state of talk’. In face-to-face communication, this is usually
achieved through bodily orientation and gaze (Kendon 2004). Since par-
ticipants in televideo cybersex usually do not show their faces, the gaze is
typically not available, and participants’ physical bodies are usually ori-
ented toward themselves (i.e., in masturbation). Therefore, the function
of maintaining this sense of co-presence is transferred almost entirely to
the mode of text.
According to my participants, if one partner fails to type a message for
an extended period of time, his partner is likely to feel ignored, no matter
what kind of bodily display he is being o¤ered. ‘If the guy doesn’t type,
how do I know he’s there?’ This is further confirmed by the fact that par-
ticipants typically ‘prompt’ one another with utterances like ‘hey’ and
Televideo cybersex 461
4.2. Timing
Another important function of text is in regulating timing and creating a
sense of conversational synchrony. In face-to-face conversation, gestures,
posture, and paralinguistic cues play a key role in regulating the sequen-
tial and temporal organization of interaction and giving participants the
feeling that they are ‘in synch’ with one another (Condon 1986; Goodwin
2002; Gumperz 1982). In text-based computer-mediated communication,
conversational synchrony is just as important as it is in face-to-face con-
versation. In an earlier examination of text-based cybersex (Jones 2005b),
I noted how users who were successful in engaging partners the longest
tended to be those who were able to establish a regular rhythm of sending
and receiving messages with their partners, and that this rhythmicity was
part of the pleasure participants associated with the activity.
In the case of text-based cybersex, such rhythmic coordination is ac-
complished entirely though the mode of text, dependent upon things like
typing speed and the length of pauses between messages. In televideo
cybersex there is the added dimension of the body through which infor-
mation about timing can be sent. Participants, for example, can regulate
their verbal contributions based on the actions of the other, delaying, for
462 Rodney H. Jones
example, their own message while the other is visibly typing. They can also
coordinate the pace of their bodily actions based on visual observation of
the other. Even with these visual cues, however, the speed and frequency
of verbal messages still plays an important role in the maintenance
of conversational synchrony as well as in signaling di¤erent phases in the
interaction.
To illustrate this I have plotted the length of pauses between turns in
one of the interactions from my data (see Figure 1). As can be seen, in
the initial stages the time between turns is very short as users exchange
greetings and initial information, and then it lengthens slightly after they
‘get down to business’. What is most striking here is the regularity of
pause length that is maintained throughout this middle phase, with typed
messages being issued at a fairly constant rate of between 5 and 10 sec-
onds, mimicking the kind of rhythmic regularity one associates with
actual sex and helping the participants construct what Prior and Shipka
(2003: 230–231) call ‘embodied chronotropes’, which add further to the
‘tone and feel’ one associates with physical presence.
Goodwin (2002) points out that in multimodal interaction participants
orient to multiple orders of temporality simultaneously, with di¤erent
modes used to create di¤erent forms of temporal and sequential organiza-
tion. In the case of televideo cybersex, the rhythmic back and forth of text
provides a temporal context for the more rapid sexualized rhythms of
bodily movements involved in the masturbatory display. At the same
time, rhythm on these two time scales exists in a complementary relation-
Televideo cybersex 463
(1) A: Cum?
B: OK
A: Ready?
B: u?
A: yeahhh
B: cumming?
A: u first
B: your cum gets mine.
Often in such cases the camera only lingers in this region briefly, and,
as in this exchange, textual messages like ‘ok thanks’ are used to mark the
end of the exposure.
4.4. Framing
refusal. The most likely reason for this is the absence of the face as a com-
municative tool. In face-to-face interaction, one watches and reacts with
one’s gaze and facial expressions. In face-to-face sexual encounters, espe-
cially mutual masturbatory displays, the face plays a similar role, signal-
ing responses or reactions to the other’s performance. In televideo cyber-
sex, on the other hand, where the face is usually not available, one
watches with one’s words.
Of course, there is a kind of ambiguity and polyfunctionality to these
visual and verbal moves. A reaction, for example, can also be regarded
as a kind of performance, and, because of the code of reciprocity, each
visual display is also an implicit demand that the other party produce a
similar display. The combination of visual and verbal modes also allows
users to take up di¤erent positions in di¤erent modes, making a visual
o¤er of a particular body part, for example, while verbally issuing a de-
mand that one’s partner does the same.
5. Conclusion
meaning is delivered through images, and the words serve more of a con-
textualizing and regulating function. One performs with one’s body. One
watches with one’s words. Jointly employing both modes allows users
to simultaneously present themselves as objects and to exert agency as
subjects.
One of the chief reasons for this phenomenon is the absence of the face
as a communicative tool. One might say that in televideo cybersex, while
the image is the body, the text is the face. Like the face in face-to-face in-
teraction, text functions as an emblem of selfhood and agency to give to
the experience the feeling of being truly interactive. Thus, to use Waskul’s
(2003) terminology, while participants are ‘embodied’ in the visual images
they broadcast, it is through the words that they type that they are
‘enselfed’.
Another important observation to come from this analysis is how dif-
ferent modes serve not just to elaborate one another, but also to regulate
one another. One of the chief roles of text in these interactions is to allow
participants to manage more precisely their measured and incremental
visual displays. Televideo cybersex is just as much about what one does
not show as what one does. Visual messages come as carved up body
parts rather than complete persons, and verbal messages are stripped of
the paralinguistic cues of audible talk, and it is this ‘semiotic minimalism’
of both visual and verbal modes that helps to make these encounters so
exciting for users by leaving space for them to weave complex fantasies
from a limited set of cues.
In their search for something which one of my informants described as
‘more real than pornography and less real than reality’, participants in
televideo cybersex deploy the verbal mode of communication in di¤erent
ways and for di¤erent purposes than it is deployed in other kinds of inter-
action like text-based computer chat, and most face-to-face conversation
and face-to-face sexual interaction. At the same time, there are also simi-
larities. In nearly all forms of interaction there is an element of bodily
‘display’ (Go¤man 1959), and even users of text-based chat often engage
in textual descriptions of their bodies (Jones 2005a, 2005c). Furthermore,
there are numerous contexts of face-to-face interaction, especially in the
workplace, in which verbal messages do function in similar ways to con-
textualize bodily actions and help regulate pace and rhythm. Examples
can be seen in Nevile’s (2004) descriptions of interactions between pilots
in commercial airliners, and in Filliettaz’s (2005) observations of multi-
modal interactions in a factory. Indeed, much more work needs to be
done to understand bodily displays in general and the ways they are man-
aged in various configurations of time and space using discourse. Finally,
this work also invites a closer consideration of the use of verbal messages
Televideo cybersex 471
Note
1. ‘An ethnographic study of computer mediated communication among gay men in Hong
Kong’, City University of Hong Kong Small Scale Research Grant #9030988 (http://
personal.cityu.edu.hk/~en-cyber/home.htm). An earlier version of this paper was pre-
sented the Third International Conference on Multimodality, 25–27 May 2006, Pavia,
Italy.
References
Barcan, R. (2002). In the raw: ‘Home-made’ porn and reality genres. Journal of Mundane
Behavior 3 (1 February). URL: 3http://www.mundanebehavior.org/issues/v3n1/barcan.
htm4 [accessed on 3 October 2006].
Barthes, R. (1977). Image–Music–Text. London: Fontana.
Clatts, M. C. (1999). Ethnographic observations of men who have sex with men in public. In
Public Sex/Gay Space, W. Leap (ed.), 141–156. New York: Columbia University Press.
Condon, W. S. (1986). Communication: Rhythm and structure. In Rhythm in Psychological,
Linguistic and Musical Processes, J. R. Evans and M. Clynes (eds.), 55–78. Springfield,
IL: Charles C. Thomas.
Daft, R. L. and Lengel, R. H. (1984). Information richness: A new approach to managerial
behavior and organizational design. In Research in Organizational Behavior, L. L. Cum-
mings and B. M. Staw (eds.), 191–233. Homewood, IL: JAI Press.
Eggins, S. and Slade, D. (1997). Analyzing Casual Conversation. London: Cassell.
Filleittaz, L. (2005). Time, rhythm and multiactivity: Contextualizing teamwork. A paper
presented at the 9th International Pragmatics Conference, 10–15 July, Riva del Garda,
Italy.
Foore, K. A. (2004). Through the looking glass: Constructing sexual identity. Unpublished
M.A. thesis, University of Alaska, Fairbanks.
Go¤man, E. (1959). The Presentation of Self in Everyday Life: New York: Anchor Double-
day.
Go¤man, E. (1974). Frame Analysis. Cambridge: Harvard University Press.
Go¤man, E. (1981). Forms of Talk. Philadelphia: University of Pennsylvania Press.
Goodwin, C. (1994). Professional vision. American Anthropologist 96 (3): 606–633.
Goodwin, C. (2002). Time and action. Current Anthropology 43 (Suppl.): 19–35.
Gumperz, J. (1982). Discourse Strategies. Cambridge: Cambridge University Press.
Halliday, M. A. K. (1994). An Introduction to Functional Grammar, 2nd ed. London:
Edward Arnold.
Humphreys, L. (1970). Tearoom Trade: Impersonal Sex in Public Places. Chicago: Aldine.
Iedema, R. (2001). Resemiotization. Semiotica 137 (1–4): 23–39.
Iedema, R. (2003). Multimodality, resemiotization: Extending the analysis of discourse as
multi-semiotic practice. Visual Communication 2 (1): 29–57.
472 Rodney H. Jones
Jones, R. (2005a). ‘You show me yours, I’ll show you mine’: The negotiation of shifts from
textual to visual modes in computer mediated interaction among gay men. Visual Commu-
nication 4 (1): 69–92.
Jones, R. (2005b). Rhythm and timing in computer mediated communication. A paper
presented at the 9th International Pragmatics Conference, 10–15 July, Riva del Garda,
Italy.
Jones, R. (2005c). Sexual risk and the Internet. A paper presented at the Language and
Global Communication Conference, 7–9 July, Cardi¤, Wales.
Jones, R., Yu, K. K., and Candlin, C. N. (2000). A preliminary study of HIV vulnerability
and risk behavior among MSM in Hong Kong. Report to the Council for the AIDS
Trust Fund, Hong Kong. URL: 3http://personal.cityu.edu.hk/~enrodney/Research/
MSM/ MSMindex.html4 [accessed on 25 September 2006].
Kendon, A. (2004). Gesture: Visible Action as Utterance. Cambridge: Cambridge University
Press.
Kress, G. and Van Leeuwen, T. (1996). Reading Images: The Grammar of Visual Design.
London: Routledge.
Kress, G. and Van Leeuwen, T. (2001). Multimodal Discourse: The Modes and Media of
Contemporary Communication. London: Edward Arnold.
Lemke, J. L. (1987). Strategic deployment of speech and action: A sociosemiotic analysis.
In Semiotics: 1983, Proceedings of the Semiotic Society of America ‘Snowbird’ Conference,
J. Evans and J. Deely (eds.), 67–79. New York: University Press of America.
Lemke, J. L. (1998). Multiplying meaning: Visual and verbal semiotics in scientific text.
In Reading Science: Critical and Functional Perspectives on Discourses of Science, J. R.
Martin and R. Veel (eds.), 87–113. London: Routledge.
Lewis, P. H. (1998). Videoconferencing’s killer app may be sex. The New York Times 16
July: sec. G, p. 7, col. 1.
Mantovani, F. (2001). Cyber-attraction: The emergence of computer-mediated communica-
tion in the development of interpersonal relationships. In Say Not to Say: New Perspec-
tives on Miscommunication, L. Anolli, R. Ciceri, and G. Riva (eds.), 229–246. Amsterdam:
IOS Press.
McLellan, H. (1996). Virtual realities. In Handbook of Research for Educational Communi-
cations and Technology, D. H. Jonassen (ed.), 457–487. New York: Macmillan Library
Reference.
Molitor, F., Facer, M., and Ruiz, J. D. (1999). Sex communication and unsafe sexual behav-
ior among young men who have sex with men in California. Archives of Sexual Behavior
28 (4): 335–344.
Nevile, M. (2004). Beyond the Black Box: Talk-in-Interaction in the Airline Cockpit. Alder-
shot: Ashgate.
Norris, S. (2004). Analyzing Multimodal Interaction: A Methodological Framework. London:
Routledge.
Prior, P. and Shipka, J. (2003). Chronotopic lamination: Tracing the contours of literate
activities. In Writing Selves/Writing Societies: Research from Activity Perspectives, C.
Bazerman and D. Russell (eds.), 182–238. Fort Collins, CO: The WAC Clearinghouse
and Mind, Culture, and Activity.
Quina, K., Harlow, L., Moroko¤, P. J., Burkenholder, G., and Deiter, P. J. (2000). Sexual
communication in relationships: When words speak louder than actions. Sex Roles:
A Journal of Research April. URL: 3http://findarticles.com/p/articles/mi_m2294/is_
2000_April/ai_655767104 [accessed on 25 September 2007].
Raudaskoski, P. (1999). The use of communicative resources in language technology envi-
ronments. Unpublished doctoral dissertation, University of Oulu, Oulu.
Televideo cybersex 473
Reinish, J. M. (1991). The Kinsey Institute New Report on Sex. New York: St. Martin’s
Gri‰n.
Sarangi, S. and Slembrouck, S. (1996). Language, Bureaucracy and Social Control. London:
Longman.
Short J., Williams, E., and Christie, B. (1976). The Social Psychology of Tele-communica-
tions. New York: John Wiley & Sons.
Sinclair, J. M. and Coulthard, R. M. (1975). Towards an Analysis of Discourse. Oxford:
Oxford University Press.
Sproull, L. and Kiesler, S. (1986). Reducing social context cues: Electronic mail in organiza-
tional communication. Management Science 32 (11): 1492–1512.
Stanton, L. (2006). Talking Dirty: Learning to Speak the Language of Lust. San Francisco:
Chronicle Books.
Stöckl, H. (2004). In between modes: Language and image in printed media. In Perspectives
on Multimodality, E. Ventola, C. Charles, and M. Kaltenbacher (eds.), 9–30. Amsterdam:
John Benjamins.
Stone, A. R. (1996). The War of Desire and Technology at the Close of the Mechanical Age.
Cambridge, MA: MIT Press.
Walther, J. B. (1996). Computer-mediated communication: Impersonal, interpersonal and
hyperpersonal interaction. Communication Research 23: 3–43.
Walther, J. B. and Parks, M. R. (2002). Cues filtered out, cues filtered in: Computer-
mediated communication and relationships. In Handbook of Interpersonal Communica-
tion, M. L. Knapp and J. A. Daly (eds.), 529–563. Thousand Oaks, CA: Sage.
Waskul, D. D. (2002). The naked self: Being a body in televideo cybersex. Symbolic Interac-
tion 25 (2): 199–227.
Waskul, D. D. (2003). Self-Games and Body-Play: Personhood in On-line Chat and Cyber-
sex. New York: P. Lang.