Professional Documents
Culture Documents
Information Complexity in Air Traffic Control Displays: DOT/FAA/AM-07/26
Information Complexity in Air Traffic Control Displays: DOT/FAA/AM-07/26
Information Complexity in Air Traffic Control Displays: DOT/FAA/AM-07/26
Information Complexity in
Air Traffic Control Displays
Jing Xing
Civil Aerospace Medical Institute
Federal Aviation Administration
Oklahoma City, OK 73125
September 2007
Final Report
NOTICE
Xing J
9. Performing Organization Name and Address 10. Work Unit No. (TRAIS)
12. Sponsoring Agency name and Address 13. Type of Report and Period Covered
Air traffic controllers typically use visual displays to interact with various automation systems. Automation tools
are intended to reduce controller task load, but they may also create new tasks associated with acquiring,
integrating, and utilizing information from displays. Consequently, the complexity of information displayed may
reduce the efficiency and effectiveness of an automation system. Moreover, complexity could cause controllers to
miss or misinterpret visual data, thereby reducing safety. Thus, information complexity in air traffic control
(ATC) displays represents a potential bottleneck in ATC systems. To evaluate the cost and benefit of an
automation system, it is important to understand whether the information it provides is too complex for
controllers to process. The purpose of this study was to answer three basic questions: 1) What constitutes
information complexity in automation displays? 2) What level of display complexity is “too complex” for
controllers? 3) Can we objectively measure information complexity in ATC displays? In this study, we first
developed a general framework for measuring information complexity. The framework reduces the concept of
complexity into three underlying factors: quantity, variety, and the relations between basic information elements;
each factor is evaluated at three generic stages of human information processing: perception, cognition, and action.
By this definition, we decompose complexity into a 3x3 matrix, measuring the effects of a complexity factor on
information processing at a given stage. We then take the following steps to develop complexity metrics for ATC
displays: 1) Identify task requirements of using the displays in ATC; 2) Determine corresponding brain functions
pertinent to the task requirements; and 3) Choose the metric that can measure the effects of the complexity
factor on the brain functions. Using this approach, we developed nine metrics of ATC display complexity. These
metrics provide an objective method to evaluate automation displays for acquisition evaluation and design
prototypes.
17. Key Words 18. Distribution Statement
Information Complexity, Display, Interface Design, Document is available to the public through the
Evaluation, Air Traffic Control Defense Technical Information Center, Ft. Belvior, VA
22060; and the National Technical Information
Service, Springfield, VA 22161
19. Security Classif. (of this report) 20. Security Classif. (of this page) 21. No. of Pages 22. Price
Unclassified Unclassified 19
Form DOT F 1700.7 (8-72) Reproduction of completed page authorized
i
ACKNOWLEDGMENTS
The author thanks Dr. Carol Manning for her consultations throughout this study. Thanks, also,
to Drs. Lawrence Bailey, Ben Willems, Pam Della Roccco, and Earl Stein for their suggestions and
advice. Additional thanks go to Dino Piccione, Steve Cooley, and Paul Krois for their support of
the Federal Aviation Administration’s requirements for information complexity research. Finally,
the author expresses appreciation for the Federal Aviation Administration internal reviewers, whose
comments greatly improved the quality of this paper.
iii
CONTENTS
INTRODUCTION. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
IDENTIFYING COMPLEXITY METRICS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
Understanding Information Complexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
A Framework for Decomposing Factors of Information Complexity . . . . . . . . . . . . . . . . . . . . . 3
Metrics of Information Complexity for ATC Displays . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Perceptual Complexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Cognitive Complexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Action Complexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
A PRELIMINARY CASE STUDY: ASSESSING THE COGNITIVE COMPLEXITY OF
THE MICROSOFT POWERPOINT INTERFACE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Representational Complexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Processing Complexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
DISCUSSION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Perceptual Complexity in the Literature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Comparisons With Other Measures of Cognitive Complexity in the Literature . . . . . . . . . . . 12
Relevant Measures of Action Complexity in the Literature . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
CONCLUSIONS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
v
INFORMATION COMPLEXITY IN AIR TRAFFIC CONTROL DISPLAYS
Stimulus
Working memory
Time
Figure 2: Activity patterns over time for information processing in the three stages.
3
Table 1: Metrics of information complexity
Quantity
Variety
Relation
1, with the rows being the three complexity factors and Perceptual Complexity
columns being the three stages. Therefore, we decompose We derived the following basic perceptual tasks for
complexity into nine metrics. Each metric describes the using generic ATC displays. The generic tasks include:
effect of the complexity factor on information processing 1) Detect critical messages; 2) Search for data of a given
at a given stage, so it is associated with the operator’s task category; and 3) Scan / rapidly read graphic patterns and
performance. For example, one of the metrics that we text. The perceptual functions involved in these tasks
proposed is the degree of clutter (described in detail later). include target pop-out (Pop-out means that a visual target
Clutter occurs when the perception of a central target is can be effortlessly detected irrespective of the amount
masked by the presence of overlaying and immediately of surrounding visual materials), detection, search, seg-
surrounding stimuli. Clutter directly affects the speed and mentation, text reading, etc. To drive measurements of
accuracy of text reading and target detection. With the the effects of the complexity factors on the performance
known capacity limits of brain information processing, of these tasks, we need to understand the underlying
these metrics can elucidate why a visual display can be mechanisms of these functions.
too complex for human operators. Given the principle The mechanisms of the above visual tasks can be
that complexity is constrained by task requirements, the described with the well-known two-step model of visual
format of each metric may vary with different applica- information processing. Figure 3 illustrates the model
tions. Next, we used this framework to develop metrics in which the visual system processes information in two
specifically for ATC displays. steps. In the first step, a visual image is segmented into
distinctive visual objects, and salient targets pop out of
Metrics of Information Complexity for ATC Displays the image. This step involves parallel processing. Based
Since complexity is constrained by task requirements, on the results of the parallel processing, the second step
developing metrics for ATC displays needs to consider the involves the visual system serially focusing on the salient
generic tasks associated with using the displays. Therefore, targets or selected objects so that information can be ana-
we used the following steps to develop the metrics: lyzed in detail. Next we evaluated the three complexity
• Identify task requirements; factors in this perceptual model to derive the metrics of
• Determine corresponding brain functions pertinent perceptual complexity.
to the task requirements; and Quantity evaluated by perception. According to the
• Choose the metric that can measure the effects of the perceptual model, the quantity factor does not affect im-
complexity factor on the brain functions. age segmentation and target pop-out due to their parallel
processing. However, it does affect the serial processing of
This procedure requires understanding the nature of visual details. Processing time increases with the number
the tasks associated with using displays. Since the purpose of visual elements in a display. Since serial processing is
of this study is to develop complexity measures for generic limited to the information available within retinal fovea
ATC displays, we extracted some typical characteristics where the eyes are fixated, the basic visual element in serial
of those displays: processing is the fixation. Therefore, we propose that the
• The displays contain mainly text, icons, and other metric of Quantity evaluated by perception is the number
binary graphical patterns (symbol, charts, etc.); of fixation groups. A fixation group is defined as a set of
• Many categories or types of data are presented in in- visual stimuli that can be perceived within a foveal fixation
termingled and sometimes overlapped patterns; for detailed analysis. Typically, a foveal fixation spans a
• Displays are often dynamic; information is continu- viewing angle of about 2-4 degrees. The average time to
ously updated; and search for a particular target on a display increases with
• Displays are interactive; users actively access to and the number of fixation groups. While there is no physi-
update information. ological limit on how many fixations one can make on
4
Object Detailed
segmentation Serial
processing
searching
Salient area (contrast,
pop-out orientation,
speed, …)
a display, visual experiments have demonstrated that it Variety evaluated by perception. Variety is related
takes 600-700ms for an observer to perceive the informa- to image segmentation and pop-out, both based on the
tion in one fixation. Therefore, the capacity limit of this uniformity and distinction of the visual features. There-
metric is determined by the time available for users to fore, we proposed that this metric involved the number
monitor a display. For example, if an air traffic controller of different visual features, including distinctive colors,
has ten seconds maximally to scan all the information on luminance contrast, spatial frequency or size, texture,
a display, then the number of fixation groups included and motion signals in a display. Increasing the variety of
in the display should be less than 14. visual features leads to difficulties in visual segmentation
Figure 4 illustrates the concept of the metric. Both and target pop-out; as a result, a complex display cannot
pictures contain aircraft symbols and datablocks, mim- be efficiently organized, and salient targets cannot be
icking those on controllers’ radar displays. Controllers instantly detected without being searched. In addition,
typically scan the datablocks and fixate on important visual studies have demonstrated that switching between
ones for detailed processing. Thus, each datablock can visual features such as color and luminance contrast
be counted as one fixation group. The picture on the left increases search time. This effect is called “cost of switch-
panel has four datablocks and to scan them all would take ing.” Switching also reduces the reliability of text reading
at least 4 × 600ms = 2. 4 sec. In contrast, the picture on and target detection. Consider, for example, that the two
the right panel has 11 datablocks so it would take at least pictures in Figure 5 contain the same materials. The text
11× 600ms = 6.6 sec to scan all of them. in the left panel has the same font and uses three colors.
In many applications, displays are crowded and it The red letters “LA” indicate an alert that controllers
takes many fixations to view all the information. One need to instantly detect and pay attention to. The blue
strategy to reduce perceptual complexity is to use color and black text represents two categories of datablocks.
or other cues to aid visual segmentation, so information This picture uses three visual features (colors) to seg-
can be segregated into several objects. Consequently, ment the displayed information, and the segmentation
the number of fixations required to complete the visual is very effective. However, the figure in the right panel
search is reduced. uses many visual features (colors, font, sizes, etc) and
5
segmentation becomes impossible. As a result, the right patch was presented alone. However, when a blank gap
figure appears to be more complex than the one on the was introduced between the central and surrounding
left due to its variety. patch, the suppression effect became much weaker. These
Relation evaluated by perception. The relationship experimental results implicitly suggested two methods to
of visual elements affects the processing of detailed visual reduce the clutter effect: 1) reducing the amount of text
information. In particular, the perceived contrast of a in a display and, 2) keeping blank surrounds for targets
visual stimulus depends on the physical contrast of the to be quickly read or detected.
stimulus and other visual stimuli in its surrounding area. Figure 6 is an illustration of the clutter effect. The
Contrast is important as the difficulty of text reading picture on the left shows the datablocks in a baseline
and graphic target detection are primarily determined by conflict alert display to alert pilots of potential conflicts
the perceived contrast. Therefore, we proposed that the with other aircraft. The picture on the right shows the
metric for relation evaluated by perception was the degree datablocks on a prototype of the display improvement.
of clutter, defined as the effect of the visual perception of Since pilots primarily need only the aircraft heading and
a stimulus being masked by the presence of other stimuli. altitude information, the prototype displays altitude in
Clutter can increase search time and reduce target detec- the datablock and uses a simple triangle to indicate the
tion as well as text readability. The effect is apparent when current heading direction of the aircraft. The additional
background visual stimuli are spatially superimposed on information is hidden and displayed only upon the user’s
the target. Moreover, the perceived luminance contrast request. The clutter is thus greatly reduced, and the infor-
of a visual target can also be largely suppressed by the mation can be more easily read. Notice that this declutter
presence of surrounding stimuli. Reduction in perceived strategy works for pilot but not controllers who need to
contrast results in significant deterioration of text read- see the hidden information most of the time. Controllers
ability. Xing and Heeger (2001) examined this effect and typically declutter their radar displays by showing part of
found that the perceived contrast of a sine-grating patch the datablock (called limited-datablock) when too many
embedded in a large patch of the same kind of gratings aircraft make the displays crowded.
was about half the contrast perceived when the central
N123 N123
UAL 135
OTP /165 UAL 135 OTP /165
350 1210
DAL62 350 1210 DAL62 672HZ23 DAL62
DAL62 672HZ23
265H-22 299A 265H-22 299A 299A
299A
TWA45 UAL 260 UAL 742 HOLD
135 TWA45 742 HOLDUAL 260 742 HOLD
UAL 135
742 HOLD
310N 350 1210 350 1210 310N 350 1210 350 1210
DAL62 DAL62
LA 485CST 265H-22 265H-22 LA 485CST 265H-22 265H-22
TWA45 299A TWA45 299A
PAA 185 PAA 185
310N 742 HOLD 310N 742 HOLD
350C
TWA45 350C
TWA45
TWA45 AAL16 485CST TWA45
AAL116 AAL16
135 AAL116
310N UAL485CST 310N UAL 135
250A 310N 350A 250A 310N 350A
485CST 350 1210 485CST 350 1210
485CST 445 460 485CST 445 460
265H-22 470 265H-22 470
UAL 135
N123
1210 165
OTP /165
350 1210
265H-22 DAL62 672HZ23 DAL62 1144 102
299A 299A
340
TWA45 UAL 260 UAL
742 HOLD 742 HOLD
135 1100 1210
310N 350 1210 350 1210
PAA485CST
185 DAL62 350
265H-22 265H-22
350C TWA45 299A 485
310N 742 HOLD
TWA45 485
TWA45
135 AAL116
310N UAL485CST
AAL16 265 4701321 350
310N 350A
485CST 350 1210 250A485CST 445 460
265H-22 470
Figure 6: Illustration of the clutter effect on text reading and target detection.
6
Cognitive Complexity traditional term “manipulation” to be consistent with
We theorized that the following cognitive tasks are the recent literature and to emphasize that this type of
performed when operators interact with ATC displays. memory is for processing information, not for maintain-
The generic cognitive tasks include 1) constructing, main- ing it. The circles in each buffer represent the elements
taining, and updating the mental representation of the of information. Open circles represent items of available
information contained in the display; 2) comprehending information; filled circles represents information selected
text and graphic information; and 3) binding (or asso- for an action. The arrows represent information flow
ciating) items of information to plan an action or make between the buffers.
a decision. All of these cognitive tasks require working The model performs tasks through interactions be-
memory. Therefore, measures of cognitive complexity tween processing memory and maintenance memory.
should be based on quantifying the demand that using Below are the basic ways that the model processes a
a display imposes on working memory. complex task:
Traditionally, working memory has been considered • Maintenance memory keeps items of information
as the limited-capacity storage system involved in the “on-line” without being attended to; such items form a
maintenance and manipulation of information over short “to-do” list for completing a task; they can be quickly
periods of time. However, recent findings suggest that retrieved and are subject to decay if not attended to
working memory for maintenance is different from that for over a period of time.
manipulation. Memory for maintenance is about chunks, • Processing memory binds pieces of information that
or elements that are in our conscious awareness in the are simultaneously needed for planning an action or
absence of sensory inputs, while memory for manipulation making a decision.
is about the independent elements or variables that must • Processing memory retrieves information from sensory
be simultaneously considered to plan an action or make systems, maintenance memory, and / or long-term
a decision (Cowan, 2001; Halford, Wilson, & Phillips, memory. Information needed for an action or deci-
1998). Based on recent findings from psychophysical sion is selected from those sources and associated in
and neurophysiological studies, we generalized a work- processing memory.
ing memory model that incorporates maintenance and • Depending on the task, processing memory can discard
manipulation memory, and we used the model to evaluate information that is no longer needed or register new
the complexity factors. information into maintenance memory for later use.
Figure 7 shows a diagram of the working memory
model. It consists of input mechanisms (long-term The limit for representational complexity. According
memory and inputs from sensory systems) and two work- to our model, “too complex to use” is when the memory
ing memory buffers: processing memory and maintenance demand for processing the displayed information exceeds
memory. We used the term “processing” rather than the the capacity of working memory. We generalized the
Maintenance
memory
Processing
memory Decision-making
Action planning
Sequenced plans
Requested actions
actions. Figure 8 provides a framework for multi-step diverse environmental perturbations and reduce the dif-
action planning. The framework is based on the fact that ficulty of decision-making. In such systems, a task of any
the brain is able to hold several action plans and execute complexity can be decomposed into a series of subtasks,
them sequentially. When performing multiple tasks with each represented by a subgoal. Some researchers have
a display, users actually switch action planning back and used the number of serializable subgoals as a measure
forth. The switch can be so quick that they feel that they of complexity for a system with a multi-level structure
are doing multiple things at the same time. However, (Heylighen, 1989).
information can be misinterpreted or lost during these Relation evaluated by action. The relation factor affects
quick switches. Next, we use this framework to evaluate action planning. The brain cortices related to motor planning
the complexity factors and derive the metrics of action can only program one action plan at a time. Therefore, we
complexity. propose that this metric should be the number of simulta-
Quantity evaluated by action. The quantity factor neous action goals required to use displayed information.
affects the first part of the action model, the requested Since the brain can only reliably program one action plan
actions. Performing physical interactions with a display at a time, ideally each action should result in only one ac-
costs time and takes users away from other perceptual and tion goal for the next step. In the case where more than one
cognitive tasks. Therefore, we proposed that the metric action goal needs to be planned simultaneously, the brain
was the minimal amount of keystrokes, mouse move- has to switch back and forth between the goals. Errors may
ments, and transitions of action modes required to use occur when switches of planning occur at a fast pace.
displayed information. Compared with keystrokes and Table 2 summarizes the 3x3 metrics we developed for
mouse movements, the time needed for eye and head generic ATC displays. The metrics describe the objective
movements is negligible. Therefore, we only considered aspects of complexity. So far, we have developed the metric
keystrokes and mouse movements. An action transition definitions, yet the algorithms or methods of computing
is a change of action modes, such as from keystrokes to each metric remain to be developed or implemented from
mouse movements or vice versa. Those transitions also the literature. Next we describe a preliminary case study to
take time and require the brain to coordinate different explore the methods of applying our metrics to computing
action modes. Sears (1994) proposed a layout complexity cognitive complexity of a human-computer interface.
metric as the summed product of the frequency of action
transitions and the cost of transitions. The two factors in A PRELIMINARY CASE STUDY: ASSESSING THE
our metrics, the amount of manual movements and the COGNITIVE COMPLEXITY OF THE MICROSOFT
transitions between the movements, are essentially the POWERPOINT INTERFACE
same as Sears’ metric.
Variety evaluated by action. The variety factor affects The purpose of this case study was to explore how to
storage of action plans. Therefore, we propose that the apply the proposed metrics to evaluate display complex-
metric is action depth, defined as the number of serial ity. Microsoft PowerPoint™ is one of the most popular
steps needed to plan (or select from a number of action software applications for making presentations. Using our
options) to acquire information. Following the need to complexity metrics, we assessed the cognitive complexity
increase the variety of actions, today’s display systems of the PowerPoint interface.
tend to use multi-level structures to cope with more
9
Table 2: Metrics of information complexity
Quantity No. of fixation groups No. of functional units Amount of action cost
12
DISCUSSION
10
This paper presents a framework to decompose infor-
mation complexity and 3x3 complexity metrics for ATC
8 N=24 displays. Previous work has reported measures similar to
No. of responses
Slide show 0
Set font
Format 1 or 0*
characteristics
Insert new slide Insert 1
measure display complexity from the perspective of hu- One drawback of Tullis’ metrics is that those measure-
man performance on visual search tasks. The metrics were ments were specified for text-dominant displays but not
comprised of four basic characteristics of display formats for graphical ones. It is hard to define Tullis’ groups in
to describe how well users can extract information from spatially continuous two-dimensional images with vary-
displays. They included a) overall density of displayed ing colors and luminance contrast. On the other hand,
items, b) local density of characters, c) number and aver- Rosenholtz, Li, Mansfield, and Jin (2005) proposed a
age group size, and d) layout complexity, which describes feature congestion measure of display clutter in which clut-
how well the arrangement of items on a display follows ter is considered as the degradation of task performance
a predictable visual scheme. Tullis showed that these caused by the density of visual features such as color or
metrics are highly correlated with subjects’ performance luminance contrast. This measure is related to our clut-
time in visual search tasks. The overall and local density ter metric and can be applied to graphic visual displays.
characteristics, together, can be a measure of our clutter Unfortunately, the algorithm requires displayed materials
metric; the number of groups corresponds to our metric to be converted to digitized images to compute the fea-
of fixation groups; and the layout complexity is some- ture congestion. That is often not practical for dynamic
what related to the variety of visual features. Therefore, displays in which visual images evolve rapidly.
for limited applications, we can use Tullis’ metrics as the
quantitative measurements for the perceptual complexity
we proposed.
11
Comparisons With Other Measures of Cognitive that controllers are forced to manipulate the display fol-
Complexity in the Literature lowing a fixed procedure. That would be contrary to the
Previous studies of cognitive complexity have focused design philosophy.
on text comprehension, creativity, social phenomena, One measure related to action complexity is Sears’ layout
etc. For example, Crokett (1965) used the concept of appropriateness metric (Sears, 1994). Sears proposed this
“level of hierarchic integration of constructs” to define metric to assess users’ performance when using a com-
cognitive complexity. With this definition, cognitive puter interface. The metric was the summed product of
complexity is associated with increasing differentiation the frequency of action transitions and the cost of these
(containing a greater number of constructs), articulation transitions. The two factors comprising our metric of
(consisting of more refined and abstract elements), and action cost, number of manual movements, and transi-
hierarchic integration (organized and interconnected). We tions between the movements are essentially the same
identified three metrics of cognitive complexity based on as Sears’ metric. Sears used the distance that users must
working memory: number of functional units, number move the computer mouse and the size of the objects to
or frequency of unpredictable changes, and number of be moved as the cost of a transition. This metric can be
relations. Our metric, the number of functional units, used to evaluate the efficiency of a user-interface layout
is similar to the measure of constructs in the literature. and compute the extent to which a display demands
We also proposed relational complexity as a metric to action. However, it does not apply to ATC displays well
quantify how the relation factor of complexity affects because it primarily emphasizes the effects of mouse
cognition. This measure corresponds to the intercon- movements, while many other kinds of actions are
nected hierarchical integration proposed by Crokett. In involved in using ATC displays (Allendoerfer, Zin-
addition, we proposed to use the frequency of unpredict- gale, Pai, & Willems, 2006) .
able information changes to measure the variety factor of Perhaps the Keystroke Level Model (KLM) proposed
complexity. However, this dynamic aspect of cognitive by Card, Noran, and Newell (1980) is more applicable
complexity has been seldom studied. to measure action complexity within the ATC domain.
Measures of cognitive complexity, explicitly or im- The model measures the sum of the execution time of
plicitly, depend on cognitive task analyses that reveal sequenced operations, including key strokes, mouse
the cognitive aspects of tasks and knowledge needed for movement, switches, and mental preparation for a physi-
situation awareness, decision-making, planning, etc. One cal action. While most of these operations correspond to
popular cognitive task analysis method is GOMS: Goal, our metric of action cost, the last operation — mental
Operator, Methods, and Selection (Card, Moran, & preparation — is somewhat related to the other two ac-
Newell, 1983). This method seeks to analyze and model tion metrics: action depth and simultaneous action goals.
the knowledge and skills a user must develop to perform Compared to Sears’s metric, the KLM model considered
tasks on a device or system. The result is a description the effects of key strokes and mental preparation. Both
of the Goals, Operators, Methods, and Selection rules play crucial roles in controllers’ interacting with ATC
for any task. Currently we are exploring how to calculate displays. Allendoerfer et al. documented the frequency
the three cognitive metrics based on GOMS and other of use of en route controller commands using radar
similar analyses. displays and measured the time and number of key-
strokes or mouse clicks required to perform those
Relevant Measures of Action Complexity in the commands. They found that controllers entered
Literature information on the radar display more frequently
Many methods have been developed to assess the com- than they moved things around. They also proposed
plexity of human-computer interfaces (McCabe, 1976; that the time spent looking at the keyboard or screen
Rauterberg, 1992). Those methods require modeling a while entering commands should be included in the
system’s states and transitions between states. Unfortu- assessment of display usage characteristics. These
nately, it is implausible to directly apply such methods to results suggested that the KLM model is a potential
ATC displays. Controllers use displays adaptively, and no candidate to objectively measure action complexity
standardized procedure has been specified. Thus, there are for ATC displays. Next, we need to explore how to
no clearly defined states and transitions in their interac- calculate action complexity based on the measure-
tions with ATC displays. If the use of a display can be ments like those in the KLM model.
described explicitly with states and transitions, it implies
12
CONCLUSIONS Cowan N (2001). The Magical number 4 in short-term
memory: A reconsideration of mental storage capac-
This paper presents a framework for decomposing ity. Behavioral and Brain Sciences; 24(1): 87-114.
factors of information complexity and a set of metrics Crockett WH (1965). Cognitve complexity and impres-
to measure ATC display complexity. The framework is sion formation. In: Maher BA (Ed.), Progress in ex-
described as follows: 1) information complexity is the perimental personality research. New York: Academic
combination of three basic factors: quantity, variety, and Press; 2: 47-90.
relation; 2) complexity factors need to be evaluated with
the mechanisms of brain information processing at three Crutchfield JP, Young K (1989). Inferring statistical
stages of information processing: perception, cognition, complexity, Physics Review Letters; 63: 105-8.
and action; and 3) the metrics of complexity can be de- Drozdz S, Kwapien J, Speth J, Wojcik M (2002). Iden-
rived by associating task requirements to brain functions. tifying complexity by means of matrices. Physica;
The framework incorporates many human factors studies A314: 355-61.
involving interface evaluation. Within this framework, we
identified a set of complexity metrics for ATC displays. Edmonds B (1999). What is Complexity? - The phi-
Future work will focus on testing the metrics in a real or losophy of complexity per se with application to
simulated ATC work environment and converting the some examples in evolution. In: Heylighen F, Aerts
metrics into easy-to-use products for the design and hu- D (Eds.), The evolution of complexity. Kluwer:
man factors evaluation of new ATC technologies. Dordrecht; 1-18.
Georgopoulos AP, Schwartz AB, Kettner RE (1986).
REFERENCES Neuronal population coding of movement direc-
tion. Science; 233: 1416-9.
Allendoerfer KP, Zingale C, Pai S, Willems B (2006). En Grassberger P (1991). Information and complexity
route air traffic controller commands: Frequency of measures in dynamical systems. In Atmanspacher
use during routine operations. Federal Aviation Ad- H, Scheingraber H (Eds.), Information dynamics.
ministration, William J. Hughes Technical Center; New York: Plenum Press; 15-33.
Technical Report No: DOT/FAA/TC-TN06/04.
Halford GS, Wilson WH, Phillips W (1998). Process-
Andersen RA, Snyder LH, Bradley DC, Xing J (1997). ing capacity defined by relational complexity:
Multimodal representation of space in the posterior Implications for comparative, developmental and
parietal cortex and its use in planning movements. cognitive psychology. Behavioral Brain Sciences;
Annual Review of Neuroscience; 20: 303-30. 21(6): 803-31.
Bennett CH (1990). How to define complexity in phys- Heylighen F (1989). Self-organization, emergence and
ics, and why. In: Zurek WH (Ed.), Complexity, the architecture of complexity. Proceedings of the
entropy and the physics of information. Redwood 1st European Conference on System Science; Paris:
City, California: Addison-Wesley; 137-48. AFCET; 1989: 23-32.
Burleson BR, Caplan SE (1998). Cognitive complexity. Hopkin VD (1995). Human factors in air traffic control.
In: McCroskey JC, Daly JA, Martin MM. & Beatty Bristol, PA: Taylor & Francis.
MJ (Eds.), Communication and personality: Trait
perspectives. Cresskill, NJ: Hampton Press. Klemola T (2000). Cognitive complexity metrics and
requirements comprehension. Australian Conference
Card SK, Moran TP, Newell A (1980). The keystroke- on Software Measurement. Sydney, Australia.
level model for user performance time with interac-
tive systems. Communications of the ACM; 23(7): McCabe (1976). A complexity measure. IEEE Transactions
396-410. on Software Engineering; SE-2: 308-20.
Card SK, Moran TP, Newell, AL (1983). The psychol- Pouget A, Zemel RS, Dayan P (2000). Information
ogy of human computer interaction. Hillsdale, processing with population codes. Nature Review
NJ: Erlbaum. of Neuroscience; 1: 125-32.
Conway AR, Engle RW (1996). Individual differences Rauterberg M (1992). A method of a quantitative mea-
in working memory capacity: more evidence for a surement of cognitive complexity. Human-Com-
general capacity theory. Memory; 4(6): 577-90. puter Interaction: Tasks and Organization - ECCE’92;
Roma, Italy. 295-307.
13
Rosenholtz R, Li Y, Mansfield J, Jin Z (2005). Feature Xing J, Andersen RA (2000). Memory activity of LIP
congestion: A measure of display clutter. Proceedings neurons for sequential eye movements simulated
of the SIGCHI 2005; 761-70. with neural networks. Journal of Neurophysiology;
84(2): 651-65.
Schmidt BK, Vogel EK, Woodman GF, Luck SJ (2002).
Voluntary and automatic attentional control of Xing J, Heeger D (2001). Quantification of contrast-
visual working memory. Perception and Psychophys- dependent center-surround interaction. Vision
ics; 64: 754-63. Research; 41: 571-83.
Sears AL (1994). Automated metrics for user interface Xing J, Manning C (2005). Complexity and automation
design and evaluation. International Journal of displays of air traffic control: Literature review and
Biomedical Computation; 34: 149-57. analysis. Washington DC: Federal Aviation Admin-
istration; Report No: DOT/FAA/AM-05/4.
Tullis TS (1985). A computer-based tool for evaluat-
ing alphanumeric displays. In Shackel B (Ed.), Xing J (2004). Measures of information complexity and
Human-computer interaction: INTERACT ‘84. the implications for automation design. Washington
London, England. DC: Federal Aviation Administration; Report No:
DOT/FAA/AM-04/17.
Wickens CD (1991). Engineering psychology and human
performance. NY: Harper Collins.
Willems B, Allen RC, Stein ES (1999). Air traffic control
specialist visual scanning II: Task load, visual noise,
and intrusions into controlled airspace. Federal
Aviation Administration, William J. Hughes Tech-
nical Center; Technical Report No: DOT/FAA/CT-
TN99/23.
14