Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

Received November 8, 2019, accepted November 20, 2019, date of publication November 25, 2019,

date of current version December 9, 2019.


Digital Object Identifier 10.1109/ACCESS.2019.2955677

Collective Intelligence: A Taxonomy and Survey


FEIJUAN HE 1 , (Member, IEEE), YUDAI PAN 2, QIKA LIN 2, XIANGLIN MIAO 1,

AND ZHOUGUO CHEN 3


1 Department of Computer Science, Xi’an Jiaotong University City College, Xi’an 710018, China
2 School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an 710049, China
3 The 30th Research Institute of China Electronics Technology Group Corporation, Chengdu 610041, China

Corresponding author: Qika Lin (qikalin@foxmail.com)


This work was supported in part by the National Key Research and Development Program of China under Grant 2016YFE0206700, and in
part by the National Natural Science Foundation of China under Grant 61877050.

ABSTRACT Collective intelligence (CI) refers to the intelligence that emerges at the macro-level of a
collection and transcends that of the individuals. CI is a continuously popular research topic that is studied
by researchers in different areas, such as sociology, economics, biology, and artificial intelligence. In this
survey, we summarize the works of CI in various fields. First, according to the existence of interactions
between individuals and the feedback mechanism in the aggregation process, we establish CI taxonomy
that includes three paradigms: isolation, collaboration and feedback. We then conduct statistical literature
analysis to explain the differences among three paradigms and their development in recent years. Second,
we elaborate the types of CI under each paradigm and discuss the generation mechanism or theoretical
basis of the different types of CI. Third, we describe certain CI-related applications in 2019, which can be
appropriately categorized by our proposed taxonomy. Finally, we summarize the future research directions
of CI under each paradigm. We hope that this survey helps researchers understand the current conditions of
CI and clears the directions of future research.

INDEX TERMS Collective intelligence, paradigm, synergetics, emergence, survey.


I. INTRODUCTION of crowds of people to facilitate large scale tasks that are
Collective intelligence is not a new area of research, and costly or time consuming with traditional methods [2]. From
related research can be traced back to early in the last cen- the definition, the targets of CI and crowdsourcing are signif-
tury. To date, collective intelligence is a popular research icantly different, so we do not discuss studies on crowdsourc-
topic, attracting researchers from different areas, such as ing in this survey.
social science, economics, biology, and artificial intelligence. Thus far, we have found three surveys about CI [3]–[5].
Although there is no unified definition of collective intel- Yu et al. [3] presented a literature review on CI in human
ligence, most researchers agree that collective intelligence society from 1907 to 2017, focusing on the influential works
surpasses individual intelligence. Therefore, this survey uses of researchers. The authors discussed how to measure and
the definition given by Michelucci and Dickinson [1], that model CI as well as the typical applications and future
is, collective intelligence (CI) refers to the intelligence that directions of CI. Salminen [4] summarized the studies on
emerges at the macro-level of a collection and transcends that CI of human society at three levels. Micro-level research
of the individuals. focuses on the factors enabling CI. Macro-level researchers
There are some other terms that denote CI. When individ- regard CI as a statistical or probabilistic phenomenon of
uals are social animals, such as bees, ants and birds, the term individual output results. Emergence-level studies focus on
Swarm Intelligence is used to denote CI; otherwise, when the how CI emerges at the macro-level from interactions of
individuals are humans, the term Wisdom of the Crowd is individuals at the micro-level. Krause et al. [5] proposed an
used for CI. In this survey, we uniformly use CI to represent overarching definition of CI that covers CI in both human
these terms. populations and social animals: individuals acquire infor-
There are certain relationships between CI and crowd- mation independently or partially independently. Through
sourcing. Crowdsourcing is a paradigm for using the power social interactions, the individuals provide a solution to a
cognitive problem that could not be solved by isolated indi-
The associate editor coordinating the review of this manuscript and viduals. The authors further summarized some important
approving it for publication was Moayad Aloqaily . criteria of CI.

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/
VOLUME 7, 2019 170213
F. He et al.: Collective Intelligence: A Taxonomy and Survey

FIGURE 1. Three paradigms of CI.

TABLE 1. Symbols and notations.

Distinguished from previous survey papers, this survey general notations used in this survey are shown in Table 1.
establishes a CI taxonomy of three paradigms and reviews The detailed explanations are given in the following
the existing studies. The major contributions are from three sections.
aspects:
1) On the basis of whether there are interactions between
individuals and the feedback mechanism in the aggregation A. TAXONOMY OF COLLECTIVE INTELLIGENCE
process, we establish a CI taxonomy, including three intelli- Generally, CI can be regarded as an aggregation of individual
gence paradigms: isolation, collaboration and feedback, from behavior results. According to the existence of interactions
simple to complicated. We explain their differences and con- between individuals and the feedback mechanism in the
duct certain statistical analysis of literatures. aggregation process, CI can be divided into three paradigms
2) We elaborate the categories of CI under each paradigm including isolation, collaboration and feedback, as Figure 1
and the corresponding typical studies, and discuss the gen- shows.
eration mechanisms and theoretical bases of different types Isolation. The CI of the isolation paradigm works without
of CI. interactions and feedback. The individual ai (i ∈ [1, . . . , n])
3) We summarize the challenges and future research direc- makes autonomous decisions and produces the behavior
tions of CI under each paradigm. results xi . There are no interactions between the individuals,
The survey is organized as follows: The Section II presents so the results xi and xj do not influence each other (xi 6= xj ).
an overview of the three paradigms. Section III, IV and V At the collective level, the aggregation method f is used
give typical research works and explain the CI mechanisms to process X = (x1 , x2 , . . . , xi , . . . , xn ) and generate f (X ),
under three paradigms. Section VI describes certain recent where f (X ) does not influence the subsequent individual
CI-related applications. Section VII summarizes the possi- behaviors.
ble future directions of CI. Finally, the Section VIII gives The CI of the isolation paradigm mainly appears in human
conclusions. society and is usually used for estimation, prediction and
association mining tasks. The theoretical basis of this kind
of CI is a set of mathematical theorems, such as Diver-
II. OVERVIEW OF COLLECTIVE INTELLIGENCE sity Prediction Theorem [6] and Condorcet’s Jury Theorem
In this section, we introduce an overview of CI tax- [7], or data mining algorithms, for example, association rule
onomy and conduct a statistical literature analysis. The learning.

170214 VOLUME 7, 2019


F. He et al.: Collective Intelligence: A Taxonomy and Survey

FIGURE 2. Word clouds of three paradigms in related abstracts of papers.

Collaboration. The collaboration paradigm is distin-


guished by working with interactions and no feedback.
Direct interactions or indirect interactions by modifying envi-
ronmental variables between individuals produces behav-
ior result X = (x1 , x2 , . . . , xi , . . . , xn ) and environment E.
On the collective level, the aggregation method f is used to
process X and E, and generate f (X , E), where f (X , E) does
not influence the subsequent individual behaviors.
The CI of the collaboration paradigm mainly exists in
human society and is used to solve complicated tasks that
might be hard for individuals to accomplish. This CI could
be considered a designed serendipity [8]. The group is the
best combination of individuals with complementary abilities
oriented to specific tasks.
Feedback. One of the characteristics of the feedback
paradigm is the existence of interactions and feedback. Direct
interactions between individuals or indirect interactions by
modifying environmental variables produces behavior results FIGURE 3. Proportion of papers under the three paradigms over time.

Xt = (x1,t , x2,t , . . . , xi,t , . . . , xn,t ) and environment Et at


time t. At the collective level, f (Xt , Et ) affects Xt+1 and Et+1 ,
and the CI emerges through a self-organization mechanism. analysis, we give an intuitive introduction of these studies,
The CI of the feedback paradigm appears in both human which are shown in Figure 2 and Figure 3.
society and social animals and is mainly used for solving Figure 2 shows three word clouds for the abstracts of
optimization problems and evolutionary analysis of complex papers on the three different paradigms. A word cloud is
systems. This kind of CI could be described and explained by based on the occurrence frequency of words or word terms
synergetic approaches [9]. Synergetics is an interdisciplinary in multiple documents, which are indicated by the position
research area that is used to illustrate how subsystems in open and size of the corresponding text in the figure. We use the
complex systems form structures of space, time, or function abstract of each paper as an object document for statistics,
through self-organization [10]. which can highly reflect the concerns of each paper. Accord-
The three paradigms mentioned above show that the gen- ing to the construction method mentioned in [11], [12], all the
eration mechanisms and theoretical bases of CI are quite possible word terms are first extracted and the corresponding
different, which indicates that there might not exist a unified frequencies are determined. Then, each word is assigned
framework to explain various CI phenomena. a position by an adaptive algorithm. Specifically, the sizes
of the terms depend on their frequencies. Word terms that
are large in size and close to the center are of significance.
B. LITERATURE ANALYSIS In this way, the research of each paradigm can be represented
In this survey, we collect 62 papers that cover all three visually by the corresponding word cloud.
paradigms by searching for the keywords collective intel- From Figure 2, we can see certain obvious common words
ligence, swarm intelligence and wisdom of the crowd on in all the word clouds, such as forecast, accuracy, indi-
Google Scholar. From the perspective of statistical literature vidual, and group. This result reflects the commonality of

VOLUME 7, 2019 170215


F. He et al.: Collective Intelligence: A Taxonomy and Survey

TABLE 2. Some representative studies on isolation paradigm.

the three paradigms, that is, they all try to achieve CI by III. ISOLATION PARADIGM
analyzing or forecasting using the information of individ- In CI of the isolation paradigm, there are two aggregation
ual or group behaviors. In Figure 2.1, the words individual, methods for output results of independent individuals: simple
forecast and estimate account for a large part of the cloud. statistics such as mean, median and mode, and complex text
This result indicates that the isolation paradigm focuses on mining technology. A summary of the representative studies
some forecasting or estimating tasks of individuals. Corre- on the isolation paradigm is given in Table 2.
spondingly, in Figure 2.2, the proportion of the words group,
social influence, collective, and team greatly increases. This A. SIMPLE STATISTICS
result shows that the collaboration paradigm emphasizes the A large number of experiments indicate that using certain
interactions between groups. In Figure 2.3 of the feedback simple statistics such as median, mean or mode at the col-
paradigm, the frequency of the words algorithm and opti- lective level can yield better estimations or decision-making
mization increases. Combined with the corresponding stud- results than those of most individuals.
ies, it indicates that the feedback paradigm helps to optimize Median. Statistician Galton [13] gave the earliest case of
the solutions of estimates or predictions. Overall, the fig- this kind of CI in 1907. Galton analyzed the estimates of
ure shows the differences of three paradigms, which indi- the weight of a particular living ox by 787 different persons
rectly proves the validity of our division of CI. and found that the median of estimates was closer to the true
Figure 3 shows the changes of the proportion of papers value than most individual estimates. The median is usually
under the three paradigms over time. Each bar in the fig- used for performance analysis with other statistics, especially
ure represents the proportion in each time period, and dif- mean values in the following studies. For example, for the
ferent colors represent different paradigms. Overall, with task of estimating the number of marbles in a large glass jar,
the passage of time, research of isolation paradigm has Krause et al. [14] found that the mean and median values
decreased, and research of collaboration and feedback of collective estimates were 553.6 and 516 respectively, and
paradigms has gradually increased. This shows that CI came within 1.5% and 8.2% of the true value of 562. Addi-
research is increasingly inclined to complex interactions and tionally, Agnew [15] compared the performances of mean,
feedback. median and Bayesian method in forecasting procedures for

170216 VOLUME 7, 2019


F. He et al.: Collective Intelligence: A Taxonomy and Survey

macroeconomic indicators such as Gross National Product According to this theorem, higher diversity leads to lower
(GNP) growth rate and Real Gross National Product (RGNP) collective error and promotes the performance of CI.
growth rate. Moreover, McNees [16] compared the forecast- Mode. The mode of the individual results in a group
ing performance of mean and median in seven macroeco- (also called Majority Voting) also achieves better decision
nomic indicators like GNP, unemployment rate and corporate outcomes than a single individual. Ralf et al. [25] analyzed
profits. From the previous experiments, the median has com- a dataset involving more than 20,000 diagnoses from more
parable performance in forecasting. than 140 doctors and found that the diagnostic performance
The reason that the median takes effect in CI is that no mat- of the mode is better than that of the best doctor in the group
ter what the real value is, the median of collective forecasts when the diagnostic accuracy is relatively similar. Hastie and
is closer to the real value than at least 50% of the individuals. Kameda [26] provided an evaluation of nine statistical values
The 50% maximum exists only when the actual outcome xT in CI in a simulated test bed environment and determine that
is outside of this range [16]: the mode has better robustness.
     In certain annotation tasks performed by crowdsourcing,
xM − xL xH − xM
xM − , xM + (1) the majority voting strategy is used to improve the anno-
2 2 tation quality. For example, Snow et al. [27] investigated
In Formula 1, xM , xH and xL are the median, maximum and five annotation tasks of natural language processing (NLP):
minimum of collective estimates, respectively. affect recognition, word similarity, recognizing textual entail-
Mean. The mean of collective estimates can also improve ment, temporal event recognition, and word sense disam-
the accuracy compared with that of individuals. Zarnowitz biguation. They found that using the majority voting strategy
[17] forecasted GNP, RGNP, Implicit Price Deflator (IPD) only requires a small number of nonexperts to achieve the
and Consumer Expenditures for Durable Goods (CEDG) with annotation quality of experts. Sheng et al. [28] used repeated
79 individuals and determined that the group mean forecasts labeling and combined the majority voting strategy with
are more accurate than most of the corresponding sets of other annotation strategies, such as round-robin and selective
individual forecasts. Stock and Watson [18] forecasted the repeated labeling, to improve the quality of annotation. Tian
output growth of seven countries from 1959 to 1999, such and Zhu [29] demonstrated a max-margin majority voting for
as the United States, Canada and France, by comparing a improving the discriminative ability of majority voting and
series of statistics like mean, median and Trimmed Mean further proposed a Bayesian generalization to gain true labels
[38]. They found that simple mean values often yield better from the labels of noisy observations.
results than complex statistics, which means they have lower These phenomena are supported by Condorcet’s Jury The-
Mean Squared Forecast Errors. Genre et al. [19] explored orem [7]. This theorem states that if individuals in a group
multiple combination schemes of expert forecasts, including make decisions independently and the probability of correct-
combinations based on principal components and trimmed ness is greater than 0.5, then the probability of making a cor-
means, performance-based weighting as well as Bayesian rect decision based on the majority voting strategy increases
shrinkage. Only a few of the schemes outperform the mean with the growth of group size.
of the collective forecasts. Moreover, in forecasting, even the
mean of multiple estimates from an individual also shows a B. TEXT MINING
CI effect [20]–[22]. By mining the text data of independent individuals, new
Mean values are affected by extreme values, which might knowledge unknown to any individual can be found. This
cause weak CI robustness. Therefore, some more sophisti- type of CI usually has two characteristics. First, the individ-
cated calculation methods for mean values have been intro- uals are highly distributed in time and space without obvious
duced, such as Weighted Mean, Trimmed Mean, and Win- interactions. Second, individuals can not perceive the goals at
sorized Mean [39]. These statistics can achieve better perfor- the collective level.
mance in particular forecasting tasks [23], [24]. The most representative study of this type is from the infor-
That the mean of collective estimates is more accurate than mation scientist Swanson. Swanson et al. [30] proposed a
that of the individual estimates can be explained by the Diver- new strategy to facilitate the discovery of medical knowledge
sity Prediction Theorem [6]. If there are n individual estimates based on text mining. A literature set LAB demonstrates the
x1 , x2 , . . . , xi , . . . , xn and a real value xT , then the Collective relationships between topic A and B, and another literature set
Error (CE) is (xT − x̄)2 , which is thePSquared Deviation (SD) LBC demonstrates the relationships between topic B and C.
of the estimates’ mean value x̄ = n1 ni=1 xi and the real value If LAB and LBC satisfy these two conditions, the relationships
xT . Group P Diversity (GD) is the variance of the estimates, between A and C have certain scientific meaning.
which is n1 ni=1 (xi − x̄)2 , and Average Individual Error (AIE) 1) Complementary: The new useful information can be
is the average of Squared Deviation inferred from LAB and LBC together but cannot be obtained
Pbetween the individual from LAB or LBC alone.
estimates and real value, which is n1 ni=1 (xi −xT )2 . Diversity
Prediction Theorem can be expressed as: 2) Noninteractive: There is no common literature between
LAB and LBC and no citation or co-citation relationship
CE = AIE − GD. (2) between them either.

VOLUME 7, 2019 170217


F. He et al.: Collective Intelligence: A Taxonomy and Survey

TABLE 3. Some representative studies on collaboration paradigm.

Using the previous strategy, Swanson [31] found a rela- 2) There are two limitations in research using text mining
tionship between Raynaud disease and eicosapentaenoic acid, in the aggregation method. First, generating a literature set,
which is the main component of fish oil. He also discovered such as LAB , mainly depends on the co-occurrence of topics in
some medical information such as the relationship between documents. Specifically, if topic A and B appear in every doc-
migraines and magnesium [32]. Swanson and his colleague ument in LAB , then we consider these topics to be connected.
Neil Smalheiser [33] then developed a software named However, this assumption is not reliable, so it will affect the
Arrowsmith [40] to identify relationship between Medline credibility of the final result. Second, the generation of liter-
literatures. Other scientists promoted similar research. For ature sets and association mining requires high overhead and
instance, Hristovski et al. [34], [35] developed a literature- tremendous manual work [42], leading to low extensibility.
based discovery system BITOLA [41] using an association
mining algorithm and figured out the relationship between IV. COLLABORATION PARADIGM
insulin and Huntington disease. Weeber [36], [37] used this In the CI of the collaboration paradigm, individuals accom-
knowledge discovery model to develop a Disease-Adverse plish assigned tasks by collaboration, competition and infor-
drug reaction-Drug (DAD) system for drug discovery pro- mation sharing. Individuals can be in the same space-time
cess. According to the therapeutic characteristic B, the DAD domain or in Internet conditions across space-time. For the
can build the pathway from drug A and disease C. CI of the collaboration paradigm, the research focuses on
In this kind of CI, every literature in LAB and LBC is a result the analysis and recognition of the key factors, including
of individual behaviors. The aggregation method f refers to social influence, diversity and other factors. A summary of
the text mining algorithm or tool. The pathway A ←→ C the representative studies on the collaboration paradigm is
corresponds to the CI. This pathway does not exist in any given in Table 3.
single literature from LAB and LBC .
There are still two drawbacks in the research of CI based A. SOCIAL INFLUENCE
on the isolation paradigm. Social influence refers to the transformation of opinions,
1) In the study using simple statistics as the aggregation attitudes and judgments of an individual from the influence
method, single values of the individual output are used for of other individuals in a group. This is a significant factor of
point estimation, which means the output from the collective the collaboration paradigm.
level is also a value. These single values reduce the flexibility Most studies have proposed that social influence pro-
of this kind of aggregation method. In many circumstances, motes CI. Gürçay et al. [43] analyzed the impacts of social
the output of individuals or groups is uncertain and can be influence on the estimation accuracy, consensus and con-
expressed as an interval or a probability distribution. The fidence of uncertain quantity estimates, and found that the
research on CI in these circumstances is still in its infancy. utilizing of social influence on estimates achieves higher

170218 VOLUME 7, 2019


F. He et al.: Collective Intelligence: A Taxonomy and Survey

accuracy and greater consensus than independent estimates. individuals from Top Management Teams (TMT) negatively
Bernstein et al. [44] compared the effect of intermittent and affects on strategic collective consensus. Aggarwal et al. [52]
constant social influence on CI. They found that intermittent investigated how the team members’ cognitive styles related
social influence cannot only develop effective information to object and spatial visualization affect the team’s strate-
exchanging and mutual learning, but also maintain a high gic focus and strategic consensus. The results indicated that
level of exploration improving CI. In a lie detection task, diversity in cognitive style is negatively associated with the
Klein and Epley [45] demonstrated that the results from group formation of group consensus, further affecting the CI.
discussion are more accurate than those from individuals There are two points need explaining: 1) The functions
in the isolation paradigm based on majority voting (mode of diversity are different in the collaboration paradigm and
value). This demonstration verifies that social influence can isolation paradigm. For the collaboration paradigm, the effect
increase CI in certain conditions. of diversity represents the complementary views, skills and
However, some studies suggest that social influence knowledge among individuals, which is more conducive to
might reduce individual diversity and negatively impact CI. solving complex tasks. For the latter paradigm, diversity can
Lorenz et al. [6] confirmed the assumption that the knowl- reduce the collective error. 2) The effects of social influ-
edge of estimates undermines CI by three effects: social ence and diversity on CI are not isolated. According to
influence, range reduction and confidence. The social influ- Mavrodiev et al. [53], social influence does not directly affect
ence effect diminishes the diversity of individual opinions. CI, but benefits or harms the CI indirectly by the diversity and
Muchnik et al. [46] observed the social bias effect caused by collective error.
social influence, especially positive social influence, which
significantly increases the likelihood of positive ratings. For C. OTHER FACTORS
example, if an up-vote is first inserted into an online forum, Except for social influence and diversity, some researchers
the influence bias of group decision-making will increase worked on other factors of CI in the collaboration
to 25% on average. Becker et al. [47] determined that in paradigm, including individual attributes, group composi-
decentralized networks, social interactions between individ- tion, leadership and incentive mechanisms. For example,
uals enhance the performance of CI, whereas in centralized Woolley et al. [54] showed that CI is weakly correlated with
conditions, the influence of central individuals on other indi- the average or maximum individual intelligence of group
viduals reduces the CI effect. members but is strongly correlated with the average social
sensitivity of individuals, the equality in the distribution of
B. DIVERSITY conversational turn-taking, and the proportion of females
Diversity, which is an important factor in CI, refers to the in the group. In the Polymath1 project [55], Tim Gowers
degree of variation between the individuals in a group. Many and other online netizens tried to find combinatorial proof
studies have indicated that diversity can improve CI of the of the Density Hales-Jewett Theorem with a side length
collaboration paradigm. Hong and Page [48] compared the of three. This task was accomplished in only 6 weeks and
performance for two kinds of problem-solving teams. One is within 1000 comments. Cranshaw and Kittur [56] analyzed
a team of randomly selected agents, composed of individuals this collaborative process and found that: 1) Only a small
with functional diversity, and another is a team comprised percentage of the individuals contribute significantly to CI.
of the best-performing agents. They found that the former 2) Leadership plays a critical role in CI. Mann and Helbing
team outperformed the latter team, which is called diversity [57] proposed an incentive scheme by rewarding accurate
trumps ability. Jeffery [49] examined the effect of interper- minority predictions to achieve diversity in individual esti-
sonal congruence as a moderator in the relationship between mates and increase the accuracy of collective estimates.
diversity and group effectiveness. Interpersonal congruence Geifman and Raban [58] studied the role of individual self-
refers to the coherence degree to which group members see efficacy in solving cooperative problems. Self-efficacy is the
the others in the group as the others see themselves. The subjective judgment of an individual on whether he or she can
investigation shows that diversity tends to improve the per- use his or her own skills to accomplish a certain task.
formance in groups with high interpersonal congruence but At present, there are still two shortcomings in the research
undermines the performance of groups with low interpersonal of CI in the collaboration paradigm.
congruence. Aggarwal and Woolley [50] found that CI has First, the studies available use correlation analysis, stan-
an inverted U-shaped relationship with cognitive divergence. dardized regression coefficients, linear regression, Lasso
This discovery shows that regarding the groups of similar regression and other strategies to identify variables that have
cognitive styles or quite different cognitive styles, the perfor- correlations with or dependence on CI. However, these meth-
mance of CI is maximized when the cognitive divergence is ods cannot demonstrate the causal relationship between vari-
moderate. ables and CI, and struggle to explain the causes of CI.
Excessive diversity leads to gaps in communication Second, the research on the negative effects of social influ-
and collaboration between individuals and obstructs CI. ence and the corresponding solutions is still in infancy. The
Knight et al. [51] investigated that diversity in terms of func- asynchrony of social effects between individuals greatly dis-
tional background, age, education and employment tenure of torts collective decisions by introducing social biasing effects

VOLUME 7, 2019 170219


F. He et al.: Collective Intelligence: A Taxonomy and Survey

TABLE 4. Some representative studies on feedback paradigm.

including herding and snowballing [59]. The herding effect (or status) during the traversal process. At the collective level,
indicates that an individual in a group follows the opinions f (Et , Xt ) calculates the amount of pheromone Et+1 at time
of the population majority. The snowballing effect means if t + 1, in which the feedback mechanism is represented by
an opinion earns an initial advantage, the influence becomes influencing Xt+1 at time t + 1. The CI eventually obtains the
increasingly obvious. path described by the pheromones.
Kennedy and Eberhart [61] then proposed Particle Swarm
V. FEEDBACK PARADIGM Optimization (PSO) algorithm in 1995, which simulated birds
The CI in the feedback paradigm is actually an emergence flocking. The individuals in a flock (also called particles)
phenomenon. On the one hand, simple individual collab- have two attributes: velocity and position. At the individual
orative behavior at the micro-level can influence variables level, the velocity vi,t of individual ai at time t is updated
at the macro-level and CI-related variables. On the other based on three variables (usually in vector form), vi,t−1 ,
hand, macro-level variables restrict or promote the micro- the best position of the individual (pbesti ) and the global
level through feedback mechanisms. With this iteration pro- optimal position of the group (gbest). gbest can be regarded
cess, a new characteristic or structure of CI at the macro-level as the environment E for individual interactions. The position
can be established. xi,t at time t, which is the individual output, is updated on
The following subsections §V.A and §V.B present research the basis of xi,t−1 and vi,t . At the group level, pbesti (i ∈
analysis on CI of the feedback paradigm in social animals and [1, . . . , n]) and gbest are updated according to the value of Xt ,
human societies respectively. These studies are summarized and affect Xt+1 at time t + 1. After the algorithm converges,
in Table 4. the output gbest is treated as CI.
An algorithm named Artificial Bee Colony (ABC) was
A. COLLECTIVE INTELLIGENCE EMERGENCE IN SOCIAL proposed by Karaboga and Basturk [62], which is used to
ANIMALS solve the optimization problem of a multi-variable function.
Since the 1990s, a series of CI algorithms for resolving In ABC algorithm, individuals can be divided into three roles:
optimization problems has been proposed by considering employed bees, onlooker bees and scout bees. They can trans-
social animals such as ants, bees and birds, which concentrate fer their roles in particular situation. ABC algorithm finds the
on the cooperative behavior of groups and the emergence global optimal solution through individual local optimization.
mechanism of CI. At the individual level, the employed bees search for new food
The earliest algorithm of this type is Ant Colony Opti- sources in the neighborhood of the current source and eval-
mization (ACO) Algorithm in 1992 proposed by Dorigo [60]. uate the fitness. Onlooker bees select a honey source based
ACO algorithm simulated the process of how ants find the on information shared by the employed bees and calculate
most effective path between the nest and a food source. the fitness value as well. If the fitness of the food source
In this algorithm, the environment Et refers to the amount has not increased after multiple iterations, the corresponding
of pheromone at time t between two positions (or status), employed bees change to onlooker bees and randomly gen-
which is generated by the behaviors of ant colonies before erate a new position to replace the current food source. The
time t. The individual ant ai generates indirect interactions output of individual ai at time t is a tuple xt = (si,t , fit(si,t )),
based on the environment Et , that is, the transition probability si,t refers to the position of the current food source of ai at
is calculated according to Et , and the next position (or status) time t and fit(si,t ) is the fitness of si,t . Contrasting Xt and
is transferred to other individual ants until all the positions Xt+1 , it shows that a rich source attracts more individuals by
are traversed. The behavior output of individual ai at time t positive feedback at the collective level, while a poor source
refers to the amount of pheromone xi,t between two positions is gradually abandoned.

170220 VOLUME 7, 2019


F. He et al.: Collective Intelligence: A Taxonomy and Survey

In recent years, people have proposed a series of more evolution equation can explain how the interactions between
complicated CI algorithms based on the behavior of social individuals of different types at the micro-level can develop
animals. Mirjalili et al. [63] raised the Grey Wolf Optimizer a structurally stable society at the macro-level. The authors
(GWO) algorithm to represent the hierarchy of members. The ranked ten countries with this model and found a remarkable
GWO algorithm simulates the leadership hierarchy of four concordance with certain standard economic rankings.
types of grey wolves, including alpha, beta, delta and omega, Tao [68] used Boltzmann distribution with Marginal
and also simulates three steps of prey hunting, involving Labor-Capital Return and Marginal Technology Return as
searching, encircling, and attacking. The GWO algorithm parameters to describe the enterprise income under the com-
achieves competitive results on 29 well-known test functions. petitive economy. Based on the differential equation of GDP,
Meng et al. [64] proposed an algorithm named Bird Swarm he deduced that the called technological level is a self-
Algorithm based on social behaviors in bird swarms, such as referential variable that can be considered to be an index
foraging, vigilance and flight behavior. The effectiveness of measuring the CI in an economic environment. The com-
bird swarm algorithms has been verified by comparison tests petitive behavior of enterprises helps a higher technological
on 18 benchmark problems. In addition to social animals, level to emerge, and the technological level controls or guides
people found that the swarm behaviors of lower organisms, enterprise behaviors by a feedback mechanism.
such as bacteria and physarum, also exhibit CI. The cor- The present studies aiming at the emergence mechanisms
responding algorithms have been designed [65], [66]. This of CI under the feedback paradigm mainly focus on groups
kind of algorithm is supposed to be robust, decentralized and with simple rules and slight discrepancies between individ-
parallelized. uals. Many groups in reality, especially human society, have
A synergetic approach can be used to describe and explain complex individual behaviors and great diversity. Although
this kind of CI. Synergetics is an interdisciplinary research theoretically we could demonstrate and model them by a syn-
area that illustrates how subsystems in open complex sys- ergetic method, it would still be hard to promote quantitative
tems form structures of space, time, or function through self- analysis because of the introduction of a large number of
organization [10]. variables and differential equations.
Utilizing a synergetic approach to interpret this kind of
CI mechanism can be treated as a process of finding order VI. RECENT APPLICATIONS OF CI
parameters. Generally, there are a large number of variables Many CI-related applications have been proposed to solve
in a mathematical model used to describe a complex system, practical problems. We discuss certain typical works
but most are effective only for a short time and have little in 2019 in this section, where each work can be categorized
influence on the evolution of the system. These variables appropriately to one of the three paradigms, with the hope
are called fast variables. Another kind of variables are called of helping researchers understand the current application
order parameters, also known as slow variables. This kind of scenarios of CI. These applications with contents and the
variables indicate the degree of macro-order in a system. They corresponding paradigms are shown in Table 5.
are the total contribution of the synergetic behavior from each
subsystem or individual and control the transformation of fast A. APPLICATIONS OF THE ISOLATION PARADIGM
variables. The order parameters play considerable roles in For the isolation paradigm, there are several applica-
the system evolution. Usually, there are fewer order param- tions focusing on providing intelligent services by col-
eters in the system, and sometimes only one. For example, lective decisions of independent individuals. For example,
in the previous ACO, PSO and ABC algorithms, the order Dambanemuya and Horvát [72] aggregated personal lending
parameters are the amount of pheromone, the position of behaviors and successful loan payments, which can be seen
particles and the fitness of honey source, respectively. The as individual behavior results, to provide reliable decision
adiabatic elimination procedure [69] can usually be used to support. This can overcome P2P’s weakness of information
eliminate an abundance of fast variables to distinguish the asymmetry about borrowers’ credibility. Lee et al. [73] pro-
order parameters. posed a pest diagnosis system using data acquisition and
deep learning through CI. The required data, in the prediction
B. COLLECTIVE INTELLIGENCE EMERGENCE IN HUMAN of pests arising from the plant cultivation, is independently
SOCIETY collected from experts and user participates. By aggregating
There also exists CI of feedback paradigm in human society, the data from a large number of surveyors, a rapid diagnosis
which is manifested as self-interested and dispersed inter- can be made.
actions of individuals at the micro-level, and orderly social
phenomena emerging at the macro-level, such as Invisible B. APPLICATIONS OF THE COLLABORATION PARADIGM
Hand [70] in market economy and Spontaneous Order [71] There are many CI-related applications of the collabora-
in human society. tion paradigm and we choose six of them as examples.
Yukalov et al. [67] proposed an evolution equation to Otoum et al. [74] presented a restricted Boltzmann
describe a society composed of different types of individ- machine-based clustered intrusion detection system (RBC-
uals, including cooperators, defectors, and regulators. The IDS) in the monitoring of critical infrastructures through

VOLUME 7, 2019 170221


F. He et al.: Collective Intelligence: A Taxonomy and Survey

TABLE 5. Some applications of CI in 2019.

wireless sensor networks (WSNs). In RBC-IDS, a selection to simulate the interactions between individuals and the envi-
method is accomplished to integrate information, which cal- ronment. Otoum et al. [80] proposed a reinforcement learn-
culates the weight of each sensor node and compares it with ing method for intrusion detection system (RL-IDS). With
the weights of other nodes. In densely crowded environments, RL-IDS, the agents (sensors in WSNs) interpret the present
Aloqaily et al. [75] described how to replicate data from the state of the environment and fulfill an action that eventually
cloud to the edge, and then to mobile devices to provide results in certain rewards to the agents. The experimental
faster data access for users. Through node collaboration, data results show that RL-IDS achieves state-of-the-art perfor-
replication and service composition offload some workload mance. Petrillo et al. [81] proposed Swarm Debugging Infras-
burdens from the cloud. Kotb et al. [76] proposed a workflow- tructure (SDI), which can collect data about developers’ inter-
net based framework for efficient cloud computing, through active debugging sessions, and also feedback recommenda-
which fog computing devices are able to form a cooperative tions to developers. Such a continuous iterative cycle reaches
Internet of Things (IoT) service delivery system and enhance the target of swarm debugging. Boveiri et al. [82] proposed
performance gains of cloud computing. Yeh et al. [77] a high-performance approach based on the MaxâĂ"Min Ant
focused on the case of modeling trajectory data collected System (MMAS), which is an efficient variation in the family
from multi-player sports games, namely basketball and soc- of ACO algorithms. In MMAS, optimal task-order can be
cer, where each player is treated as an agent. They leveraged achieved as a feedback by scheduling in homogeneous multi-
graph neural networks (GNNs) and variational recurrent processor environments. Therefore, MMAS can utilize local
neural networks (VRNNs) to process interaction information mini data-centers to release the burden from the main data-
between the agents and then perform conditional prediction. center.
Belbachir et al. [78] proposed a self-adaptive mechanism In summary, recent CI-related applications, especially
for traffic regulation based on cooperative agents, which can those of the collaboration and feedback paradigms, have
minimize traffic congestion. The cooperative behaviors are been widely applied in various scenarios, which meet the
not pre-defined, but they emerge from the agent interactions results of literature analysis in Section II. Some mainstream
at a local level. Awal et al. [79] predicted user preferences models or methods of artificial intelligence have been used in
and future behaviors by embedding collective knowledge CI, such as deep learning and reinforcement learning.
in social networks. Concretely, they presented an extension
of the signed modularity and determined users’ degrees of VII. FUTURE RESEARCH DIRECTIONS
affiliation towards various communities. The signs (posi- In this section, we summarize six future research directions
tive or negative) of the links are considered for collective related to three paradigms of CI.
behavior prediction in signed social networks. Nonnumerical Estimation under Isolation Paradigm.
At present, in CI of the isolation paradigm, statistics are
C. APPLICATIONS OF THE FEEDBACK PARADIGM mainly used for numerical estimation. The results of indi-
There are several CI-related recent applications of the feed- vidual estimation are usually nonnumerical, such as inter-
back paradigm, which mainly integrate certain advanced val and probability distributions. Simply treating the results
algorithms, such as reinforcement learning and ACO, as a numerical value eliminates uncertainty but causes

170222 VOLUME 7, 2019


F. He et al.: Collective Intelligence: A Taxonomy and Survey

information loss [28]. In addition, at the group level, peo- on: (1) How to discriminate dysergy phenomena in the pro-
ple also want to obtain nonnumerical aggregation results. cess of individual interactions; (2) How to eliminate or relieve
Therefore, the aggregation model oriented to nonnumerical dysergy phenomena on CI.
estimation should be focused on. Measurement of Collective Intelligence. There are stud-
Causality Analysis in Collective Intelligence. Correla- ies showing that groups also have the similar factor of general
tion analysis is mainly used to analyze the influencing factors intelligence, called g factor, of individuals dealing with mul-
of collective intelligence, but there is still a lack of studies tiple cognitive tasks. Thus far, there have been some studies
on causality analysis. Causality refers to the interactions on the measurement of CI. Woolley et al. [54] found that the
between variables, and the change of one variable is triggered level of CI is weakly correlated with the average or maximum
by another. Although correlation analysis is significant in the intelligence of individuals in a group. Kosinski et al. [91]
mining of causality, it is neither necessary nor sufficient to proposed the intelligence level of a group, called Crowd
establish causation [83]. Moreover, causality is an important IQ, by a method similar to an IQ questionnaire. However,
foundation for restraining the negative effects and improving there are still some problems to be solved: (1) Measurement
the CI. There are still some problems to be solved: (1) Causal- indicators of CI in different scenarios, such as prediction
ity map and recognition of latent variables in CI scenarios; (2) and collaborative learning; (2) Real-time measurement and
Control and regulation of CI based on causal inference. dynamic regulation of CI.
Human-Computer Hybrid Collective Intelligence.
Actually, many engineering problems might involve intelli- VIII. CONCLUSION
gent computing tasks under uncertain and open conditions. CI is a long-standing and vast research field, involving soci-
The human brain is better at dealing with such tasks than ology, economics, biology, artificial intelligence and other
the available artificial intelligence. For this reason, human different fields. The overlapping or closely related directions
computing [84] and human-in-the-loop augmented intelli- include swarm intelligence, wisdom of the crowd, human
gence [85] have been proposed, aiming at studying how to use computation, and crowdsourcing. In this survey, we define CI
humans as computing components and solving complex prob- as the intelligence that emerges from the macro-level of a col-
lems in reality by combining humans and computers. From lection and transcends that of the individuals. Additionally,
the perspective of CI, this kind of work could also be consid- we try to provide an overview of the research that matches
ered to provide solutions for complex tasks through human- with this definition. First, we demonstrate a taxonomy of CI
computer collaboration in heterogeneous groups. The tasks under three paradigms: isolation, collaboration and feedback,
are difficult to accomplish by only human beings or com- depending on the existence of interactions between individu-
puters. The main problems include: (1) Model and emer- als and the feedback mechanism in the aggregation process.
gence mechanism of human-computer hybrid CI; (2) Human- Second, from the perspective of statistical literature analysis,
computer perception and understanding under uncertain and we explain the differences among these three paradigms and
open conditions; (3) Task modeling and dynamic allocation; their development in recent years. Third, we analyze the CI
(4) Measurement of human-computer hybrid CI. studies of each paradigm and explain the generation mech-
Synergetics Model of Networked Collective Intelli- anisms and theoretical bases of the different types. Fourth,
gence. The Internet breaks the barrier of physical space-time we describe certain CI-related applications in 2019, utiliz-
on human group collaboration and helps to form more com- ing different techniques to address some frontier challenges
plex networked CI (NCI) in larger scales of time and space. in different areas. Finally, we summarize the prospective
This group has the attributes of large individual differences research directions of CI.
(interest, ability, and diversity of objectives), high dynamic We hope that this survey provides a guideline for beginners
(the joining and exiting of members are not constrained), and in CI, helps them classify existing studies and sheds light on
cross-temporal distribution, which greatly improves the com- future research.
plexity of modeling. The main problems to be solved include:
(1) The evolutionary model of NCI; (2) The identification REFERENCES
of order parameters and the construction of order parameter [1] P. Michelucci and J. L. Dickinson, ‘‘The power of crowds,’’ Science,
vol. 351, no. 6268, pp. 32–33, 2016.
equations; (3) The application in Computer-Supported Coop- [2] T. Yan, M. Marzilli, R. Holmes, D. Ganesan, and M. Corner, ‘‘mCrowd:
erative Work (CSCW) [86], [87] and Computer-Supported A platform for mobile crowdsourcing,’’ in Proc. 7th ACM Conf. Embedded
Cooperative Learning (CSCL) [88], [89]. Netw. Sensor Syst., 2009, pp. 347–348.
[3] C. Yu, Y. Chai, and Y. Liu, ‘‘Literature review on collective intelligence:
Discrimination and Elimination of Dysergy Phe- A crowd science perspective,’’ Int. J. Crowd Sci., vol. 2, no. 1, pp. 64–73,
nomenon. CI is still unable to emerge in most cases in human 2018.
society even with the characteristics of interactions and feed- [4] J. Salminen, ‘‘Collective intelligence in humans: A literature review,’’
2012, arXiv:1204.3401. [Online]. Available: https://arxiv.org/abs/
back. From the perspective of synergetics, the main reason 1204.3401
is the existence of the dysergy phenomenon in groups [90]. [5] J. Krause, G. D. Ruxton, and S. Krause, ‘‘Swarm intelligence in animals
Dysergy refers to the interactions between individuals that and humans,’’ Trends Ecol. Evol., vol. 25, no. 1, pp. 28–34, 2010.
[6] J. Lorenz, H. Rauhut, F. Schweitzer, and D. Helbing, ‘‘How social influ-
promote one function but impede another. In order to ensure ence can undermine the wisdom of crowd effect,’’ Proc. Nat. Acad. Sci.
the emergence and sustainability of CI, it is necessary to work USA, vol. 108, no. 22, pp. 9020–9025, 2011.

VOLUME 7, 2019 170223


F. He et al.: Collective Intelligence: A Taxonomy and Survey

[7] D. Berend and J. Paroush, ‘‘When is condorcet’s jury theorem valid?’’ [35] D. Hristovski, B. Peterlin, J. A. Mitchell, and S. M. Humphrey, ‘‘Using
Social Choice Welfare, vol. 15, no. 4, pp. 481–488, 1998. literature-based discovery to identify disease candidate genes,’’ Int. J. Med.
[8] M. Nielsen, Reinventing Discovery: The New Era of Networked Science. Informat., vol. 74, nos. 2–4, pp. 289–298, 2005.
Princeton, NJ, USA: Princeton Univ. Press, 2011. [36] M. Weeber, ‘‘Drug discovery as an example of literature-based discovery,’’
[9] (2017). Synergetics (Haken). Accessed: Oct. 16, 2019. [Online]. Available: in Computational Discovery of Scientific Knowledge. Berlin, Germany:
https://en.wikipedia.org/w/index.php?title=Synergetics_(Haken)&oldid= Springer, 2007, pp. 290–306.
805750827 [37] M. Weeber, H. Klein, A. R. Aronson, J. G. Mork, L. de Jong-van Den Berg,
[10] H. Haken, ‘‘Synergetics—An outline,’’ in Synergetic Computers and Cog- and R. Vos, ‘‘Text-based discovery in biomedicine: The architecture of the
nition. Berlin, Germany: Springer, 2004, pp. 20–35. dad-system,’’ in Proc. AMIA Symp., 2000, p. 903.
[11] W. Cui, Y. Wu, S. Liu, F. Wei, M. X. Zhou, and H. Qu, ‘‘Context preserving [38] (2019). Truncated Mean. Accessed: Oct. 16, 2019. [Online]. Available:
dynamic word cloud visualization,’’ in Proc. IEEE Pacific Vis. Symp. https://en.wikipedia.org/w/index.php?title=Truncated_mean&oldid=8882
(PacificVis), Mar. 2010, pp. 121–128. 10015
[12] F. Heimerl, S. Lohmann, S. Lange, and T. Ertl, ‘‘Word cloud explorer: Text [39] (2019). Winsorized Mean. Accessed: Oct. 16, 2019. [Online]. Available:
analytics based on word clouds,’’ in Proc. 47th Hawaii Int. Conf. Syst. Sci., https://en.wikipedia.org/w/index.php?title=Winsorized_mean&oldid=88
2014, pp. 1833–1842. 1012445
[13] F. Galton, ‘‘Vox populi,’’ Nature, vol. 75, no. 7, pp. 450–451, 1907. [40] (2005). The Arrowsmith Project Homepage. Accessed: Oct. 16, 2019.
[14] S. Krause, R. James, J. J. Faria, G. D. Ruxton, and J. Krause, ‘‘Swarm intel- [Online]. Available: http://arrowsmith.psych.uic.edu
ligence in humans: Diversity can trump ability,’’ Animal Behav., vol. 81, [41] (2006). Bitola—Biomedical Discovery Support System. Accessed:
no. 5, pp. 941–948, 2011. Oct. 16, 2019. [Online]. Available: http://ibmi.mf.uni-lj.si/sl/bitola
[15] C. E. Agnew, ‘‘Bayesian consensus forecasts of macroeconomic vari- [42] W. Pratt and M. Yetisgen-Yildiz, ‘‘LitLinker: Capturing connections across
ables,’’ J. Forecasting, vol. 4, no. 4, pp. 363–376, 1985. the biomedical literature,’’ in Proc. 2nd Int. Conf. Knowl. Capture, 2003,
[16] S. K. McNees, ‘‘The uses and abuses of ‘consensus’ forecasts,’’ J. Fore- pp. 105–112.
casting, vol. 11, no. 8, pp. 703–710, 1992. [43] B. Gürçay, B. A. Mellers, and J. Baron, ‘‘The power of social influence on
[17] V. Zarnowitz, ‘‘The accuracy of individual and group forecasts from busi- estimation accuracy,’’ J. Behav. Decis. Making, vol. 28, no. 3, pp. 250–261,
ness outlook surveys,’’ J. Forecasting, vol. 3, no. 1, pp. 11–26, 1984. 2014.
[18] J. H. Stock and M. W. Watson, ‘‘Combination forecasts of output growth [44] E. Bernstein, J. Shore, and D. Lazer, ‘‘How intermittent breaks in interac-
in a seven-country data set,’’ J. Forecasting, vol. 23, no. 6, pp. 405–430, tion improve collective intelligence,’’ Proc. Nat. Acad. Sci. USA, vol. 115,
Sep. 2004. no. 35, pp. 8734–8739, 2018.
[19] V. Genre, G. Kenny, A. Meyler, and A. Timmermann, ‘‘Combining expert [45] N. Klein and N. Epley, ‘‘Group discussion improves lie detection,’’ Proc.
forecasts: Can anything beat the simple average?’’ Int. J. Forecasting, Nat. Acad. Sci. USA, vol. 112, no. 24, pp. 7460–7465, 2015.
vol. 29, no. 1, pp. 108–121, 2013. [46] L. Muchnik, S. Aral, and S. J. Taylor, ‘‘Social influence bias: A randomized
[20] E. Vul and H. Pashler, ‘‘Measuring the crowd within probabilistic repre- experiment,’’ Science, vol. 341, no. 6146, pp. 647–651, 2013.
sentations within individuals,’’ Psychol. Sci., vol. 19, no. 7, pp. 645–647, [47] J. Becker, D. Brackbill, and D. Centola, ‘‘Network dynamics of social
2008. influence in the wisdom of crowds,’’ Proc. Nat. Acad. Sci. USA, vol. 114,
[21] D. van Dolder and M. J. van den Assem, ‘‘The wisdom of the inner crowd no. 26, pp. E5070–E5076, 2017.
in three large natural experiments,’’ Nature Hum. Behav., vol. 2, no. 1, [48] L. Hong and S. E. Page, ‘‘Groups of diverse problem solvers can outper-
pp. 21–26, 2018. form groups of high-ability problem solvers,’’ Proc. Nat. Acad. Sci. USA,
[22] S. M. Herzog and R. Hertwig, ‘‘The wisdom of many in one mind: vol. 101, no. 46, pp. 16385–16389, 2004.
Improving individual judgments with dialectical bootstrapping,’’ Psychol. [49] J. T. Polzer, L. P. Milton, and W. B. Swarm, Jr., ‘‘Capitalizing on diversity:
Sci., vol. 20, no. 2, pp. 231–237, 2009. Interpersonal congruence in small work groups,’’ Administ. Sci. Quart.,
[23] V. R. R. Jose and R. L. Winkler, ‘‘Simple robust averages of forecasts: vol. 47, no. 2, pp. 296–324, 2002.
Some empirical results,’’ Int. J. Forecasting, vol. 24, no. 1, pp. 163–169, [50] I. Aggarwal, A. W. Woolley, C. F. Chabris, and T. W. Malone, ‘‘Cognitive
2008. diversity, collective intelligence, and learning in teams,’’ in Proc. Collec-
[24] D. V. Budescu and E. Chen, ‘‘Identifying expertise to extract the wisdom tive Intell. Conf., 2015, vol. 1, nos. 3.1, p. 3.
of crowds,’’ Manage. Sci., vol. 61, no. 2, pp. 267–280, 2015. [51] D. Knight, C. L. Pearce, K. G. Smith, J. D. Olian, H. P. Sims, K. A. Smith,
[25] R. H. J. M. Kurvers, S. M. Herzog, R. Hertwig, J. Krause, P. A. Carney, and P. Flood, ‘‘Top management team diversity, group process, and strate-
A. Bogart, G. Argenziano, I. Zalaudek, and M. Wolf, ‘‘Boosting medical gic consensus,’’ Strategic Manage. J., vol. 20, no. 5, pp. 445–465, 1999.
diagnostics by pooling independent judgments,’’ Proc. Nat. Acad. Sci. [52] I. Aggarwal and A. W. Woolley, ‘‘Do you see what I see? The effect of
USA, vol. 113, no. 31, pp. 8777–8782, 2016. members’ cognitive styles on team processes and errors in task execution,’’
[26] R. Hastie and T. Kameda, ‘‘The robust beauty of majority rules in group Org. Behav. Hum. Decis. Process., vol. 122, no. 1, pp. 92–99, 2013.
decisions,’’ Psychol. Rev., vol. 112, no. 2, p. 494, 2005. [53] P. Mavrodiev, C. J. Tessone, and F. Schweitzer, ‘‘Effects of social influence
[27] R. Snow, B. O’Connor, D. Jurafsky, and A. Y. Ng, ‘‘Cheap and fast— on the wisdom of crowds,’’ 2012, arXiv:1204.3463. [Online]. Available:
But is it good?: Evaluating non-expert annotations for natural language https://arxiv.org/abs/1204.3463
tasks,’’ in Proc. Conf. Empirical Methods Natural Lang. Process., 2008, [54] A. W. Woolley, C. F. Chabris, A. Pentland, N. Hashmi, and T. W. Malone,
pp. 254–263. ‘‘Evidence for a collective intelligence factor in the performance of human
[28] V. S. Sheng, F. Provost, and P. G. Ipeirotis, ‘‘Get another label? Improving groups,’’ Science, vol. 330, no. 6004, pp. 686–688, 2010.
data quality and data mining using multiple, noisy labelers,’’ in Proc. [55] T. Gowers and M. Nielsen, ‘‘Massively collaborative mathematics,’’
14th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, 2008, Nature, vol. 461, no. 7266, p. 879, 2009.
pp. 614–622. [56] J. Cranshaw and A. Kittur, ‘‘The polymath project: Lessons from a suc-
[29] T. Tian and J. Zhu, ‘‘Max-margin majority voting for learning from cessful online collaboration in mathematics,’’ in Proc. SIGCHI Conf. Hum.
crowds,’’ in Proc. Adv. Neural Inf. Process. Syst., 2015, pp. 1621–1629. Factors Comput. Syst., 2011, pp. 1865–1874.
[30] D. R. Swanson and N. R. Smalheiser, ‘‘An interactive system for find- [57] R. P. Mann and D. Helbing, ‘‘Optimal incentives for collective intelli-
ing complementary literatures: A stimulus to scientific discovery,’’ Artif. gence,’’ Proc. Nat. Acad. Sci. USA, vol. 114, no. 20, pp. 5077–5082, 2017.
Intell., vol. 91, no. 2, pp. 183–203, 1997. [58] D. Geifman and D. R. Raban, ‘‘Collective problem-solving: The role of
[31] D. R. Swanson, ‘‘Fish oil, raynaud’s syndrome, and undiscovered public self-efficacy, skill, and prior knowledge,’’ Interdiscipl. J. E-Learn. Learn.
knowledge,’’ Perspect. Biol. Med., vol. 30, no. 1, pp. 7–18, 1986. Objects, vol. 11, no. 1, pp. 159–178, 2015.
[32] D. R. Swanson, ‘‘Migraine and magnesium: Eleven neglected connec- [59] L. B. Rosenberg, ‘‘Human swarming, a real-time method for parallel
tions,’’ Perspect. Biol. Med., vol. 31, no. 4, pp. 526–557, 1988. distributed intelligence,’’ in Proc. Swarm/Hum. Blended Intell. Workshop
[33] N. R. Smalheiser, ‘‘The Arrowsmith project: 2005 status report,’’ in Proc. (SHBI), Sep. 2015, pp. 1–7.
Int. Conf. Discovery Sci. Berlin, Germany: Springer, 2005, pp. 26–43. [60] M. Dorigo and T. Stützle, Ant Colony Optimization. Cambridge, MA, USA:
[34] D. Hristovski, C. Friedman, T. C. Rindflesch, and B. Peterlin, ‘‘Exploiting MIT Press, 2004.
semantic relations for literature-based discovery,’’ in Proc. AMIA Annu. [61] J. Kennedy and R. C. Eberhart, ‘‘Particle swarm optimization,’’ in Proc.
Symp., 2006, p. 349. Int. Conf. Netw., 1995, pp. 1942–1948.

170224 VOLUME 7, 2019


F. He et al.: Collective Intelligence: A Taxonomy and Survey

[62] D. Karaboga and B. Basturk, ‘‘A powerful and efficient algorithm for [87] (2019). Computer-Supported Cooperative Work. Accessed:
numerical function optimization: Artificial bee colony (ABC) algorithm,’’ Oct. 16, 2019. [Online]. Available: https://en.wikipedia.org/w/index.php?
J. Global Optim., vol. 39, no. 3, pp. 459–471, Apr. 2007. title=Computer-supported_cooperative_work&oldid=905294828
[63] S. Mirjalili, S. M. Mirjalili, and A. Lewis, ‘‘Grey wolf optimizer,’’ Adv. [88] G. Stahl, ‘‘A view of computer-supported collaborative learning research
Eng. Softw., vol. 69, pp. 46–61, Mar. 2014. today,’’ in Proc. Int. Conf. Collaboration Technol. Syst. (CTS), 2011,
[64] X.-B. Meng, X. Z. Gao, L. Lu, Y. Liu, and H. Zhang, ‘‘A new bio-inspired p. 325.
optimisation algorithm: Bird swarm algorithm,’’ J. Exp. Theor. Artif. Intell., [89] (2019). Computer-Supported Collaborative Learning. Accessed:
vol. 28, pp. 673–687, Jun. 2015. Oct. 16, 2019. [Online]. Available: https://en.wikipedia.org/w/index.
[65] A. Shklarsh, G. Ariel, E. Schneidman, and E. Ben-Jacob, ‘‘Smart swarms php?title=Computer-supported_collaborative_learning&oldid=921034480
of bacteria-inspired agents with performance adaptable interactions,’’ [90] N. A. Coulter, Human Synergistics (The Revised Internet Edition). Bergen
PLoS Comput. Biol., vol. 7, no. 9, p. e1002177, 2011. County, NJ, USA: The Time-Binders, 2002.
[66] L. Liu, Y. Song, H. Zhang, H. Ma, and A. V. Vasilakos, ‘‘Physarum [91] M. Kosinski, Y. Bachrach, G. Kasneci, J. Van-Gael, and T. Graepel,
optimization: A biology-inspired algorithm for the steiner tree problem in ‘‘Crowd IQ: Measuring the intelligence of crowdsourcing platforms,’’ in
networks,’’ IEEE Trans. Comput., vol. 64, no. 3, pp. 818–831, Mar. 2015. Proc. 4th Annu. ACM Web Sci. Conf., 2012, pp. 151—160.
[67] V. I. Yukalov, E. P. Yukalova, and D. Sornette, ‘‘Utility rate equations of
group population dynamics in biological and social systems,’’ PLoS ONE,
vol. 8, no. 12, p. e83225, 2013. FEIJUAN HE received the B.S. degree from Xi’an
[68] Y. Tao, ‘‘Swarm intelligence in humans: A perspective of emergent evolu- Jiaotong University, in 2010, and the M.A. degree
tion,’’ Phys. A, Stat. Mech. Appl., vol. 502, pp. 436–446, Jul. 2018. from The Hong Kong Polytechnic University,
[69] (2019). Autler-Townes. Accessed: Oct. 16, 2019. [Online]. Available: in 2012, all in computer science. She is currently
https://en.wikipedia.org/w/index.php?title=Autler%E2%80%93Townes_ an Associate Professor with the Department of
effect&oldid=920444858 Computer Science, Xi’an Jiaotong University City
[70] (2019). Invisible Hand. Accessed: Oct. 16, 2019. [Online]. Available: College. She has published more than 20 research
https://en.wikipedia.org/w/index.php?title=Invisible_hand&oldid=91469 papers in various journals. Her main research inter-
2334 ests include text mining, collective intelligence,
[71] (2019). Spontaneous Order. Accessed: Oct. 16, 2019. [Online]. Available: and e-learning.
https://en.wikipedia.org/w/index.php?title=Spontaneous_order&oldid=92
1202468
[72] H. K. Dambanemuya and E.-Á. Horvát, ‘‘Harnessing collective intelli- YUDAI PAN received the B.E. degree in com-
gence in P2P lending,’’ in Proc. 10th ACM Conf. Web Sci., 2019, pp. 57–64. puter science and technology from Xi’an Jiaotong
[73] H.-Y. Lee, D.-H. Kim, and K.-R. Park, ‘‘Pest diagnosis system based University, Shaanxi, China in 2015, and the M.S.
on deep learning using collective intelligence,’’ Int. J. Elect. Eng. Edu., degree in computer science from Syracuse Univer-
pp. 1–15, 2019. sity, Syracuse, NY, USA. She is currently pursuing
[74] S. Otoum, B. Kantarci, and H. T. Mouftah, ‘‘On the feasibility of deep the Ph.D. degree with the Faculty of Electronic
learning in sensor network intrusion detection,’’ IEEE Netw. Lett., vol. 1, and Information Engineering, Xi’an Jiaotong Uni-
no. 2, pp. 68–71, Jun. 2019. versity, Shaanxi. Her research interests include
[75] M. Aloqaily, I. Al Ridhawi, H. B. Salameh, and Y. Jararweh, ‘‘Data machine learning and data mining.
and service management in densely crowded environments: Challenges,
opportunities, and recent developments,’’ IEEE Commun. Mag., vol. 57,
no. 4, pp. 81–87, Apr. 2019.
[76] Y. Kotb, I. Al Ridhawi, M. Aloqaily, T. Baker, Y. Jararweh, and H. Tawfik, QIKA LIN received the B.E. and M.E. degrees
‘‘Cloud-based multi-agent cooperation for IoT devices using workflow- from the Beijing Institute of Technology, Beijing,
nets,’’ J. Grid Comput., pp. 1–26, Jun. 2019. China, in 2016 and 2019, respectively. He is cur-
[77] R. A. Yeh, A. G. Schwing, J. Huang, and K. Murphy, ‘‘Diverse generation rently pursuing the Ph.D. degree with the Faculty
for multi-agent sports games,’’ in Proc. IEEE Conf. Comput. Vis. Pattern of Electronic and Information Engineering, Xi’an
Recognit., Jun. 2019, pp. 4610–4619. Jiaotong University, Shaanxi, China. His research
[78] A. Belbachir, A. El Fallah-Seghrouchni, A. Casals, and M. Pasin, ‘‘Smart interests include deep learning and e-learning.
mobility using multi-agent system,’’ Procedia Comput. Sci., vol. 151,
pp. 447–454, Apr. 2019.
[79] G. K. Awal and K. Bharadwaj, ‘‘Leveraging collective intelligence for
behavioral prediction in signed social networks through evolutionary
approach,’’ Inf. Syst. Frontiers, vol. 21, no. 2, pp. 417–439, 2019. XIANGLIN MIAO received the B.S. degree from
[80] S. Otoum, B. Kantarci, and H. Mouftah, ‘‘Empowering reinforcement Xi’an Jiaotong University, in 1976. is currently
learning on big sensed data for intrusion detection,’’ in Proc. IEEE Int. a Professor with the Department of Computer
Conf. Commun. (ICC), May 2019, pp. 1–7.
Science, Xi’an Jiaotong University City College.
[81] F. Petrillo, Y.-G. Guéhéneuc, M. Pimenta, C. D. S. Freitas, and F. Khomh,
He has published more than 40 research papers
‘‘Swarm debugging: The collective intelligence on interactive debugging,’’
J. Syst. Softw., vol. 153, pp. 152–174, Jul. 2019. in various journals and conferences. His main
[82] H. R. Boveiri, R. Khayami, M. Elhoseny, and M. Gunasekaran, ‘‘An effi- research interests include data mining and infor-
cient swarm-intelligence approach for task scheduling in cloud-based mation management.
Internet of Things applications,’’ J. Ambient Intell. Humanized Comput.,
vol. 10, no. 9, pp. 3469–3479, 2019.
[83] G. Sugihara, R. May, H. Ye, C.-H. Hsieh, E. Deyle, M. Fogarty,
ZHOUGUO CHEN received the master’s degree
and S. Munch, ‘‘Detecting causality in complex ecosystems,’’ Science,
vol. 338, no. 6106, pp. 496–500, Oct. 2012. in signal and information processing from the
[84] L. Von Ahn, ‘‘Human computation,’’ in Proc. IEEE 24th Int. Conf. Data University of Electronic Science and Technol-
Eng., Apr. 2008, pp. 1–2. ogy of China (UESTC), in 2006. He is currently
[85] N.-N. Zheng, Z.-Y. Liu, P.-J. Ren, Y.-Q. Ma, S.-T. Chen, S.-Y. Yu, a Network Security Senior Engineer with The
J.-R. Xue, B.-D. Chen, and F.-Y. Wang, ‘‘Hybrid-augmented intelligence: 30th Research Institute of China Electronics Tech-
Collaboration and cognition,’’ Frontiers Inf. Technol. Electron. Eng., nology Group Corporation, Chengdu, China. His
vol. 18, no. 2, pp. 153–179, Feb. 2017. research interests include network security, big
[86] J. Grudin, ‘‘Computer-supported cooperative work: History and focus,’’ data, and network forensics.
Computer, vol. 27, no. 5, pp. 19–26, May 1994.

VOLUME 7, 2019 170225

You might also like