Species Distribution Models Predict Rare Species Occurrences Despite Significant Effects of Landscape Context

Journal of Applied Ecology 2016, 53, 1871–1879 doi: 10.1111/1365-2664.
12702
Species distribution models predict rare species

occurrences despite significant effects of landscape
context
J. L. McCune*
Department of Integrative Biology, University of Guelph, Science Complex, Guelph, ON N1G 2W1, Canada
Summary
1. The true status of many endangered plants is uncertain because the locations of all extant
populations are not known. Species distribution models (SDMs) can direct searches for addi-
tional populations, but habitat fragmentation may influence the distribution of rare species
more than climatic or edaphic factors in human-dominated landscapes. In this study, I test
the ability of SDMs to predict rare plant occurrences in a fragmented landscape and the
importance of predicted habitat suitability versus landscape context.
2. I built SDMs for eight rare woodland plants and assessed them using an independent data
set including plant community surveys of 51 sites. I used community data to determine
whether SDMs predict the right habitat type even when the target rare species was absent. I
then modelled rare species presence based on predicted habitat suitability, distance to the
nearest known population and the amount of forest habitat within 500 m of the site.
3. SDMs were effective for seven of the eight species, with the degree of predicted habitat
suitability positively related to species’ occurrence. I found new populations of four of the
eight species. However, the amount of forest habitat available in the vicinity of a plot was
also a positive predictor of rare plant occurrences. Among sites predicted to be suitable, the
distance to the nearest known population was the strongest predictor of rare plant occur-
rence.
4. Synthesis and applications. Species distribution models (SDMs) can effectively target
searches for populations of rare species even in human-dominated landscapes. Surveying the
plant community at sites predicted to be suitable can help to improve the SDM. SDMs used
in conjunction with data on landscape context can maximize the efficiency of searches for rare
species and show which species are restricted by dispersal limitation and habitat
fragmentation in addition to edaphic and climatic factors.
Key-words: associated species, dispersal limitation, disturbance, forest, habitat fragmenta-
tion, human-dominated landscapes, MAXENT, rare plants, species distribution model,
woodland
populations are likely present on privately owned land

Introduction
that has not been surveyed (Lovett-Doust et al. 2003).
An important question for maintaining biodiversity in The greatest threats to Canada’s endangered plant species
human-dominated landscapes is whether rare species can are residential and commercial development and recre-
cope with disturbance and fragmentation. But for many ational activities (McCune et al. 2013), and thus, undis-
rare species we lack the basic data on population size, covered populations of rare plants are in danger of
locations and changes over time that we need to answer inadvertent destruction by humans.
it. Canada’s rare plant species are a prime example of One way to improve our knowledge of the status of
this. Data on the current and historical status of many of rare species is to use species distribution models (SDMs)
these species are sparse or absent. Undocumented to prioritize areas for field surveys. SDMs predict a spe-
cies’ distribution across space based on georeferenced
*Correspondence author. E-mail: jmccune@uoguelph.ca occurrence records and environmental predictors (Guisan
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society
1872 J. L. McCune
& Zimmermann 2000). Hundreds of papers involving

SDMs are published every year (Franklin 2013), and the
distributions of thousands of individual species have been
modelled (e.g. Kharouba et al. 2013). SDMs are used for
a broad range of applications, including forecasting range
shifts with climate change and invasions by introduced
species (Franklin 2013). SDMs also have important
applications to conservation, including reserve selection
(Guisan et al. 2013).
Several studies have used SDMs to predict habitat for
rare plant species (e.g. Engler, Guisan & Rechsteiner
2004; Bourg, McShea & Gill 2005; Guisan et al. 2006;
Williams et al. 2009; Le Lay et al. 2010). However, rare
species may be absent from suitable sites because of dis-
Fig. 1. The location of the study area in southern Ontario (inset,
persal limitation, competition and historical factors shaded), and a close-up of the study area (shaded) showing the
(Boetsch, Van Manen & Clark 2003; Gogol-Prokurat Precambrian shield (crosshatched), the Niagara escarpment (thick
2011). In human-dominated landscapes where habitat is black line), three major cities (black stars) and the locations of 51
fragmented, the history of land use within and surround- surveyed 1-ha grid squares (white dots).
ing a habitat patch and its isolation from other patches
may determine occurrences to an equal or greater degree ranges from 7 to 40% of pre-settlement cover, depending on the
than edaphic and climatic conditions (Rogers et al. 2009). county (Larson et al. 1999). The main land use is agriculture,
except for where the Precambrian shield extends southward, and
Several authors have pointed out the need to incorporate
very shallow soils over the bedrock prevent intensive agriculture
landscape connectivity and/or dispersal into SDMs (e.g.
(Crins et al. 2009; Fig. 1). The total extent of the study area is
Franklin 2010; Folt^ete et al. 2012). Yet there have been
approximately 11 million hectares, with elevation ranging from
no studies that test the importance of climatic and 20 to 575 m above sea level. The topography is gentle, except for
edaphic factors versus habitat fragmentation and dispersal the Niagara escarpment, a thin line of limestone cliffs (Crins
limitation for the distribution of rare plants by building et al. 2009; Fig. 1).
and testing SDMs in conjunction with data on the state There are over 700 plant species ranked S1 (extremely rare, 1–5
of the landscape. known populations), S2 (very rare, 6–20 known populations) or
In this study, I used SDMs to predict suitable habitat S3 (rare to uncommon, 21–80 known populations) (Oldham &
for rare woodland plant species in southern Ontario, Brinker 2009). I chose eight of these species with different range
Canada. Southern Ontario represents <1% of Canada’s types, habitat specificity and rarity levels (Table 1). Many of
land area, but supports 35% of the human population southern Ontario’s rare plant species are at the northern edge of
their range here, with the bulk of their distribution extending
(Government of Ontario 2000). I chose eight rare plant
south into the United States. I selected two species which have
species that vary in their distribution, habitat specificity
the majority of their range in Ontario (Table 1). Distributions of
and prevalence. I had two objectives: (i) to assess the util- species are often more accurately predicted by SDMs at the range
ity of SDMs for predicting suitable habitat for rare plants edge, presumably because the species relies on particular environ-
with different range and habitat characteristics by using mental conditions that are patchy near the edge of the range
multiple measures, including plant community data, and (MacDougall & Loo 2002; Luoto et al. 2005).
(ii) to test the importance of predicted habitat suitability The eight species also differ in their habitat specificity. For
(based only on geology, soils, climate and topography) example, the fern Asplenium scolopendrium grows only on the
and landscape context for the probability of detecting rocky limestone substrates of the Niagara escarpment (Oldham &
each species. I hypothesized that range-edge species and Brinker 2009), whereas the orchid Cypripedium arietinum grows
those with more specialist habitat requirements would be in a wider range of habitats (Brzeskiewicz 2000; Blaney & Maze-
more accurately modelled by SDMs. If landscape context rolle 2007; Table 1). Habitat specialists are often more accurately
modelled than generalists because of their reliance on spatially
has a role in determining rare plant distributions, then I
restricted environmental conditions (Hernandez et al. 2006; Le
predicted that the presence of a species would be more
Lay et al. 2010). All eight species are easy to recognize in the
accurately modelled if the amount of forest on the sur- field, and primarily associated with woodland habitats. I com-
rounding landscape and/or the distance to the nearest piled a list of common associated species of each of the eight spe-
known population were taken into account. cies from the literature (Appendix S1 in Supporting Information).
Materials and methods BUILDING MODELS
I built SDMs for each species using MAXENT (Phillips, Anderson

THE STUDY REGION AND SPECIES
& Schapire 2006; Elith et al. 2011), which performs as well or
Southern Ontario was once dominated by mixed deciduous and better than other modelling methods (Hernandez et al. 2006; Wil-
coniferous forests (Crins et al. 2009; Fig. 1). Current forest cover liams et al. 2009). I gathered data on soils, topography, surficial
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
Predicting rare species occurrences 1873
Table 1. Range characteristics and habitat preferences of the eight plant species. Status designations are special concern (SC), threatened
(THR) and endangered (END). Species without a status category have not yet been assessed. See text for S rank definitions. The final
column indicates the beta diversity of surveyed plots predicted to be suitable for each species, as estimated by the average Bray–Curtis
distance to the centroid of that group of plots. The beta diversity of all 51 plots was 4615
Beta diversity of plots

% of range in predicted to be
Scientific name Status S rank Ontario Breadth of habitat suitable
Arisaema dracontium SC S3 <5 Medium (floodplain forests) 4263

Asplenium scolopendrium SC S3 >50 Narrow (limestone, Niagara escarpment) 3396
Chimaphila maculata END S1 <5 Narrow (dry sandy soil near Great Lakes) 3708
Corallorhiza odontorhiza S2 <5 Narrow (dry sandy soil near Great Lakes) 3188
Cornus florida END S2? <5 Wide (deciduous forest edges/understorey) 4395
Cypripedium arietinum S3 >50 Wide (swamps or shallow soil over limestone) 4527
Phegopteris hexagonoptera SC S3 <5 Medium (rich, moist, mature maple/beech forests) 4306
Vaccinium stamineum THR S1 <5 Narrow (recently burned areas near Great Lakes) 3311
geology and climatic factors (see Appendix S2), and converted of 2014, I surveyed 51 grid squares (Fig. 1), with the goal of
each variable to a 100 9 100 m grain size. I chose a set of 14 maximizing the variation in the range of habitat suitability pre-
predictors that were minimally intercorrelated. I obtained georef- dicted by the model, and distance from the nearest record of the
erenced observation records for all eight target species from the species. At each site, we used a global positioning system to
Ontario Ministry of Natural Resources and Forestry (OMNRF) locate the centre of the grid square. We thoroughly searched each
Natural Heritage Information Centre (NHIC). The records include square, with search times ranging from 3 to 6 person hours
herbarium records, opportunistic sightings and purposeful depending on the vegetation and terrain. We recorded all vascu-
searches by biologists, and are represented by polygons which indi- lar plant species present, and estimated the abundance of each
cate the spatial certainty of the observation. I eliminated all using a coarse 5-category scale.
records with an area of >100 m radius; 91% of the records were
from 1980 or later, with only one record from prior to 1958. High
spatial accuracy of records was the most important criterion for STATISTICAL ANALYSES
inclusion, and given the relatively low number of records available,

I did not remove spatially accurate pre-1980 records, considering Standard model assessments
them to be good indicators of the topographic, geological, edaphic
MAXENT provides the AUC (area under a receiver operating char-
and climatic conditions in which the species could survive.
acteristic curve; see Fielding & Bell 1997) for training and test
I used MAXENT version 3.3.3 to model the distribution of each
data. I also calculated AUC, sensitivity (the proportion of pres-
species (https://www.cs.princeton.edu/ schapire/maxent/; Phil-
ences predicted correctly by the model) and specificity (the pro-
lips, Anderson & Schapire 2006). I excluded duplicate observa-
portion of absences predicted correctly by the model) using an
tion records from the same 100 9 100 m grid square, which
independent data set that was not used to build the SDMs. This
resulted in a total number of observations ranging from 5 to 295,
data set was a combination of records from the 51 forest surveys
depending on the species (Table 2). Although SDM accuracy can
plus records held by the NHIC that had not yet been incorpo-
decline when based on fewer than 30 records (Wisz et al. 2008),
rated into the official data base. I used only records with ≤100 m
MAXENT has been shown to perform well even with sample sizes
positional accuracy, and eliminated any records in the same grid
as low as 5 (Hernandez et al. 2006; Pearson et al. 2007). For each
square as a record used to build the SDM. This data set also
species, I built seven models with slightly different MAXENT set-
included a few absence records resulting from unsuccessful tar-
tings, each time randomly excluding 25% of the observation
geted searches by NHIC staff for a particular species. These
records to use as test data, repeating this subsampling 10 times
absences are biased, because targeted searches are performed only
and taking the average of all 10 replicates (Appendix S2). I
in areas that seem to have suitable habitat according to expert
selected the best model based on minimal predicted area (MPA;
botanists. However, the bulk of the absences for each species
Engler, Guisan & Rechsteiner 2004). I used the cumulative model
resulted from my field surveys, which include sites with both suit-
output, which avoids assumptions about species prevalence (Phil-
able and unsuitable habitats; therefore, I consider the absences in
lips, Anderson & Schapire 2006; Merow, Smith & Silander 2013).
combination to be a relatively unbiased sample of the region. I
The cumulative value at each grid square ranges from 0 to 100%,
eliminated any absences that were in the same grid square as a
with 100% being the most suitable. I set the threshold for pre-
presence, which represent re-surveys of areas formerly known to
dicting suitable habitat to the minimum value resulting in the
support a population of the species. I consider these to be false
correct prediction of 90% of all records used to build the model
absences. I calculated AUC, sensitivity and specificity for the
(i.e. a 10% omission rate) and selected the model that predicted
independent data set using the ‘dismo’ package in R (R Founda-
the lowest percentage of the study area to be suitable.
tion for Statistical Computing, Vienna, Austria).
FIELD SAMPLING
Plant community and landscape analyses
I randomly selected 100 forested grid squares predicted to be suit-
able for each species, stratified by the distance to the nearest To characterize the plant community composition of the sur-
observation used to build the model. In the spring and summer veyed grid squares, I converted the coarse abundance scale used
1874 J. L. McCune
in the field surveys to an ordinal scale from 1 (very rare) to 5
Specificity
model. Model threshold is the threshold of habitat suitability (MAXENT’s cumulative output) above which a grid square is predicted to be suitable. See text for explanations of AUC, sensitivity
Table 2. Summary of species distribution model (SDM) characteristics and evaluation measures. MPA (minimal predicted area) is the percentage of the study area predicted suitable by the best
(dominant) and ordinated all squares using non-metric multidi-
(%)
mensional scaling (NMDS) based on Bray–Curtis similarity
62
86
82
92
62
76
49
94
(McCune & Grace 2002), excluding the target species. If the
SDMs were useful, I expected that species with narrow habitat
preferences would have lower beta diversity among plots pre-
Sensitivity dicted suitable compared to species with wide habitat tolerances.
(%) To test this, I calculated the multivariate dispersion (average
NA
74
60
78
14
68
91
22
community similarity) for the 51 sites as a whole, for the group
of sites predicted to be suitable for each target species where the
species was absent, and for the group of sites predicted to be
07594
08908
08707
08739
07700
08908
04172
NA
suitable for each target species in which the species was found
AUC
(Anderson, Ellingsen & McArdle 2006). I tested for significant

differences in community dispersion using a distance-based test
for homogeneity of multivariate dispersions (PERMDISP;
Anderson, Ellingsen & McArdle 2006) in the program PRIMER
absences)
absences)
absences)
absences)
absences)
absences)
absences)
absences)
No. independent
(Clarke & Gorley 2015). I also compared the beta diversity

among suitable sites with and without the target species present
using the same method.
(55
(49
(49
(51
(53
(41
(53
(51
records
I compared the relative abundance of associated species in

69
58
58
64
62
51
142
131
squares that were predicted suitable versus unsuitable using Wil-

coxon rank-sum tests. Then, I used generalized linear models
Independent data
(glm) with a binomial link to model suitable versus unsuitable

field searches
sites based on relative abundance of associates (see

ratio from
Appendix S3).
Success
I also calculated the relative abundance of disturbance-asso-

0/3
0/3
2/17
3/10
2/11
5/20
0/10
0/24
ciated species in each surveyed square. I designated a species as

disturbance-associated if its habitat description in the Michigan
Flora (Reznicek, Voss & Walters 2011) included one or more of
the terms ‘disturbed places’, ‘roadsides’, ‘railroads’, ‘openings and
threshold
trails’, ‘fencerows’, ‘powerline clearings’, ‘ditches and swales’ or

Model
2988
2609
5995
2795
2778
6383
705
212
‘meadows and pastures’. Disturbance-associated species tend to

colonize sites with less natural vegetation on the surrounding
landscape (e.g. Vallet et al. 2010; McCune & Vellend 2015). I
used ArcGIS to calculate the total area forested within 500 m of
MPA
081
047
008
027
122
360
212
005
each surveyed plot, and then used a Spearman rank correlation

(%)
test to determine whether there was a significant correlation

between forested area and the relative abundance of disturbance-
associated species.
09743
09861
09453
06737
09726
09454
09276
09742
AUC
(test)
Assessing the relative importance of distance and

landscape disturbance
09904
09926
09958
09924
09841
09662
09862
09988
(train)
AUC
Using the independent data set for each species, I built a glm of
species presence/absence based on the habitat suitability predicted
by the MAXENT model and the total area of forest within 500 m
Model building
of the square. I did not include distance to the nearest known

No. records
population in these models because this variable is highly corre-

lated with predicted habitat suitability. I used spline correlograms
144
295
73
15
77
38
5
to check for spatial autocorrelation in the model residuals. If I

detected spatial autocorrelation, I incorporated a spatial autoco-
variate in a revised model (Dormann 2007; Bardos, Guillera-
Arroita & Wintle 2015). For each species, I compared a model
Phegopteris hexagonoptera
Asplenium scolopendrium
Corallorhiza odontorhiza
with intercept only to models with habitat suitability only, forest

Cypripedium arietinum
Vaccinium stamineum
Arisaema dracontium
area only, both habitat suitability and forest area, and both pre-
Chimaphila maculata
dictors as well as an interaction term. I considered a variable to

be significantly related to species presence/absence when the dif-
and specificity
Cornus florida
ference in AIC between a model with intercept only and a model

including the factor was >5, which corresponds to strength of evi-
Species
dence at least 12 times stronger than the alternative model (Burn-

ham, Anderson & Huyvaert 2011).
To examine the importance of distance from the nearest known
2
(a) Asplenium scolopendrium
population, I examined the independent data for grid squares
predicted to be suitable only. In this subset of the data, distance
and habitat suitability are not highly correlated (r ranges from
1
021 to 051). I built logistic models relating species presence/ab-
sence to combinations of habitat suitability, distance and forest
area, plus a spatial autocovariate when necessary.
NMDS2
Logistic models were built in R, using package ‘NCF’ for spline
0
correlograms and ‘SPDEP’ for preparing spatial autocovariates.
−1
Results
STANDARD MODEL TESTS
−2
AUC values >090 generally indicate a very useful model
(Swets 1988). All species except C. odontorhiza had high −2 −1 0 1 2
AUC values based on the subset of records withheld from NMDS1
the training data (Table 2). The proportion of the study
area predicted suitable for each species (MPA) ranged
2
(b) Cornus florida
from 005% to 36%, and generally corresponded to the
prevalence and distribution of the species. For example,
C. maculata (MPA = 008%) had only 15 presence
1
records mainly located in two very small regions within

the study area, whereas C. arietinum (MPA = 36%) has
NMDS2
77 presence records spread across a wide area.

0
Appendix S4 shows a map for each species.

Field surveys led to the discovery of new populations
of four species, with the highest success for A. scolopen-
−1
drium, which was present at three of the ten survey sites

predicted suitable (Table 2). I did not find any target spe- Not suitable, absent
cies in a square that was not predicted suitable. I was able Suitable, absent
to search only four and three squares, respectively, for Suitable, present
−2
C. odontorhiza and V. stamineum, and did not find these

species at any sites. The NHIC held no additional occur- −2 −1 0 1 2
rence records for V. stamineum, and therefore, I was NMDS1
unable to carry out an independent test of the model for
Fig. 2. Non-metric multidimensional scaling (NMDS) ordination
this species beyond the three grid squares I surveyed. of all 51 surveyed grid squares in plant species space based on
For all species, AUC based on the independent data set the Bray–Curtis dissimilarity measure. Stress in three dimensions
was lower than AUC for the data used to build the SDM is 13. (a) Grid squares predicted suitable and unsuitable, with
(Table 2), which is a common finding (e.g. Elith & Burgman and without the species present, for habitat specialist Asplenium
2002). However, all species except for P. hexagonoptera had scolopendrium. (b) Grid squares predicted suitable and unsuitable,
with and without the species present, for habitat generalist
independent AUC values above 07, which is the threshold Cornus florida.
for a ‘useful’ model (Swets 1988). Sensitivity was good, with
a majority of presences being predicted suitable by the associates to be significantly related to the predicted habi-
model, except for C. odontorhiza (14%) and tat suitability for only two species (Appendix S3).
P. hexagonoptera (22%) (Table 2). The majority of inde- The beta diversity of grid squares predicted to be suit-
pendent absences were correctly predicted by the models for able for each species generally matched habitat breadth,
all species except for P. hexagonoptera (Table 2). whereby species with a narrow habitat range had lower
beta diversity among sites predicted to be suitable than did
habitat generalists (Table 1; Fig. 2). The sites where a tar-
PLANT COMMUNITY DATA
get species was found tended to be clustered together in
The relative abundance of associated species was signifi- plant community space, with a lower level of beta diversity
cantly higher in grid squares predicted to be suitable by than all suitable plots as a whole (Fig. 2). This was true
the SDM for five of the eight species, although there was regardless of the habitat breadth of the species. Compar-
a lot of variation for both suitable and unsuitable sites isons between the beta diversity among suitable sites with
(Appendix S3). Logistic models accounting for spatial and without each target species were not always statisti-
autocorrelation showed the relative abundance of cally significant, but in these cases there were few sites with
1876 J. L. McCune
Arisaema dracontium Asplenium scolopendrium Chimaphila maculata Corallorhiza odontorhiza

80 100
60
40
20
Habitat suitability (%)
0
abs pres abs pres abs pres abs pres

Cornus florida Cypripedium arietinum Phegopteris hexagonoptera
80 100
<40-ha forest within 500 m

>40-ha forest within 500 m
60
40
20
0
abs pres abs pres abs pres
Fig. 3. Habitat suitability (based on MAXENT’s cumulative output) versus presence (pres) or absence (abs) of each species in the indepen-
dent data set. The data is indicated in grey, with open dots for sites with <40-ha total forest cover within 500 m and closed dots for sites
with >40-ha total forest cover within 500 m. Black dots indicate the mean habitat suitability for low forest (open dots, dashed lines) and
high forest (closed dots, solid lines). Grey bars indicate the overall mean for both forest categories.
the species present, providing insufficient power to detect a species except for C. odontorhiza and C. arietinum (Fig. 4;
significant difference by permutation tests (Appendix S5). Appendix S6). There was a strong positive influence of
total forest area within 500 m on the probability of
presence for three species: A. dracontium, C. maculata and
THE IMPORTANCE OF LANDSCAPE CONTEXT
P. hexagonoptera (Appendix S6).
The total area of forest within 500 m of a site was negatively
correlated with the relative abundance of disturbance-asso-
Discussion
ciated species (Spearman’s r = 048, P = 000043). For the
independent occurrence data, predicted habitat suitability SDMs based on presence-only records and data on soils,
was included in the best model of species presence/absence geology, topography and climate effectively predicted the
for all species except A. scolopendrium. The other species distributions of seven of these eight rare woodland plants.
showed a positive relationship between predicted habitat While independent test data yielded a low sensitivity
suitability and presence except for P. hexagonoptera, for (14%) for C. odontorhiza, the AUC value was quite high
which habitat suitability was a negative predictor of pres- (Table 2), suggesting that the choice of threshold for suit-
ence (Fig. 3, Appendix S6). The area of forest within 500 m ability was ineffective. Pearson et al. (2007) suggest lower-
was also an important predictor of presence for Chimaphila ing the habitat suitability threshold to 0% omission when
maculata and Corallorhiza odontorhiza – in fact neither spe- the number of records is very low, as in this case. When I
cies was ever present in sites with surrounding forest area did this, the sensitivity based on the independent data set
lower than 40 ha (Fig. 3). There was an interaction between improved to 86%, while the specificity was still good at
habitat suitability and forest area for three species: A. dra- 76%. I used this new threshold for the subsequent model
contium, C. florida and P. hexagonoptera. For the first two, tests for this species.
habitat suitability was a stronger predictor of species pres- Forest surveys based on these SDMs resulted in the dis-
ence in sites with high forest area within 500 m, whereas for covery of new populations of four of the eight species,
P. hexagonoptera, the negative relationship between habitat even though only 51 sites were searched. Although the
suitability and species presence became even stronger in success rate was relatively low – ranging from 12% to
sites with low forest area (Fig. 3). 30% of sites predicted to be suitable – this is a common
For sites above the threshold for habitat suitability, the finding for rare species (MacDougall & Loo 2002; Wil-
degree of habitat suitability was no longer a good predic- liams et al. 2009), and is an improvement over the 0%
tor of species presence for any species. The strongest pre- discovery rate at sites where the habitat was not predicted
dictor of species presence at suitable sites was the distance suitable. Sites predicted to be suitable tended to have a
from the nearest known occurrence of the species, with higher relative abundance of common associates of the
sites closer to known records having a much higher prob- target rare plant. However, there was a large amount of
ability of the species being present. This was true for all variation between sites. This is to be expected, especially
Arisaema dracontium Asplenium scolopendrium Chimaphila maculata Corallorhiza odontorhiza
60 65 70 75 80 85
90
80
70
70
70
50
50
Habitat suitability (%)
60
30
30
0 5 10 15 20 25 30 0 5 10 15 20 25 30 0 5 10 15 0 5 10 15 20 25 30
Cornus florida Cypripedium arietinum Phegopteris hexagonoptera

100
Species absent
90
90
Species present
80
70
70
60
50
50
40
30
20
30
0 5 10 15 0 10 20 30 40 0 5 10 15 20 25 30
Distance to nearest record (km)
Fig. 4. Habitat suitability (based on MAXENT’s cumulative output) versus distance to the nearest record used to build the SDM for the
independent data set, showing sites with habitat suitability above the suitability threshold only. Distance is negatively related to species
presence for all species except C. odontorhiza and C. arietinum. Note that the scale of axes may differ by species.
for species with a wide habitat breadth like C. florida, model would likely improve predictive power for
which has a wide range of common species associated A. scolopendrium and perhaps for other species as well. In
with it in different contexts. However, this variation in the case of C. maculata, unoccupied suitable sites tended to
the relative abundance of associates may also reflect habi- have higher frequency and abundance of the shrubs Lin-
tat characteristics that are not well represented by the pre- dera benzoin and Sambucus racemosa. Both flourish in gaps
dictors used in the SDMs. Factors like historical grazing, caused by canopy disturbance (Cipollini, Wallace-Senft &
successional stage and canopy type that are not accounted Whigham 1994; Reznicek, Voss & Walters 2011). Data on
for in the models are likely reflected in this high variabil- disturbance history could therefore improve SDMs for
ity of associate abundance. C. maculata, as it has for rare plants in other regions
Plant community diversity was a good representative of (Bourg, McShea & Gill 2005; Gogol-Prokurat 2011). How-
habitat breadth, with the beta diversity of suitable sites gen- ever, information on the frequency or intensity of canopy
erally corresponding to the habitat breadth of each species. disturbance is often not readily available as broadscale
Habitat specialists did not have qualitatively more useful geospatial data, and is probably best incorporated as a fil-
SDMs than species with wider habitat breadth. Similarly, ter after SDM building, during site selection in the field.
range-edge species were not more effectively modelled than It is critical to assess whether predicted habitat suitability
species with most of their range in southern Ontario. Elith is actually proportional to the probability of a species’ pres-
& Burgman (2002) found similar results for eight rare plants ence when using SDMs in conservation-related applications
in Australia, for which SDM utility was not associated with (Gogol-Prokurat 2011; Merow, Smith & Silander 2013). In
the species’ level of rarity or range within the study region. this study, habitat suitability as predicted by MAXENT’s
The higher community similarity of sites where a species cumulative output was positively related to the probability
was found compared to all suitable sites suggests that there of species presence for all species except A. scolopendrium
was an important environmental factor or factors not and P. hexagonoptera. For A. scolopendrium, this may be
accounted for by the variables included in the SDM. These an artefact of correcting for spatial autocorrelation.
are probably factors related to canopy cover, successional Because this fern specializes on the spatially limited lime-
state or disturbance history. An indicator species analysis stone substrate of the Niagara escarpment, both habitat
of suitable sites with and without the target species sup- suitability and occurrence records tend to be strongly spa-
ports this theory for two of the target species tially correlated. By ‘correcting’ for this, the relationship
(Appendix S1). Suitable sites for A. scolopendrium that between the two may have been obscured (e.g. Dormann
lacked the species tended to include the conifer Thuja occi- 2007). Indeed, when I built a glm without the spatial auto-
dentalis. Asplenium scolopendrium is reportedly never found covariate, habitat suitability was an important predictor of
beneath conifers (Cinquemani Kuehn & Leopold 1992). the species’ presence (results not shown).
Other studies have found vegetation type to be an impor- For P. hexagonoptera, predicted habitat suitability was a
tant predictor of habitat suitability for some rare plants negative predictor of the species’ presence (Fig. 3). While
(e.g. Bourg, McShea & Gill 2005; Williams et al. 2009; the AUC in the model building stage was very high, the
Gogol-Prokurat 2011). Incorporating canopy type into the model performed very poorly for the independent data set
1878 J. L. McCune
(Table 2). There are a number of possible explanations.

Conclusions
First, perhaps the model failed to accurately characterize
the species’ habitat requirements because important envi- The conservation of rare species relies first on accurate
ronmental factors were missing. Second, the species may knowledge of their current distribution and status. This
have been missed during site surveys. However, this fern is study has shown that even in fragmented, human-domi-
a perennial with distinctive fronds which emerge in late nated landscapes, SDMs based on climatic and edaphic
May and persist until autumn, and is unlikely to be missed factors can be effective tools to prioritize areas for field
in careful surveys, and there were also many errors of the surveys. In addition, by ground-testing SDM predictions
opposite kind, where the species was present in areas pre- in conjunction with plant community and landscape con-
dicted to be unsuitable. Finally, it could be that text data, conservation biologists can determine which
P. hexagonoptera is not currently in equilibrium with the environmental variables could improve SDM accuracy,
environment because of population fluctuations resulting and which rare species may be most affected by landscape
from high levels of human disturbance and/or stochastic context and/or dispersal limitation.
extirpations of small populations and temporary survival
of ‘sink’ populations in unsuitable areas (MacDougall &
Acknowledgements
Loo 2002; Le Lay et al. 2010). This is the most likely expla-
nation for the poor model performance. Of 71 total occur- K. Tisshaw and M. Strub were excellent field assistants. M. Oldham
rences of this species in Ontario, only 21 have been advised me on the selection of plant species and assisted with plant identi-
fication. C. Lacroix also helped with species identifications. The OMNRF
confirmed in the last 20 years, and searches of known loca- NHIC provided location data for rare plant populations. The Canadian
tions have frequently been negative (van Overbeeke, Jalava Forest Service provided climate data. The Bruce Trail Association, the
& Donley 2013). The inability of the SDM to predict pres- Grand River Conservation Authority, the Hamilton Naturalists Club, the
Long Point Basin Land Trust and 36 private landowners gave permission
ence and absence of this species may be a further indication to access land. A. MacDougall and two anonymous reviewers provided
that it is on its way to extirpation in southern Ontario. helpful comments on the manuscript. This research was funded by the
In addition to habitat suitability based on environmen- Liber Ero fellowship Programme.
tal predictors, the amount of forest cover within 500 m of

a site is an important determinant of rare woodland plant
species presence in this landscape. The area of forest Data accessibility
within 500 m was negatively correlated with the relative Species occurrence records: available from the Natural Heritage
abundance of disturbance-associated species in the 51 sur- Information Centre of Ontario (nhicrequests@ontario.ca). Envi-
veyed sites, suggesting that rare plants are more likely to ronmental data (topography, geology, soils, climate): see
occur in areas of lower disturbance. However, the amount Appendix S2. Plant community and landscape data for the 51
of forest in the vicinity of a site also indicates degree of surveyed sites: see Appendices S7 and S8. Independent occurrence
isolation from other potentially suitable patches, and it is records with associated landscape data: see Appendices S9 and
S10.
difficult to tease apart the relative influence of disturbance
and fragmentation, per se. Nevertheless, there is a higher
likelihood of finding these rare plants in areas with more References
intact forest, and these areas should be prioritized for Anderson, M.J., Ellingsen, K.E. & McArdle, B.H. (2006) Multivariate dis-
field surveys. Boetsch, Van Manen & Clark (2003) found persion as a measure of beta diversity. Ecology Letters, 9, 683–693.
a very similar pattern for Cardamine clematitis, which was Bardos, D.C., Guillera-Arroita, G. & Wintle, B.A. (2015) Valid auto-
models for spatially autocorrelated occupancy and abundance
absent in suitable patches <40 ha in size. data. Methods in Ecology and Evolution, 6, 1137–1149.
For sites predicted to have suitable habitat, the degree Blaney, S. & Mazerolle, D. (2007) Nova Scotia Provincial Status Report on
of habitat suitability was not an important predictor of Ram’s-Head Lady’s Slipper. Atlantic Canada Conservation Data Centre,
Sackville, NB.
presence for any species. Once over the threshold, a site Boetsch, J.R., Van Manen, F.K. & Clark, J.D. (2003) Predicting rare plant
with cumulative output of 50% is just as likely to contain occurrence in Great Smoky Mountains National Park, USA. Natural
the species as one with 100%. For this subset of sites, the Areas Journal, 23, 229–237.
Bourg, N.A., McShea, W.J. & Gill, D.E. (2005) Putting a cart before the
most important predictor of species presence was the dis- search: successful habitat prediction for a rare forest herb. Ecology, 86,
tance to the nearest known population, independent of 2793–2804.
relative habitat suitability and spatial autocorrelation. Brzeskiewicz, M. (2000) Conservation Assessment for Ram’s Head Lady
Slipper (Cypripedium arietinum). USDA Forest Service, Eastern
Interestingly, this was true for all species except C. ariet- Region.
inum and C. odontorhiza, both of which are orchids with Burnham, K.P., Anderson, D.R. & Huyvaert, K.P. (2011) AIC model
tiny, windblown seeds. However, C. maculata and the two selection and multimodel inference in behavioral ecology: some back-
ground, observations, and comparisons. Behavioral Ecology and Sociobi-
fern species also have extremely light and small propag- ology, 65, 23–35.
ules, so the importance of isolation from other popula- Cinquemani Kuehn, D.M. & Leopold, D.J. (1992) Long-term demography
tions is not entirely predictable based on propagule size. of Phyllitis scolopendrium (L.) Newm. var. americana Fern. in central
New York. Bulletin Torrey Botanical Club, 119, 65–76.
Nevertheless, this evidence supports the hypothesis that Cipollini, M.L., Wallace-Senft, D.A. & Whigham, D.F. (1994) A model of
dispersal limitation plays a strong role in determining the patch dynamics, seed dispersal, and sex ration in the dioecious shrub
distributions of some rare plants on this landscape. Lindera benzoin (Lauraceae). Journal of Ecology, 82, 621–633.
Clarke, K.R. & Gorley, R.N. (2015) Primer v7 Permanova+. Primer-E Merow, C., Smith, M.J. & Silander, J.A. (2013) A practical guide to Max-
Ltd., Plymouth, UK. Ent for modeling species’ distributions: what it does, and why inputs
Crins, W.J., Gray, P.A., Uhlig, P.W.C. & Wester, M.C. (2009) The and settings matter. Ecography, 36, 1058–1069.
Ecosystems of Ontario, Part I: Ecozones and Ecoregions. Ontario Min- Oldham, M.J. & Brinker, S.R. (2009) Rare Vascular Plants of Ontario, 4th
istry of Natural Resources, Peterborough, ON. edn. Natural Heritage Information Centre, Ontario Ministry of Natural
Dormann, C.F. (2007) Effects of incorporating spatial autocorrelation into Resources. Peterborough, ON.
the analysis of species distribution data. Global Ecology and Biogeogra- van Overbeeke, J.C., Jalava, J.V. & Donley, R.H. (2013) Management
phy, 16, 129–138. Plan for the Broad Beech Fern (Phegopteris hexagonoptera) in Ontario.
Elith, J. & Burgman, M. (2002) Predictions and their validation: rare Ontario Management Plan Series. Prepared for the Ontario Ministry of
plants in the Central Highlands, Victoria, Australia. Predicting Species Natural Resources, Peterborough, ON.
Occurrences: Issues of Accuracy and Scale (eds J.M. Scott, P.J. Heglund, Pearson, R.G., Raxworthy, C.J., Nakamura, M. & Peterson, A.T. (2007)
M.L. Morrison, J.B. Haufler, M.G. Raphael, W.A. Wall & F.B. Sam- Predicting species distributions from small numbers of occurrence
son), pp. 303–313. Island Press, Washington, DC. records: a test case using cryptic geckos in Madagascar. Journal of Bio-
Elith, J., Phillips, S.J., Hastie, T., Dudik, M., Chee, Y.E. & Yates, C.J. geography, 34, 102–117.
(2011) A statistical explanation of MaxEnt for ecologists. Diversity and Phillips, S.J., Anderson, R.P. & Schapire, R.E. (2006) Maximum entropy
Distributions, 17, 43–57. modeling of species geographic distributions. Ecological Modelling, 190,
Engler, R., Guisan, A. & Rechsteiner, R. (2004) An improved approach for 231–259.
predicting the distribution of rare and endangered species from occur- Reznicek, A.A., Voss, E.G. & Walters, B.S. (2011). Michigan Flora
rence and pseudo-absence data. Journal of Applied Ecology, 41, 263–274. Online. University of Michigan. Web. 3-16-2015. http://www.michigan-
Fielding, A.H. & Bell, J.F. (1997) A review of methods for the assessment flora.net/home.aspx.
of prediction errors in conservation presence/absence models. Environ- Rogers, D.A., Rooney, T.P., Hawbaker, T.J., Radeloff, V.C. & Waller,
mental Conservation, 24, 38–49. D.M. (2009) Paying the extinction debt in Southern Wisconsin forest
Folt^ete, J.-C., Clauzel, C., Vuidel, G. & Tournant, P. (2012) Integrating understories. Conservation Biology, 23, 1497–1506.
graph-based connectivity metrics into species distribution models. Land- Swets, J.A. (1988) Measuring the accuracy of diagnostic systems. Science,
scape Ecology, 27, 557–569. 240, 1285–1293.
Franklin, J. (2010) Moving beyond static species distribution models in Vallet, J., Daniel, H., Beaujouan, V., Roze, F. & Pavoine, S. (2010) Using
support of conservation biogeography. Diversity and Distributions, 16, biological traits to assess how urbanization filters plant species of small
321–330. woodlands. Applied Vegetation Science, 13, 412–424.
Franklin, J. (2013) Species distribution models in conservation biogeography: Williams, J.N., Seo, C., Thorne, J., Nelson, J.K., Erwin, S., O’Brien, J.M. &
developments and challenges. Diversity and Distributions, 19, 1217–1223. Schwartz, M.W. (2009) Using species distribution models to predict new
Gogol-Prokurat, M. (2011) Predicting habitat suitability for rare plants at occurrences for rare plants. Diversity and Distributions, 15, 565–576.
local spatial scales using a species distribution model. Ecological Appli- Wisz, M.S., Hijmans, R.J., Li, J., Peterson, A.T., Graham, C.H.,
cations, 21, 33–47. Guisan, A. & Distribut, N.P.S. (2008) Effects of sample size on the
Government of Ontario (2000) Ontario Population Projections, 1999–2028. performance of species distribution models. Diversity and Distribu-
Queen’s Printer for Ontario, Toronto, ON. tions, 14, 763–773.
Guisan, A. & Zimmermann, N.E. (2000) Predictive habitat distribution
models in ecology. Ecological Modelling, 135, 147–186. Received 14 February 2016; accepted 16 May 2016
Guisan, A., Broennimann, O., Engler, R., Vust, M., Yoccoz, N.G., Leh- Handling Editor: Christopher Baraloto
mann, A. & Zimmerman, N.E. (2006) Using niche-based models to
improve the sampling of rare species. Conservation Biology, 20, 501–511.
Guisan, A., Tingley, R., Baumgartner, J.B., Naujokaitis-Lewis, I., Sut- Supporting Information
cliffe, P.R., Tulloch, A.I.T. et al. (2013) Predicting species distributions
for conservation decisions. Ecology Letters, 16, 1424–1435. Additional Supporting Information may be found in the online version
Hernandez, P.A., Graham, C.H., Master, L.L. & Albert, D.L. (2006) The
of this article.
effect of sample size and species characteristics on performance of dif-
ferent species distribution modeling methods. Ecography, 29, 773–785.
Kharouba, H.M., McCune, J.L., Thuiller, W. & Huntley, B. (2013) Do Appendix S1. Species descriptions, associated species, and indica-
ecological differences between taxonomic groups influence the relation- tor species analysis results.
ship between species’ distributions and climate? A global meta-analysis
using species distribution models. Ecography, 36, 657–664.
Appendix S2. Details of MAXENT modelling, including environ-
Larson, B.M., Riley, J.L., Snell, E.A. & Godschalk, H.G. (1999) The
Woodland Heritage of Southern Ontario. Federation of Ontario Natural- mental data sources.
ists, Don Mills, ON.
Le Lay, G., Engler, R., Franc, E. & Guisan, A. (2010) Prospective sam- Appendix S3. Associated species model methods and results.
pling based on model ensembles improves the detection of rare species.
Ecography, 33, 1015–1027.
Lovett-Doust, J., Biernacki, M., Page, R., Chan, M., Natgunarajah, R. & Appendix S4. Maps for each species showing the results of the
Timis, G. (2003) Effects of land ownership and landscape-level factors MAXENT model.
on rare-species richness in natural areas of southern Ontario, Canada.
Landscape Ecology, 18, 621–633. Appendix S5. PERMDISP results.
Luoto, M., Poyry, J., Heikkinen, R.K. & Saarinen, K. (2005) Uncertainty
of bioclimate envelope models based on the geographical distribution of
species. Global Ecology and Biogeography, 14, 575–584. Appendix S6. glm results.
MacDougall, A.S. & Loo, J.A. (2002) Predicting occurrences of geographi-
cally restricted rare floral elements with qualitative habitat data. Envi- Appendix S7. Plant community and landscape data for the 51
ronmental Reviews, 10, 167–190. surveyed sites.
McCune, B. & Grace, B. (2002) Analysis of Ecological Communities. MjM
Software Design, Gleneden Beach, OR.
McCune, J.L. & Vellend, M. (2015) Using plant traits to predict the sensi- Appendix S8. Metadata for Appendix S7.
tivity of colonizations and extirpations to landscape context. Oecologia,
178, 511–524. Appendix S9. All independent occurrence records with associated
McCune, J.L., Harrower, W.L., Avery-Gomm, S., Brogan, J.M., Czergo,
landscape data.
A.-M., Davidson, L.N.K. et al. (2013) Threats to Canadian species at
risk: an analysis of finalized Recovery Strategies. Biological Conserva-
tion, 166, 254–265. Appendix S10. Metadata for Appendix S9.

Species Distribution Models Predict Rare Species Occurrences Despite Significant Effects of Landscape Context

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Species Distribution Models Predict Rare Species Occurrences Despite Significant Effects of Landscape Context

Uploaded by

Copyright:

Available Formats

Journal of Applied Ecology 2016, 53, 1871–1879 doi: 10.1111/1365-2664.

Species distribution models predict rare species

populations are likely present on privately owned land

& Zimmermann 2000). Hundreds of papers involving

Materials and methods BUILDING MODELS

I built SDMs for each species using MAXENT (Phillips, Anderson

Beta diversity of plots

Arisaema dracontium SC S3 <5 Medium (floodplain forests) 4263

inclusion, and given the relatively low number of records available,

in the field surveys to an ordinal scale from 1 (very rare) to 5

(Anderson, Ellingsen & McArdle 2006). I tested for significant

(Clarke & Gorley 2015). I also compared the beta diversity

I compared the relative abundance of associated species in

squares that were predicted suitable versus unsuitable using Wil-

(glm) with a binomial link to model suitable versus unsuitable

sites based on relative abundance of associates (see

I also calculated the relative abundance of disturbance-asso-

ciated species in each surveyed square. I designated a species as

trails’, ‘fencerows’, ‘powerline clearings’, ‘ditches and swales’ or

‘meadows and pastures’. Disturbance-associated species tend to

each surveyed plot, and then used a Spearman rank correlation

test to determine whether there was a significant correlation

Assessing the relative importance of distance and

of the square. I did not include distance to the nearest known

population in these models because this variable is highly corre-

to check for spatial autocorrelation in the model residuals. If I

with intercept only to models with habitat suitability only, forest

dictors as well as an interaction term. I considered a variable to

ference in AIC between a model with intercept only and a model

dence at least 12 times stronger than the alternative model (Burn-

To examine the importance of distance from the nearest known

STANDARD MODEL TESTS

records mainly located in two very small regions within

77 presence records spread across a wide area.

Appendix S4 shows a map for each species.

drium, which was present at three of the ten survey sites

C. odontorhiza and V. stamineum, and did not find these

Arisaema dracontium Asplenium scolopendrium Chimaphila maculata Corallorhiza odontorhiza

abs pres abs pres abs pres abs pres

<40-ha forest within 500 m

abs pres abs pres abs pres

Arisaema dracontium Asplenium scolopendrium Chimaphila maculata Corallorhiza odontorhiza

Cornus florida Cypripedium arietinum Phegopteris hexagonoptera

(Table 2). There are a number of possible explanations.

tal predictors, the amount of forest cover within 500 m of

You might also like