Professional Documents
Culture Documents
Species Distribution Models Predict Rare Species Occurrences Despite Significant Effects of Landscape Context
Species Distribution Models Predict Rare Species Occurrences Despite Significant Effects of Landscape Context
12702
Summary
1. The true status of many endangered plants is uncertain because the locations of all extant
populations are not known. Species distribution models (SDMs) can direct searches for addi-
tional populations, but habitat fragmentation may influence the distribution of rare species
more than climatic or edaphic factors in human-dominated landscapes. In this study, I test
the ability of SDMs to predict rare plant occurrences in a fragmented landscape and the
importance of predicted habitat suitability versus landscape context.
2. I built SDMs for eight rare woodland plants and assessed them using an independent data
set including plant community surveys of 51 sites. I used community data to determine
whether SDMs predict the right habitat type even when the target rare species was absent. I
then modelled rare species presence based on predicted habitat suitability, distance to the
nearest known population and the amount of forest habitat within 500 m of the site.
3. SDMs were effective for seven of the eight species, with the degree of predicted habitat
suitability positively related to species’ occurrence. I found new populations of four of the
eight species. However, the amount of forest habitat available in the vicinity of a plot was
also a positive predictor of rare plant occurrences. Among sites predicted to be suitable, the
distance to the nearest known population was the strongest predictor of rare plant occur-
rence.
4. Synthesis and applications. Species distribution models (SDMs) can effectively target
searches for populations of rare species even in human-dominated landscapes. Surveying the
plant community at sites predicted to be suitable can help to improve the SDM. SDMs used
in conjunction with data on landscape context can maximize the efficiency of searches for rare
species and show which species are restricted by dispersal limitation and habitat
fragmentation in addition to edaphic and climatic factors.
Key-words: associated species, dispersal limitation, disturbance, forest, habitat fragmenta-
tion, human-dominated landscapes, MAXENT, rare plants, species distribution model,
woodland
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society
1872 J. L. McCune
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
Predicting rare species occurrences 1873
Table 1. Range characteristics and habitat preferences of the eight plant species. Status designations are special concern (SC), threatened
(THR) and endangered (END). Species without a status category have not yet been assessed. See text for S rank definitions. The final
column indicates the beta diversity of surveyed plots predicted to be suitable for each species, as estimated by the average Bray–Curtis
distance to the centroid of that group of plots. The beta diversity of all 51 plots was 4615
geology and climatic factors (see Appendix S2), and converted of 2014, I surveyed 51 grid squares (Fig. 1), with the goal of
each variable to a 100 9 100 m grain size. I chose a set of 14 maximizing the variation in the range of habitat suitability pre-
predictors that were minimally intercorrelated. I obtained georef- dicted by the model, and distance from the nearest record of the
erenced observation records for all eight target species from the species. At each site, we used a global positioning system to
Ontario Ministry of Natural Resources and Forestry (OMNRF) locate the centre of the grid square. We thoroughly searched each
Natural Heritage Information Centre (NHIC). The records include square, with search times ranging from 3 to 6 person hours
herbarium records, opportunistic sightings and purposeful depending on the vegetation and terrain. We recorded all vascu-
searches by biologists, and are represented by polygons which indi- lar plant species present, and estimated the abundance of each
cate the spatial certainty of the observation. I eliminated all using a coarse 5-category scale.
records with an area of >100 m radius; 91% of the records were
from 1980 or later, with only one record from prior to 1958. High
spatial accuracy of records was the most important criterion for STATISTICAL ANALYSES
FIELD SAMPLING
Plant community and landscape analyses
I randomly selected 100 forested grid squares predicted to be suit-
able for each species, stratified by the distance to the nearest To characterize the plant community composition of the sur-
observation used to build the model. In the spring and summer veyed grid squares, I converted the coarse abundance scale used
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
1874 J. L. McCune
Specificity
model. Model threshold is the threshold of habitat suitability (MAXENT’s cumulative output) above which a grid square is predicted to be suitable. See text for explanations of AUC, sensitivity
Table 2. Summary of species distribution model (SDM) characteristics and evaluation measures. MPA (minimal predicted area) is the percentage of the study area predicted suitable by the best
(dominant) and ordinated all squares using non-metric multidi-
(%)
mensional scaling (NMDS) based on Bray–Curtis similarity
62
86
82
92
62
76
49
94
(McCune & Grace 2002), excluding the target species. If the
SDMs were useful, I expected that species with narrow habitat
preferences would have lower beta diversity among plots pre-
Sensitivity dicted suitable compared to species with wide habitat tolerances.
(%) To test this, I calculated the multivariate dispersion (average
NA
74
60
78
14
68
91
22
community similarity) for the 51 sites as a whole, for the group
of sites predicted to be suitable for each target species where the
species was absent, and for the group of sites predicted to be
07594
08908
08707
08739
07700
08908
04172
NA
suitable for each target species in which the species was found
AUC
64
62
51
142
131
Appendix S3).
Success
0/3
2/17
3/10
2/11
5/20
0/10
0/24
2988
2609
5995
2795
2778
6383
705
212
081
047
008
027
122
360
212
005
Using the independent data set for each species, I built a glm of
species presence/absence based on the habitat suitability predicted
by the MAXENT model and the total area of forest within 500 m
Model building
295
73
15
77
38
5
Corallorhiza odontorhiza
Vaccinium stamineum
Arisaema dracontium
area only, both habitat suitability and forest area, and both pre-
Chimaphila maculata
Cornus florida
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
Predicting rare species occurrences 1875
2
(a) Asplenium scolopendrium
population, I examined the independent data for grid squares
predicted to be suitable only. In this subset of the data, distance
and habitat suitability are not highly correlated (r ranges from
1
021 to 051). I built logistic models relating species presence/ab-
sence to combinations of habitat suitability, distance and forest
area, plus a spatial autocovariate when necessary.
NMDS2
Logistic models were built in R, using package ‘NCF’ for spline
0
correlograms and ‘SPDEP’ for preparing spatial autocovariates.
−1
Results
−2
AUC values >090 generally indicate a very useful model
(Swets 1988). All species except C. odontorhiza had high −2 −1 0 1 2
AUC values based on the subset of records withheld from NMDS1
the training data (Table 2). The proportion of the study
area predicted suitable for each species (MPA) ranged
2
(b) Cornus florida
from 005% to 36%, and generally corresponded to the
prevalence and distribution of the species. For example,
C. maculata (MPA = 008%) had only 15 presence
1
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
1876 J. L. McCune
Fig. 3. Habitat suitability (based on MAXENT’s cumulative output) versus presence (pres) or absence (abs) of each species in the indepen-
dent data set. The data is indicated in grey, with open dots for sites with <40-ha total forest cover within 500 m and closed dots for sites
with >40-ha total forest cover within 500 m. Black dots indicate the mean habitat suitability for low forest (open dots, dashed lines) and
high forest (closed dots, solid lines). Grey bars indicate the overall mean for both forest categories.
the species present, providing insufficient power to detect a species except for C. odontorhiza and C. arietinum (Fig. 4;
significant difference by permutation tests (Appendix S5). Appendix S6). There was a strong positive influence of
total forest area within 500 m on the probability of
presence for three species: A. dracontium, C. maculata and
THE IMPORTANCE OF LANDSCAPE CONTEXT
P. hexagonoptera (Appendix S6).
The total area of forest within 500 m of a site was negatively
correlated with the relative abundance of disturbance-asso-
Discussion
ciated species (Spearman’s r = 048, P = 000043). For the
independent occurrence data, predicted habitat suitability SDMs based on presence-only records and data on soils,
was included in the best model of species presence/absence geology, topography and climate effectively predicted the
for all species except A. scolopendrium. The other species distributions of seven of these eight rare woodland plants.
showed a positive relationship between predicted habitat While independent test data yielded a low sensitivity
suitability and presence except for P. hexagonoptera, for (14%) for C. odontorhiza, the AUC value was quite high
which habitat suitability was a negative predictor of pres- (Table 2), suggesting that the choice of threshold for suit-
ence (Fig. 3, Appendix S6). The area of forest within 500 m ability was ineffective. Pearson et al. (2007) suggest lower-
was also an important predictor of presence for Chimaphila ing the habitat suitability threshold to 0% omission when
maculata and Corallorhiza odontorhiza – in fact neither spe- the number of records is very low, as in this case. When I
cies was ever present in sites with surrounding forest area did this, the sensitivity based on the independent data set
lower than 40 ha (Fig. 3). There was an interaction between improved to 86%, while the specificity was still good at
habitat suitability and forest area for three species: A. dra- 76%. I used this new threshold for the subsequent model
contium, C. florida and P. hexagonoptera. For the first two, tests for this species.
habitat suitability was a stronger predictor of species pres- Forest surveys based on these SDMs resulted in the dis-
ence in sites with high forest area within 500 m, whereas for covery of new populations of four of the eight species,
P. hexagonoptera, the negative relationship between habitat even though only 51 sites were searched. Although the
suitability and species presence became even stronger in success rate was relatively low – ranging from 12% to
sites with low forest area (Fig. 3). 30% of sites predicted to be suitable – this is a common
For sites above the threshold for habitat suitability, the finding for rare species (MacDougall & Loo 2002; Wil-
degree of habitat suitability was no longer a good predic- liams et al. 2009), and is an improvement over the 0%
tor of species presence for any species. The strongest pre- discovery rate at sites where the habitat was not predicted
dictor of species presence at suitable sites was the distance suitable. Sites predicted to be suitable tended to have a
from the nearest known occurrence of the species, with higher relative abundance of common associates of the
sites closer to known records having a much higher prob- target rare plant. However, there was a large amount of
ability of the species being present. This was true for all variation between sites. This is to be expected, especially
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
Predicting rare species occurrences 1877
60 65 70 75 80 85
90
80
70
70
70
50
50
Habitat suitability (%)
60
30
30
0 5 10 15 20 25 30 0 5 10 15 20 25 30 0 5 10 15 0 5 10 15 20 25 30
Species absent
90
90
Species present
80
70
70
60
50
50
40
30
20
30
0 5 10 15 0 10 20 30 40 0 5 10 15 20 25 30
Distance to nearest record (km)
Fig. 4. Habitat suitability (based on MAXENT’s cumulative output) versus distance to the nearest record used to build the SDM for the
independent data set, showing sites with habitat suitability above the suitability threshold only. Distance is negatively related to species
presence for all species except C. odontorhiza and C. arietinum. Note that the scale of axes may differ by species.
for species with a wide habitat breadth like C. florida, model would likely improve predictive power for
which has a wide range of common species associated A. scolopendrium and perhaps for other species as well. In
with it in different contexts. However, this variation in the case of C. maculata, unoccupied suitable sites tended to
the relative abundance of associates may also reflect habi- have higher frequency and abundance of the shrubs Lin-
tat characteristics that are not well represented by the pre- dera benzoin and Sambucus racemosa. Both flourish in gaps
dictors used in the SDMs. Factors like historical grazing, caused by canopy disturbance (Cipollini, Wallace-Senft &
successional stage and canopy type that are not accounted Whigham 1994; Reznicek, Voss & Walters 2011). Data on
for in the models are likely reflected in this high variabil- disturbance history could therefore improve SDMs for
ity of associate abundance. C. maculata, as it has for rare plants in other regions
Plant community diversity was a good representative of (Bourg, McShea & Gill 2005; Gogol-Prokurat 2011). How-
habitat breadth, with the beta diversity of suitable sites gen- ever, information on the frequency or intensity of canopy
erally corresponding to the habitat breadth of each species. disturbance is often not readily available as broadscale
Habitat specialists did not have qualitatively more useful geospatial data, and is probably best incorporated as a fil-
SDMs than species with wider habitat breadth. Similarly, ter after SDM building, during site selection in the field.
range-edge species were not more effectively modelled than It is critical to assess whether predicted habitat suitability
species with most of their range in southern Ontario. Elith is actually proportional to the probability of a species’ pres-
& Burgman (2002) found similar results for eight rare plants ence when using SDMs in conservation-related applications
in Australia, for which SDM utility was not associated with (Gogol-Prokurat 2011; Merow, Smith & Silander 2013). In
the species’ level of rarity or range within the study region. this study, habitat suitability as predicted by MAXENT’s
The higher community similarity of sites where a species cumulative output was positively related to the probability
was found compared to all suitable sites suggests that there of species presence for all species except A. scolopendrium
was an important environmental factor or factors not and P. hexagonoptera. For A. scolopendrium, this may be
accounted for by the variables included in the SDM. These an artefact of correcting for spatial autocorrelation.
are probably factors related to canopy cover, successional Because this fern specializes on the spatially limited lime-
state or disturbance history. An indicator species analysis stone substrate of the Niagara escarpment, both habitat
of suitable sites with and without the target species sup- suitability and occurrence records tend to be strongly spa-
ports this theory for two of the target species tially correlated. By ‘correcting’ for this, the relationship
(Appendix S1). Suitable sites for A. scolopendrium that between the two may have been obscured (e.g. Dormann
lacked the species tended to include the conifer Thuja occi- 2007). Indeed, when I built a glm without the spatial auto-
dentalis. Asplenium scolopendrium is reportedly never found covariate, habitat suitability was an important predictor of
beneath conifers (Cinquemani Kuehn & Leopold 1992). the species’ presence (results not shown).
Other studies have found vegetation type to be an impor- For P. hexagonoptera, predicted habitat suitability was a
tant predictor of habitat suitability for some rare plants negative predictor of the species’ presence (Fig. 3). While
(e.g. Bourg, McShea & Gill 2005; Williams et al. 2009; the AUC in the model building stage was very high, the
Gogol-Prokurat 2011). Incorporating canopy type into the model performed very poorly for the independent data set
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
1878 J. L. McCune
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879
Predicting rare species occurrences 1879
Clarke, K.R. & Gorley, R.N. (2015) Primer v7 Permanova+. Primer-E Merow, C., Smith, M.J. & Silander, J.A. (2013) A practical guide to Max-
Ltd., Plymouth, UK. Ent for modeling species’ distributions: what it does, and why inputs
Crins, W.J., Gray, P.A., Uhlig, P.W.C. & Wester, M.C. (2009) The and settings matter. Ecography, 36, 1058–1069.
Ecosystems of Ontario, Part I: Ecozones and Ecoregions. Ontario Min- Oldham, M.J. & Brinker, S.R. (2009) Rare Vascular Plants of Ontario, 4th
istry of Natural Resources, Peterborough, ON. edn. Natural Heritage Information Centre, Ontario Ministry of Natural
Dormann, C.F. (2007) Effects of incorporating spatial autocorrelation into Resources. Peterborough, ON.
the analysis of species distribution data. Global Ecology and Biogeogra- van Overbeeke, J.C., Jalava, J.V. & Donley, R.H. (2013) Management
phy, 16, 129–138. Plan for the Broad Beech Fern (Phegopteris hexagonoptera) in Ontario.
Elith, J. & Burgman, M. (2002) Predictions and their validation: rare Ontario Management Plan Series. Prepared for the Ontario Ministry of
plants in the Central Highlands, Victoria, Australia. Predicting Species Natural Resources, Peterborough, ON.
Occurrences: Issues of Accuracy and Scale (eds J.M. Scott, P.J. Heglund, Pearson, R.G., Raxworthy, C.J., Nakamura, M. & Peterson, A.T. (2007)
M.L. Morrison, J.B. Haufler, M.G. Raphael, W.A. Wall & F.B. Sam- Predicting species distributions from small numbers of occurrence
son), pp. 303–313. Island Press, Washington, DC. records: a test case using cryptic geckos in Madagascar. Journal of Bio-
Elith, J., Phillips, S.J., Hastie, T., Dudik, M., Chee, Y.E. & Yates, C.J. geography, 34, 102–117.
(2011) A statistical explanation of MaxEnt for ecologists. Diversity and Phillips, S.J., Anderson, R.P. & Schapire, R.E. (2006) Maximum entropy
Distributions, 17, 43–57. modeling of species geographic distributions. Ecological Modelling, 190,
Engler, R., Guisan, A. & Rechsteiner, R. (2004) An improved approach for 231–259.
predicting the distribution of rare and endangered species from occur- Reznicek, A.A., Voss, E.G. & Walters, B.S. (2011). Michigan Flora
rence and pseudo-absence data. Journal of Applied Ecology, 41, 263–274. Online. University of Michigan. Web. 3-16-2015. http://www.michigan-
Fielding, A.H. & Bell, J.F. (1997) A review of methods for the assessment flora.net/home.aspx.
of prediction errors in conservation presence/absence models. Environ- Rogers, D.A., Rooney, T.P., Hawbaker, T.J., Radeloff, V.C. & Waller,
mental Conservation, 24, 38–49. D.M. (2009) Paying the extinction debt in Southern Wisconsin forest
Folt^ete, J.-C., Clauzel, C., Vuidel, G. & Tournant, P. (2012) Integrating understories. Conservation Biology, 23, 1497–1506.
graph-based connectivity metrics into species distribution models. Land- Swets, J.A. (1988) Measuring the accuracy of diagnostic systems. Science,
scape Ecology, 27, 557–569. 240, 1285–1293.
Franklin, J. (2010) Moving beyond static species distribution models in Vallet, J., Daniel, H., Beaujouan, V., Roze, F. & Pavoine, S. (2010) Using
support of conservation biogeography. Diversity and Distributions, 16, biological traits to assess how urbanization filters plant species of small
321–330. woodlands. Applied Vegetation Science, 13, 412–424.
Franklin, J. (2013) Species distribution models in conservation biogeography: Williams, J.N., Seo, C., Thorne, J., Nelson, J.K., Erwin, S., O’Brien, J.M. &
developments and challenges. Diversity and Distributions, 19, 1217–1223. Schwartz, M.W. (2009) Using species distribution models to predict new
Gogol-Prokurat, M. (2011) Predicting habitat suitability for rare plants at occurrences for rare plants. Diversity and Distributions, 15, 565–576.
local spatial scales using a species distribution model. Ecological Appli- Wisz, M.S., Hijmans, R.J., Li, J., Peterson, A.T., Graham, C.H.,
cations, 21, 33–47. Guisan, A. & Distribut, N.P.S. (2008) Effects of sample size on the
Government of Ontario (2000) Ontario Population Projections, 1999–2028. performance of species distribution models. Diversity and Distribu-
Queen’s Printer for Ontario, Toronto, ON. tions, 14, 763–773.
Guisan, A. & Zimmermann, N.E. (2000) Predictive habitat distribution
models in ecology. Ecological Modelling, 135, 147–186. Received 14 February 2016; accepted 16 May 2016
Guisan, A., Broennimann, O., Engler, R., Vust, M., Yoccoz, N.G., Leh- Handling Editor: Christopher Baraloto
mann, A. & Zimmerman, N.E. (2006) Using niche-based models to
improve the sampling of rare species. Conservation Biology, 20, 501–511.
Guisan, A., Tingley, R., Baumgartner, J.B., Naujokaitis-Lewis, I., Sut- Supporting Information
cliffe, P.R., Tulloch, A.I.T. et al. (2013) Predicting species distributions
for conservation decisions. Ecology Letters, 16, 1424–1435. Additional Supporting Information may be found in the online version
Hernandez, P.A., Graham, C.H., Master, L.L. & Albert, D.L. (2006) The
of this article.
effect of sample size and species characteristics on performance of dif-
ferent species distribution modeling methods. Ecography, 29, 773–785.
Kharouba, H.M., McCune, J.L., Thuiller, W. & Huntley, B. (2013) Do Appendix S1. Species descriptions, associated species, and indica-
ecological differences between taxonomic groups influence the relation- tor species analysis results.
ship between species’ distributions and climate? A global meta-analysis
using species distribution models. Ecography, 36, 657–664.
Appendix S2. Details of MAXENT modelling, including environ-
Larson, B.M., Riley, J.L., Snell, E.A. & Godschalk, H.G. (1999) The
Woodland Heritage of Southern Ontario. Federation of Ontario Natural- mental data sources.
ists, Don Mills, ON.
Le Lay, G., Engler, R., Franc, E. & Guisan, A. (2010) Prospective sam- Appendix S3. Associated species model methods and results.
pling based on model ensembles improves the detection of rare species.
Ecography, 33, 1015–1027.
Lovett-Doust, J., Biernacki, M., Page, R., Chan, M., Natgunarajah, R. & Appendix S4. Maps for each species showing the results of the
Timis, G. (2003) Effects of land ownership and landscape-level factors MAXENT model.
on rare-species richness in natural areas of southern Ontario, Canada.
Landscape Ecology, 18, 621–633. Appendix S5. PERMDISP results.
Luoto, M., Poyry, J., Heikkinen, R.K. & Saarinen, K. (2005) Uncertainty
of bioclimate envelope models based on the geographical distribution of
species. Global Ecology and Biogeography, 14, 575–584. Appendix S6. glm results.
MacDougall, A.S. & Loo, J.A. (2002) Predicting occurrences of geographi-
cally restricted rare floral elements with qualitative habitat data. Envi- Appendix S7. Plant community and landscape data for the 51
ronmental Reviews, 10, 167–190. surveyed sites.
McCune, B. & Grace, B. (2002) Analysis of Ecological Communities. MjM
Software Design, Gleneden Beach, OR.
McCune, J.L. & Vellend, M. (2015) Using plant traits to predict the sensi- Appendix S8. Metadata for Appendix S7.
tivity of colonizations and extirpations to landscape context. Oecologia,
178, 511–524. Appendix S9. All independent occurrence records with associated
McCune, J.L., Harrower, W.L., Avery-Gomm, S., Brogan, J.M., Czergo,
landscape data.
A.-M., Davidson, L.N.K. et al. (2013) Threats to Canadian species at
risk: an analysis of finalized Recovery Strategies. Biological Conserva-
tion, 166, 254–265. Appendix S10. Metadata for Appendix S9.
© 2016 The Author. Journal of Applied Ecology © 2016 British Ecological Society, Journal of Applied Ecology, 53, 1871–1879