Professional Documents
Culture Documents
Andrew K. Saydjari Catherine Zucker J. E. G. Peek Douglas P. Finkbeiner
Andrew K. Saydjari Catherine Zucker J. E. G. Peek Douglas P. Finkbeiner
Andrew K. Saydjari Catherine Zucker J. E. G. Peek Douglas P. Finkbeiner
Abstract
Diffuse interstellar bands (DIBs) are broad absorption features associated with interstellar dust and
can serve as chemical and kinematic tracers. Conventional measurements of DIBs in stellar spectra
are complicated by residuals between observations and best-fit stellar models. To overcome this,
we simultaneously model the spectrum as a combination of stellar, dust, and residual components,
with full posteriors on the joint distribution of the components. This decomposition is obtained by
modeling each component as a draw from a high-dimensional Gaussian distribution in the data-space
(the observed spectrum)—a method we call “Marginalized Analytic Data-space Gaussian Inference for
Component Separation” (MADGICS). We use a data-driven prior for the stellar component, which
avoids missing stellar features not included in synthetic line lists. This technique provides statistically
rigorous uncertainties and detection thresholds, which are required to work in the low signal-to-noise
regime that is commonplace for dusty lines of sight. We reprocess all public Gaia DR3 RVS spectra and
present an improved 8621 Å DIB catalog, free of detectable stellar line contamination. We constrain
the rest-frame wavelength to 8623.14 ± 0.087 Å (vacuum), find no significant evidence for DIBs in
the Local Bubble from the 1/6th of RVS spectra that are public, and show unprecedented correlation
with kinematic substructure in Galactic CO maps. We validate the catalog, its reported uncertainties,
and biases using synthetic injection tests. We believe MADGICS provides a viable path forward for
large-scale spectral line measurements in the presence of complex spectral contamination.
Keywords: Diffuse interstellar bands (379), Astronomy data reduction (1861), Catalogs (205)
tained in the moments of the Gaussian profile of the visits, and interpolated onto a common linear (0.1 Å
DIBs is significantly reduced in that they are averaged spacing) wavelength grid (2401 bins). The wavelength
over all of the combined spectra. calibration of the spectra are reported in vacuum.1 Both
Now, sufficient data quality and quantities are avail- the flux and flux uncertainty per wavelength bin are
able to enable large-scale single detection DIB catalogs. provided by Gaia. See Section 5.2 of Vallenari et al.
In producing a catalog, we produce a low-dimensional 2022 for more details.
representation of the detected DIBs and their proper-
Table 1. 8621 Å DIB Catalogs
ties, generally by listing their 0th , 1st , and 2nd moments
along with the stars toward which they were detected.
Cuts MADGICS Gaia HQ Gaia Full
This has been done in the NIR for the 15273 Å DIB in
APOGEE (Zasowski et al. 2015), which searched over SNR Cuts
∼ 100k spectra, and in the optical for the 8621 Å DIB Stellar SNR > 15 > 70 > 70
in Gaia RVS spectra (Schultheis et al. 2022), which
DIB SNR > 3.8 > 2.86
searched over ∼ 5.5M spectra. Yet, statistical meth-
ods for modeling and detecting DIBs have not kept pace Overall Spectrum Goodness of Fit Cuts
with the increase in data quantity and quality. χ2tot /dof 0.71 − 1.41
DIB catalogs are still made by obtaining a best fit GSP-Spec Flags <2
stellar spectrum, dividing the observed spectrum by the αDIB αDIB > Ri
stellar model, and fitting a Gaussian peak to the resid-
λDIB 8620 − 8626 Å
uals centered near the DIB wavelength (Zasowski et al.
2015; Schultheis et al. 2022). However, stellar line lists σDIB 0.6 − 3.2 Å
and models often have significant residuals relative to Number of Spectra with Detections
observations, which can cause confusion when fitting or Public (999,645) 7,789 9,763 50,787
finding DIB features in the residuals. Further, such fits
Total (5,591,594) 137,536 476,532
are susceptible to biases resulting from variability in the
continuum normalization and are often limited to the
high stellar SNR regime, despite the fact that dusty lines The primary Gaia analysis of these spectra falls under
of sight are more likely to have low stellar SNR due to the General Stellar Parameteriser-spectroscopy (GSP-
dust extinction. Spec) module, which is a purely spectroscopic analysis to
In this work, we introduce a new method for detecting obtain chemo-physical stellar parameters from combined
and modeling DIBs that jointly models the stellar and RVS spectra Recio-Blanco et al. (2022). Within GSP-
DIB components of the spectra, marginalizing over stel- Spec, DIB detections and parameter estimates were
lar types and uncertainties in stellar lines. This method made predominantly by dividing each RVS spectrum
is termed “Marginalized Analytic Data-space Gaussian by the best matching synthetic spectrum (using the
Inference for Component Separation” (MADGICS). We GSP-Spec atmospheric parameters) and performing a
reprocess all public Gaia DR3 RVS spectra with MADG- least-squares fit of a Gaussian function (Schultheis et al.
ICS and demonstrate reduced stellar line contamination 2022).
in the new MADGICS DIB catalog presented herein. Of the 5,591,594 RVS spectra processed by GSP-Spec,
We believe that the speed and stability of the method, only 999,645 are publicly available. Based on the full
even at low stellar SNR, establishes MADGICS as an im- 5,591,594 spectra, the Gaia DIB catalog has entries for
portant statistical tool in the era of large spectroscopic 476,532 spectra, of which 50,787 have public spectra
(DIB) surveys. We anticipate the applicability of this (Table 1). The Gaia DIB team strongly suggests re-
method also extends to atomic interstellar absorption stricting this catalog to a “high quality” (HQ) sample,
(Welty & Hobbs 2001) and emission (Kollmeier et al. which includes 137,536 spectra, of which 9,763 are public
2017) line surveys. (Schultheis et al. 2022). While the Gaia DIB catalog en-
We begin by comparing the 8621 Å MADGICS DIB tries are available for all stars—even those without pub-
catalog to the Gaia DIB catalog and presenting astro- lic spectra—we will restrict our comparisons with the
physical applications and validations of the catalog (Sec- Gaia DIB catalog to stars with public spectra, unless
tion 3). Then, we describe the method, catalog construc- otherwise specified. The Gaia DIB catalog, and other
tion, and further validation tests (Sections 4–5). Gaia-derived properties of the stars or spectral process-
ing were obtained from the DR3 “source” and “astro-
2. DATA physical parameters” tables.
The primary data used in this work are the Gaia DR3
RVS spectra, which were obtained from the Gaia archive 1 Throughout this work, we work exclusively in vacuum wave-
(Gaia Collaboration et al. 2016; Vallenari et al. 2022). length, except in referring to the catalog and DIB by the common
These are optical spectra spanning 8460−8700 Å, which 8621 Å nomenclature reflective of the historical identification of
are shifted into the stellar rest frame, averaged over all DIBs by wavelength in air.
MADGICS 8621 Å DIB Catalog 3
Figure 1. Density of detections in the MADGICS (this work, left) and Gaia (Schultheis et al. 2022, right) 8621 Å DIB catalog
as a function of the DIB width (σDIB ) and its central wavelength (λDIB ). The color scale indicates density, where darker colors
indicate more detections, and is a common logarithmic stretch for both catalogs. The (λDIB , σDIB ) cuts imposed in defining the
Gaia HQ sample are shown as a gray box, inside of which the HQ sample is shown. Outside of that box, the Gaia catalog is
shown with only the stellar SNR, DIB SNR, and GSP-Spec flag cuts imposed. Guidelines for the central wavelength of the DIB
and stellar lines are labeled and color coded between the plots. Stellar lines in magenta indicate lines not in the GSP-Spec line
list, and the Gaia DIB detections preferentially cluster at those wavelengths, especially at low σDIB . Because the MADGICS
catalog inference marginalizes over a data-driven stellar model, it is free from detectable contamination from these stellar lines
and provides a more robust characterization of the DIB properties.
The cuts defining the Gaia HQ sample are summa- The selection function of public RVS spectra remains
rized in Table 1. We divide these cuts into two classes: forthcoming (Seabroke et al. 2022, in prep.), but is
cuts on (1) SNR and (2) the overall model goodness nonuniform across the sky and generally concentrated
of fit for the spectrum. The entire Gaia 8621 Å DIB within 1-2 kpc of the Sun (see Appendix A). There is
catalog is limited to Stellar SNR > 70. A further also significant heterogeneity in the normalization of the
SNR cut on the DIB detection is imposed by requiring RVS spectra (see Appendix B) as a result of a discrete
EWDIB /σ(EWDIB ) > 2.86. The efficacy of cuts imposed choice in the Gaia pipeline to either pseudo-continuum
this way of course depends strongly on the quality of the normalize or rescale the spectra by a constant, as in the
σ(EWDIB ) estimate. case of cool or low SNR stars. (Vallenari et al. 2022).
Additional cuts (Schultheis et al. 2022) are imposed to 3. ASTROPHYSICAL CATALOG VALIDATION
remove low-quality detections, often as a result of poor
3.1. Stellar Frame
stellar modeling. The GSP-Spec flags 1-13, which in-
dicate the goodness of fit for the stellar modeling, are We first compare the MADGICS and Gaia HQ 8621 Å
required to be < 2 (reasonably good). Cuts on the DIB DIB catalog in terms of the 1st and 2nd moments of
peak fit parameters limit λDIB and σDIB to reasonable the detected peak—the center wavelength in the stellar
ranges, 8620 − 8626 Å and 0.6 − 3.2 Å, respectively. frame λDIB and its width σDIB (Figure 1). The advan-
The amplitude of the DIB peak, αDIB , is required to be tage of plotting the catalog in the stellar frame is that
larger than a residual threshold Ri , which has a con- it readily identifies the impact of stellar features on the
stant threshold of at least 0.15. The residual threshold DIB identification. In both panels, we provide guidelines
Ri depends on σDIB and is max(RA , RB , 0.15) for 0.6 < corresponding to:
σDIB < 1.2 and max(RB , 0.15) for 1.2 < σDIB < 3.2. • the putative DIB rest-frame wavelength λDIBrest
Here RA denotes the standard deviation of the “global” (black),
data-model residuals 8605-8640 Å and RB denotes the • stellar lines in the GSP-Spec line list (turquoise,
standard deviation of the “local” residuals “within the Contursi et al. 2021a,b),
DIB profile,” neither of which are publicly available • and stellar lines not in GSP-Spec, but coincident
(Schultheis et al. 2022). with features of stellar origin (see also Section 4.2)
(magenta).
4 Saydjari et al.
For the lines not in GSP-Spec, we do not claim or goodness of fit cuts that eliminate any influence of stel-
validate assignment of the observed stellar features as lar lines and (2) use methods that are maximally robust
resultant from these lines. We simply plot the wave- to stellar mismodeling so that real DIBs can be detected
length and identification of the nearest transition in the and modeled well in the largest number of spectra pos-
“Atomic Line List v3.00b4” (van Hoof 2018). We fo- sible.
cus on demonstrating the impact of stellar lines on DIB In the left panel, the MADGICS catalog morphology
catalogs rather than an identification of the stellar lines is dominated by a Gaussian centered at (λDIB , σDIB ) =
themselves. (8623.37 Å, 1.9 Å). It has a slightly heavy tail toward
In the right panel, the Gaia HQ catalog is shown larger σDIB , which can be understood in terms of a low-
within the central gray box, which denotes the DIB-SNR detection bias (see Section 5.3). The λDIB
(λDIB , σDIB ) cuts imposed in defining the Gaia HQ sam- dispersion is 1.3 Å. Assuming all of the λDIB dispersion
ple (see Table 1). Outside of that box, we show DIB is the result of the relative star-dust velocity dispersion,
detections from the broader Gaia DIB catalog with only this λDIB dispersion corresponds to a velocity disper-
the stellar SNR, DIB SNR, and GSP-Spec flag cuts im- sion of 46 km/s. This velocity dispersion is consistent
posed. Even inside the HQ catalog limits (gray box), the with a value of 49 km/s, which is obtained by adding
influence of stellar features is apparent. The largest den- in quadrature the dispersion of radial velocities for the
sity of detections peaks at the edge of the σDIB cut (0.6 stars in the sample measured by Gaia (35 km/s) and
Å) and is centered on the 8625.24 Å FeI] line.2 This peak the typical velocity dispersion of CO gas (∼ 35 km/s, as
in density extends to larger σDIB until ∼ 1.5 Å when a proxy for the dust velocity) (Dame et al. 2001). We
it broadens and shifts slowly to shorter wavelengths. explicitly note there is no variation in the density of de-
While not clearly associated with a specific stellar line tections or the σDIB as the λDIB approaches any of the
above σDIB = 1.5 Å, the correlation between σDIB and stellar features indicated.
λDIB suggests a non-negligible impact of the stellar lines
on the properties of the DIB detection. There is also a
second peak at low σDIB which appears to straddle the
8620.51 Å Fe i, 8620.79 Å Ti i, and 8621.62 Å Ca i] lines.
These correlations continue and become more promi-
nent outside the parameter cuts (gray box) included in
the definition of the HQ sample. An additional peak
in the detection density appears near the wavelength of
the 8617.75 Å Ca i line. The broad 8620-8622 peak in
density splits at lower σDIB into two modes centered on
either the 8620.51 Å Fe i and 8620.79 Å Ti i lines or the
8621.62 Å Ca i] line. Peaks in density near other GSP-
Spec stellar lines appear to a lesser extent.
To contextualize the impact of these wavelength pile-
ups on kinematic DIB studies, the right y-axis shows the
radial velocity corresponding to the wavelength shift rel-
ative to the DIB center wavelength. This is not indica-
tive of the true DIB radial velocity, which must account
for the stellar radial velocity (see Section 3.4.1).
This comparison with the less restrictive sample shows
that the Gaia goodness of fit cuts on (αDIB , λDIB , σDIB )
are not fully effective at removing the influence of stellar
lines on the DIB catalog, even within the HQ sample.
Figure 2. MADGICS DIB detections within the Local Bub-
This list of stellar features missing from GSP-Spec that
impact the Gaia 8623Å DIB catalog is not meant to ble, where color indicates the SNR of the detections. A 2D
be exhaustive, but merely illustrative that detections of slice at the midplane (Z=0 pc) of the boundary for a model
unmodeled stellar features dominate the catalog, with a of the Local Bubble (Pelgrims et al. 2020; Pelgrims 2022) is
small contribution from stellar features known to GSP- shown (dashed gray). In the MADGICS catalog, no confi-
Spec as well. A comparison of both the MADGICS and dent detections (DIB SNR > 5) are present within the Local
Gaia catalogs without any goodness of fit cuts is pro- Bubble model. This is in contrast to previous work, though
vided in Appendix C. In general, we aim to (1) impose we find evidence that previously reported detections were
associated with stellar features (Figure 1). A 3D interactive
version of this figure is available here.
2 The bracket is spectroscopic notation indicating that the transi-
tion is an intercombination line, meaning it is semi-forbidden.
MADGICS 8621 Å DIB Catalog 5
Figure 3. Projections of the 3D DIB EW map in (left) angular (`, b) and (right) Cartesian (X, Y ) Galactic coordinates. The
inverse-variance weighted mean of EWDIB is shown in color, averaging over dimensions not shown.
Thus, the MADGICS catalog delivers a distribution necessity of marginalizing over possible stellar models
of DIB detections, free of detectable contamination by when reporting the confidence of DIB detections. Using
stellar lines, with a catalog defined only by simple SNR- a single fixed stellar model per star can easily lead to
based detection thresholds and a single χ2 cut represent- stellar residuals that can confuse conventional DIB fit-
ing the overall goodness of fit. This constitutes a clear ting approaches because there is confidently a feature in
improvement over the current status quo in the field. the stellar residuals, but that feature is of stellar origin.
The MADGICS DIB SNR is instead based upon the ∆χ2
3.2. Local Bubble
resulting from adding a DIB component to the compo-
While DIBs are known to be correlated with dust ex- nent separation, using an empirical model to marginalize
tinction, their carriers are suspected to be similar to over stellar features (see Section 4).
the only clearly identified DIB carrier, the buckminster- Because the MADGICS stellar model is empirical (see
fullerene cation (C+ 60 ) (Edwards & Leach 1993; Camp- Section 4.2), one might worry that, if they exist, low EW
bell et al. 2015; Walker et al. 2015). Thus, DIBs should DIBs uncorrelated with dust are baked into the stellar
be detected when large molecules are present, not dust model. While possible, we show in Section 4.2 no fea-
grains themselves, and the correlation is likely due to tures resembling the DIB in the stellar component prior.
coupled formation mechanisms. A recent body of liter- Further, upon loosening the cut on training stars used
ature has claimed detections of DIBs within the Local in constructing the stellar prior (allowing dustier stars
Bubble, despite the low dust extinction therein (Bailey into the training sample), we find only slightly reduced
et al. 2016; Farhang et al. 2019; Schultheis et al. 2022). sensitivity to DIB detection (See Appendix D). Thus,
The Gaia HQ sample supports these claims by reporting using a more careful definition of DIB SNR, we find no
confident detections, albeit of low EW, within the Local evidence for DIB detections within the Local Bubble in
Bubble.3 public Gaia DR3 RVS spectra (Schultheis et al. 2022),
Within the MADGICS DIB catalog, there are no con- which calls for careful reconsideration of related previ-
fident DIB detections (DIB SNR > 5) within the Local ous claims. We look forward to the future release of
Bubble (Figure 2). All confident DIB detections fall out- more and higher stellar SNR spectra to help resolve this
side the bubble and cluster along lines of sight passing question.
through known local molecular clouds such as Taurus,
3.3. Moment-0
Perseus, and Ophiuchus (Zucker et al. 2022). In projec-
tion, three significant detections appear to fall inside the The moment-0 (EWDIB ) map of DIB detections (Fig-
bubble, but do not when examined in 3D. The 3D dis- ure 3) closely resembles a noisy version of 3D dust maps
tribution can be visualized with the interactive version obtained using other techniques (Green et al. 2019).
of Figure 2. This is an important demonstration of the Figure 3 also clearly features detections predominantly
along lines of sight with the highest extinction. This
makes sense given the known positive correlation be-
3 A large number of such detections are included in the public tween DIBs and dust and the low to moderate SNR
subset of Gaia RVS spectra. of most of the stellar spectra; confident DIB detections
6 Saydjari et al.
Figure 5. Projections of the 4D DIB radial velocity map in (left) angular (`, b) and (right) Cartesian (X, Y ) Galactic coordinates.
LSR
The inverse-variance weighted mean of vDIB is shown in color, averaging over dimensions not shown.
It is essential to measure the rest-frame (intrinsic) we only know the distance to the background star, we
wavelength of the DIB to enable the use of the DIB will use this to approximate d, the distance to the DIB
in studies of Galactic kinematics. Additionally, precise source. To help this approximation hold, we limit the
measurement of the rest-frame wavelength aids in iden- fit to background stars within 3 kpc.
tifying its carrier in comparison to theoretical chemical In Figure 6, we present a least-squares fit of λLSR as
models. Here we use the common method of measuring a function of `0 and d using Equation 4 and a constant
the wavelength intercept of the DIB detections toward term to measure λrest . We use inverse-variance weights
the Galactic anti-center (Zasowski et al. 2015; Munari that sum in quadrature the MADGICS σ(λDIB ) and the
et al. 2008; Zhao et al. 2021; Schultheis et al. 2022). background star radial velocity uncertainty from Gaia.
Assuming a purely tangential, azimuthally symmetric We restrict to |`0 | < 10° so that we can safely apply
Galactic rotation, DIB carriers rotating with the Galaxy the small angle approximation and |b| < 10° to help
would exhibit no shift in the λLSR relative to the rest- approximate the Galactic rotation as uniform in the z-
frame wavelength toward the Galactic anti-center. direction. The intercept gives λrest = 8623.14 ± 0.087 Å
The expected change in λLSR of the DIB as a result of (vacuum).
Galactic rotation can be found using Equation 3 when To obtain the uncertainty estimate on the least-
given the distance of the sun to the Galactic center R0 , squares model parameters, we divide the data into 0 <
the circular (tangential) velocity of Galactic rotation Vc d < 2 kpc and 2 < d < 3 kpc. We use the difference
at the location of the DIB source R,~ Galactic longitude in the fit parameters relative to the formal uncertain-
of the DIB source referenced to the Galactic anti-center ties to determine a scalar by which to inflate the formal
`0 = ` − 180, and distance from the sun to the DIB uncertainty so that the difference is only 1 σ. We per-
source projected on to the Galactic plane d. This can form this procedure to help account for uncertainty in
be linearized in the small angle approximation to yield R0 , resulting from the approximation of d, systemat-
Equation 4. ics associated with cuts in (`0 , b) that are imposed, and
the rather strong assumption of a purely tangential, az-
∆λLSR ~ sin `0
Vc (R) imuthally symmetric Galactic rotation.6 For example,
= ×
λrest c measurements from Tchernyshyov & Peek (2017) show
a net radial inflow past 1 kpc on the order of −3 km/s.
1 − q 1 We note that the 0.087 Å uncertainty reported here is
(3)
d2
sin2 `0 + R02
+ 2 Rd0 cos `0 + cos2 `0 comparable to that inflow velocity.
The value found in this work is bluer than that re-
∆λLSR ~
Vc (R) d ported in Schultheis et al. 2022 by 4.7 σ, using the un-
= `0 where `0 90° (4)
λrest c d + R0 certainties reported therein (8623.23±0.019 Å, vacuum),
Because Vc is believed to be locally flat (Bovy et al.
~ to be a constant. We fix
2012) we will take Vc (R) 6 ~Sun always remains.
The absolute uncertainty associated with V
R0 = 8.5 kpc consistent with Reid et al. (2019). While
8 Saydjari et al.
Figure 6. Scatter plot showing residuals after fitting λLSR versus `0 and d where the color shows the weighting of each point
in the fit. Only detections such that |`0 | < 10°, |b| < 10°, and d < 3 kpc are shown. Our derived rest-frame wavelength of
λrest = 8623.14 ± 0.087 Å is bluer than Schultheis et al. (2022), but our updated value is validated in Figure 8 by comparison
with CO gas velocities.
though only 1.03 σ using the uncertainties reported in To compare to other work, we can compute the ex-
this work. We further validate the λrest and its uncer- pected anti-center slope for a given d. Taking d = 2
tainty estimate reported here by comparison with 12 CO kpc, we expect a slope of 19 mÅ/deg. Assuming a simi-
velocities in Section 3.4.2. lar DIB carrier distance distribution, we can compare to
An alternative method for determining the rest-frame Zasowski et al. 2015 by multiplying by the ratio of the
wavelength is referencing to known atomic ISM transi- rest-frame wavelengths of the DIBs, = 0.56.7 This would
tions, such as Na i (Krelowski 1988; Galazutdinov et al. predict 32.1 ± 4.5 mÅ/deg. Schultheis et al. 2022 report
2000). While the wavelength of the Na i transition is 45 ± 20 mÅ/deg, which uses the same DIB, but a differ-
known with high precision and its use alleviates assump- ent sample of detections and a clearly different λDIB (see
tions about Galactic rotation, surveys measuring both Figure 1). However, the comparison is made more dif-
DIBs and transitions for atomic ISM species have his- ficult by the fact that other work often only correct for
torically had far fewer samples. Further, referencing to the USun velocity (Zasowski et al. 2015; Schultheis et al.
Na i requires assuming the Na i and DIB carriers are co- 2022). The correction resulting from the VSun velocity
moving, which can have large scatter (Farhang et al. is approximately linear with `0 near the Galactic anti-
2015). center and applying the correction reduces the resulting
Other applications of the Galactic anti-center method slope significantly.
to DIBs have conventionally performed a simple linear With these validations (and those in Section 3.4.2), we
LSR
fit, holding d to be a fixed constant, and reported the can combine vDIB with the Gaia location and parallax
intercept as λrest and the slope in mÅ/deg. However, it measurements to present a 4D map showing the radial
is very difficult to compare these slopes between DIBs velocity of DIB carriers as a function of position. Fig-
at different wavelengths or even the same DIB measured ure 5 shows a projection of this map in angular (`, b) and
in samples with a different distribution of DIB carrier Cartesian (X, Y ) Galactic coordinates, taking an inverse
distances d (see Equation 4). Instead, our fit parameter variance weighted mean over the dimensions not shown.
in Equation 4 is Vc = 204 ± 60 km/s. This can be easily A lower noise map could be achieved with stricter DIB
compared to Galactic rotation models. This value is SNR cuts, though this comes at the expense of the num-
< 1 σ from that of Bovy et al. 2012, which is 216 ± 6
km/s. However, given the approximations made above,
7 Zasowski et al. 2015 restricted their fit to background stars within
this should not be viewed as a measurement of Vc , but
2 kpc, which helps make this assumption more valid.
as a sanity check.
MADGICS 8621 Å DIB Catalog 9
LSR
Figure 7. Scatter plot of vDIB for DIB detections with DIB SNR > 5.5 overlaid on longitude-velocity diagram of CO emission
from Dame et al. 2001, integrated over b (colorbar top). Scatter points are colored by the distance to the background star
(colorbar right).
We compare DIB kinematics to the Dame et al. 2001
moment-masked, composite 12 CO survey.8 We use the
Full Galaxy cube (−180° < ` < 180°, |b| < 30°) which is
reported on a rectilinear Galactic Cartesian grid (1/8 °
LSR
spacing) and a velocity grid (|vCO | < 320 km/s) with
1.3 km/s spacing. The LSR frame assumes the “classic”
solar motion of 20 km/s toward (RA, Dec) = (18h,30°)
(epoch 1900), which is approximately V ~Sun = (10.3, 15.3,
7.7) km/s (private communication, Dame, 2022). Fig-
ure 7 shows the CO map, integrated along b.
LSR
In order to compare vDIB to the CO kinematics, we
LSR
convert vDIB into the LSR frame convention used by
Dame et al. 2001 and over plot the DIB detections with
LSR
DIB SNR > 5.5 in Figure 7. The average vDIB as a
function of ` follows the intensity-weighted average vCO
(Figure 8). Excitingly, we further observed a subpopu-
lation of the DIBs that coincide with higher-amplitude
oscillations in the CO map. Coloring the scatter plot by
the distance to the background star, we find this sub-
population of DIB detections occur along lines of sight
Figure 8. Histogram (2D) illustrating correlation between to more distant stars (compared to the average within
LSR
the vDIB and intensity-weighted velocity of nearest CO com- the sample). This is consistent with the interpretation
ponent in the Dame et al. 2001 CO map. The color scale of higher-amplitude oscillations in 12 CO as kinemati-
indicates density, where darker colors indicate more detec- cally coherent Galactic substructure at larger distances,
tions and a logarithmic stretch has been applied. sometimes attributed to spiral arms.
We quantitatively demonstrate the correlation be-
LSR
ber of detections and spatial coverage given the limited tween vDIB in Figure 8. Motivated by the multimodality
public Gaia RVS sample. The usual features (alternat- shown in Figure 7, before taking the intensity-weighted
ing sign quadrants) of a smooth Galactic rotation curve average of vCO , we find the peak in the CO spectrum
LSR
are observed. We defer searching for anomalies rela- nearest to vDIB for the (`, b)-pixel in the CO map cor-
tive to a smooth Galactic rotation model until having a responding to the line of sight for a DIB detection.
larger sample. Peak-finding has a tendency to slightly overstate the
correlation, but the clumpiness of the CO necessitates
3.4.2. CO Comparison such an approach. This peak-finding algorithm is dis-
12
CO is a believed to be a good tracer of DIBs and
dust because it has a well-known, easy-to-detect line 8 https://lweb.cfa.harvard.edu/rtdc/CO/CompositeSurveys/
and indicates the presence of carbonaceous molecules.
10 Saydjari et al.
Figure 11. Example MADGICS decomposition of Gaia RVS spectrum under a star + residual (red) and star + DIB × star +
residual model (black).
then built by multiplying the matrix of these observa- 4.2.2. Residual Prior
tions by its transpose, with the relative contributions In all measurements, there is contribution from noise
weighted by stellar SNR.10 However, since the contin- that will follow its own covariance structure, and thus
uum normalization is not stable (see Appendix A), we we also include a “residual” component to capture this
add 0.1 times the ones matrix to the covariance so that noise. Ideally, these measurement uncertainties are un-
the stellar component will absorb overall shifts in the correlated and the residual prior is a simple diagonal.
continuum normalization.11 In that case, unmodeled features in the data which do
The resulting stellar covariance matrix (the MADG- not conform well to the prior of any other component in
ICS prior) is shown in Figure 10. Despite the com- the model (such as a previously unidentified DIB) will
plexity of stellar spectra, this covariance is in general simply appear in the residuals.
well-described by a low-rank approximation, which also Gaia DR3 includes a per-pixel flux uncertainty vec-
significantly accelerates the computation. We keep the tor, which could be used on the diagonal of the residual
first 50 eigenvectors to describe the stellar covariance, covariance prior. However, when doing so, one finds
one of which is shown in Figure 10. We overplot the that the MADGICS residual component in fact has cor-
same guidelines as in Figure 1. We show that the three relations, with a kernel that appears to be the result
stellar lines (FeI], CaI], CaI) not in the GSP-Spec line of upstream processing interpolating the spectra onto
list (but identified in Figure 1 as having the largest im- a common grid.12 These correlations impact the un-
pact on biasing the Gaia DIB catalog) have significant certainty of EWDIB , but we want to retain a diagonal
peaks comparable to those for lines in the GSP-Spec line residual prior, so we inflate the residual prior by mul-
list. We explicitly check for and do not find significant tiplying by a constant (3.6) such that the sum of the
peaks centered at the DIB wavelength. correlated noise and uncorrelated noise covariances are
the same.
Figure 12. Histograms (2D) in the space of GSP-Spec stellar parameters for all (left) and training (middle) spectra. (Right)
The overall “goodness of fit”, measured by how close χ2 /dof is to 1, is shown. Each pixel is the median of log10 (χ2 /dof), which
is zero in the ideal case.
4.2.3. DIB Prior star + DIB + residual model (see Figure 11). We can
then update x̂star , using the component from the star +
The contribution from DIBs is fundamentally mul-
DIB + residual model, and iterate until convergence. In
tiplicative since it is an absorption of the background
practice, we only iterate to refine the xDIB once.
starlight. Since MADGICS is additive, it finds a lin-
While MADGICS can easily marginalize over (co)-
ear decomposition of the data into components, we first
variations in peak amplitudes, it struggles to marginal-
model the spectra (even one containing a DIB detec-
ize over shifts in observed wavelength (radial velocity)
tion) as being star + residual components (Figure 11).
or changes in peak width (σDIB ) which are not well de-
Then, we express a prior on the line shape of the DIB ab-
scribed in the data-space representation. In order to
sorption. In this case, we use a Gaussian for simplicity,
optimize over radial velocity and σDIB , we build a grid
but since we need only specify the DIB prior as a pixel-
of covariance priors over a range of (λDIB , σDIB ) and
pixel covariance matrix, we can accommodate arbitrary
evaluate the ∆χ2 at each point, seeking the minimum
line shapes and/or degrees of asymmetry. Sampling over
(most negative) ∆χ2 .
variations in the line shape, we construct a pixel-pixel
In practice, we find the search of the 2D (λDIB , σDIB )-
covariance matrix. Here we keep only a two-component,
space well-approximated using line searches with a single
low-rank approximation for the DIB covariance, which
iteration of refinement. The λDIB scan is ±100 pixels, in
we generate analytically using the functional form of a
0.1 pixel steps (0.01 Å) at highest resolution. The σDIB
Gaussian and its derivative with respect to wavelength.
scan is 0.4 to 4 Å, in 0.01 Å steps at highest resolution
We use a constant amplitude prior of 0.1 and reduce the
(see code in Section 6 for more). The resolution of the
amplitude of the derivative by a factor of 0.01.
λDIB and σDIB scans were chosen such that the step
However, this covariance is a prior on x0DIB while
size is ≤ 10% the typical uncertainty in each parameter
the linear contribution of the DIB to the spectrum is
estimate.
xDIB = x0DIB × xstar . Since we do not know xstar , we
The λDIB and σDIB uncertainties are estimated from
use x̂star from the star + residual decomposition and
the shape of the ∆χ2 surface around the minimum, us-
rescale the covariance for x0DIB to obtain the covariance
ing the Hessian estimated with finite differences. For
for xDIB . Since the contribution from the DIB is pre-
a given point in the (λDIB , σDIB )-grid, we have esti-
dominantly in the residual component in the star +
mates of EWDIB and the uncertainty in EWDIB given
residual model, the x̂star changes only slightly in the
14 Saydjari et al.
Figure 15. Histograms (2D) for DIB models as a function of EWDIB , λDIB , σDIB , and stellar SNR in the columns, left to right
respectively. The λDIB is reported relative to the reference center of the MADGICS parameter grid. Positive EWDIB indicates
absorption. The rows show this distribution for (top) the total χ2 per degree of freedom of the component fit, the negative
change in χ2 as a result of adding the DIB component before (middle) and after (bottom) imposing cuts on χ2tot /dof. Guidelines
indicate cuts imposed in each space. The color scale indicates density, where darker colors indicate more detections, and each
panel has its own logarithmic stretch.
should be increased to obtain higher-quality samples as mag (Schlegel et al. 1998), and with high-quality up-
desired. The DIB SNR provides a well-motivated con- stream processing (GSP-Spec flags 1-13 equal to zero).
tinuous scale on which to make such quality cuts. Then, we randomly draw a position, width, and am-
The catalog created after these cuts is shown in Fig- plitude (λDIB , σDIB , αDIB ) as well as an index of a
ure 14 as 2D histograms in the space of the moment- “parent” spectrum for the synthetic DIB. Each of these
0, moment-1, moment-2, and DIB SNR of the detec- parameters are uniformly sampled over a wide range:
tions. Clearly, the vast majority of the catalog has the λDIB ∈ [8618.47, 8628.47] Å, σDIB ∈ [0.6, 3.1] Å, αDIB ∈
expected (and likely true) positive absorption and mod- [0.25, 0.0] Å.
erate σDIB ∼ 1.9 Å. Further, Figure 14 illustrates that By uniformly sampling over “parent” spectra, the in-
spurious negative EWDIB detections are only a small jection tests approximately follow the distribution of
fraction of the catalog and are only present at low DIB stellar SNR in the overall sample (though the SFD and
SNR. This explicitly demonstrates how DIB SNR can GSP-Spec flag requirements modify the distribution).
be use as a well-motivated quality cut. Since DIB absorption is multiplicative, absorbing the
background star light, to obtain the additive DIB com-
5.2. Injection Tests ponent we must multiply the Gaussian profile with the
To quantify biases and validate reported uncertain- drawn parameters by an estimate of the stellar compo-
ties in both the component separation and catalog cre- nent for the “parent” source. Since we have a stellar
ation steps, we created a set of injection tests. To gen- component estimate from a star+residuals model (see
erate these tests we start from a “parent” set of sources Section 4), we use that stellar component for the “par-
along lines of sight with low-reddening, SFD < 0.05 ent” source. Then, the synthetic spectrum is the original
MADGICS 8621 Å DIB Catalog 17
Figure 16. Histograms illustrating the bias of DIB detections relative to reported uncertainties (Z-scores) for injection tests
where ground truth is known. Histograms are a function of injected EWDIB , injected λDIB , injected σDIB , stellar SNR and DIB
SNR in the rows, and show the bias in the recovered quantities EWDIB , λDIB , σDIB in the columns (left to right). The λDIB
is reported relative to the reference center of the MADGICS parameter grid. The color scale indicates density, where darker
colors indicate more detections, and each panel has its own logarithmic stretch. Dashed guidelines provide reference for 0 and
±1σ. Solid lines indicate the center (red) and spread (white) of the Z-score distribution.
18 Saydjari et al.
measurements, we have a physical prior expectation that source densities in future data releases, larger high-
most true σDIB values lie within 1.5 - 2.3 Å for which quality 4D kinematic maps of the ISM will enable de-
the σDIB bias is predominantly positive. tailed studies of ISM substructure and dynamics (Tch-
The form of the low SNR bias can be reproduced ernyshyov et al. 2018).
theoretically only by combining the effects of both the ACKNOWLEDGMENTS
fixed ∆χ2 detection cut and bias correction for using
parameter estimates in a Gaussian formalism for a like- A.K.S. gratefully acknowledges support by a Na-
lihood that has nonlinear dependence on those param- tional Science Foundation Graduate Research Fel-
eters (Zanolin et al. 2010; Portillo et al. 2017). This lowship (DGE-1745303). D.P.F. acknowledges sup-
is because we are approximating as Gaussian the pa- port by NSF grant AST-1614941, “Exploring the
rameter posteriors that are non-Gaussian due to that Galaxy: 3-Dimensional Structure and Stellar Streams.”
nonlinearity. The quantitative theoretical model will be D.P.F. acknowledges support by NASA ADAP grant
described in a future work. 80NSSC21K0634 “Knitting Together the Milky Way:
An Integrated Model of the Galaxy’s Stars, Gas, and
6. DATA/CODE AVAILABILITY Dust.” C.Z. acknowledges that support for this work was
The final catalog, code that implements MADGICS, provided by NASA through the NASA Hubble Fellow-
and several small summary files are provided via Zen- ship grant HST-HF2-51498.001 awarded by the Space
odo (2 GB). Jupyter notebooks that generate the priors, Telescope Science Institute (STScI), which is operated
injection tests, and the catalogs as well as reproduce all by the Association of Universities for Research in As-
figures are also provided. Individual files contained in tronomy, Inc., for NASA, under contract NAS5-26555.
the Zenodo, including the final catalog alone (3 MB), The support and resources from the Center for High
are available from the project website. The quality of Performance Computing at the University of Utah are
the final catalog can be further refined by imposing a cut gratefully acknowledged. A portion of this work was also
on the DIB SNR field. The full set of intermediate data enabled by the FASRC Cannon cluster supported by the
products, including the full star+residual components, FAS Division of Science Research Computing Group at
star+DIB+residual components, and injection tests are Harvard University.
also available from the project website (500 GB total). This work was supported by the National Sci-
7. CONCLUSION ence Foundation under Cooperative Agreement PHY-
In this work, we introduced a new technique based on 2019786 (The NSF AI Institute for Artificial Intelli-
Marginalized Analytic Data-space Gaussian Inference gence and Fundamental Interactions). The authors ac-
for Component Separation (MADGICS) to detect and knowledge Interstellar Institute’s program “With Two
measure the properties of DIBs in stellar spectra. Using Eyes” and the Paris-Saclay University’s Institut Pascal
MADGICS, we pushed detections to lower stellar SNR for hosting discussions that nourished the development
and marginalized over stellar features, resulting in a new of the ideas behind this work.
Gaia RVS 8621 Å DIB catalog free from detectable stel- We acknowledge helpful discussions with Mathias
lar contamination. In this catalog, we find no evidence Schultheis, He Zhao, Morgan Fouesneau, Jos de Brui-
for DIB detections within the Local Bubble. jne, Gordian Edenhofer, Torsten Enßlin, Gail Za-
We present and validate a 4D map of ISM kinematics sowski, Joel Brownstein, Tom Dame, Greg Green, Ed-
based on the 1st of the DIBs. As part of this work, die Schlafly, Jonathan Bird, Adam Wheeler, and Andy
we adjusted the measured rest-frame wavelength to Casey. A.K.S. acknowledges Sophia Sánchez-Maes for
8623.14 ± 0.087 Å, with a precision accounting for mod- helpful discussions and much support.
eling uncertainties. Further, we show unprecedented This work has made use of data from the Euro-
correlation of the DIBs with kinematic substructure in pean Space Agency (ESA) mission Gaia (https://www.
Galactic CO maps. An interpretable and physically re- cosmos.esa.int/gaia), processed by the Gaia Data Pro-
alistic trend in the 2nd moment of the DIBs was also pre- cessing and Analysis Consortium (DPAC, https://www.
sented. Rigorous validation of the catalog, its reported cosmos.esa.int/web/gaia/dpac/consortium). Funding
uncertainties, and the choices made in constructing it for the DPAC has been provided by national institu-
are provided via a series of synthetic injection tests. This tions, in particular the institutions participating in the
allows modeling of the low DIB SNR detection biases in Gaia Multilateral Agreement. This research has made
both EWDIB and σDIB . use of the VizieR catalog access tool, CDS, Strasbourg,
In future work, MADGICS could be extended to more France (Ochsenbein et al. 2000).
complicated spectral component separation tasks, in- Facility: Gaia
cluding identifying multiple stellar components (and the Software: Julia (Bezanson et al. 2017), FITSIO.jl
associated radial velocities), removing sky emission lines (Pence et al. 2010), HDF5.jl (The HDF Group 1997-
(from ground-based observations), and modeling DIB 2022), Healpix.jl (Tomasi & Li 2021), BenchmarkTools.jl
detections with multiple cloud components (velocities) (Chen & Revels 2016), astropy (Astropy Collaboration
along the line of sight. Looking forward to increased et al. 2013), ipython (Perez & Granger 2007), mat-
plotlib (Hunter 2007), scipy (Virtanen et al. 2020).
20 Saydjari et al.
Figure 18. Density of public RVS spectra on the sky (left, HEALPix grid at NSide = 64) and projected onto the Galactic
plane in Cartesian coordinates (right).
APPENDIX
A. PUBLIC RVS SELECTION FUNCTION tions as a function of both DIB SNR and stellar SNR.
We illustrate the distribution of public RVS spectra Clearly, many high-quality detections (large DIB SNR)
in Figure 18. The highly heterogeneous distribution on were found below the stellar SNR cutoff imposed by the
the sky (shown in Galactic coordinates) shows clear pat- Gaia DIB catalog. Because a large fraction of the spec-
terns associated with the footprints of other surveys, tra have low SNR, it is important to leverage their sta-
such as Kepler (Borucki et al. 2010), Galactic Archae- tistical power. The peak at stellar SNR ∼ 20 is simply
ology with Hermes (GALAH, Buder et al. 2021), the a reflection of the unusually large number of spectra at
Large sky Area Multi-Object Fiber Spectroscopic Tele- that stellar SNR in the public Gaia DR3 RVS spectra.
scope (LAMOST, Luo et al. 2015), and the Apache Point We have confirmed that the fraction of DIB detections
Observatory Galactic Evolution Experiment (APOGEE, passing our DIB SNR cut actually decreases as a func-
Majewski et al. 2017). We understand this to be because tion of stellar SNR. This lends confidence that MADG-
the majority of the public RVS spectra were those used ICS is correctly finding the large DIB features which
for validation by comparing to other spectroscopic mea- have high DIB SNR even on low stellar SNR spectra,
surements of the same source (Vallenari et al. 2022). without overfitting noise as DIB detections.
Heterogeneity is also seen with respect to the stellar C. NO GOODNESS OF FIT CUTS
SNR, as shown in Figure 19. Plotting as a function of
stellar SNR also highlights the significant heterogeneity As stated in the main text, we aim to (1) impose good-
in the normalization of the RVS spectra as a result of ness of fit cuts that eliminate any influence of stellar lines
a discrete choice in the Gaia pipeline to either pseudo- and (2) use methods that are maximally robust to stellar
continuum normalize or rescale the spectra by a constant mismodeling so that real DIBs can be detected and well-
(in the case of cool stars or low stellar SNR) (Vallenari modeled in the largest number of spectra possible. To
et al. 2022). help compare the catalogs with respect to (2), we repli-
cate Figure 1 without any goodness of fit cuts imposed.
Without goodness of fit cuts, MADGICS would return
B. DIB SNR DISTRIBUTION detections in 19,071 spectra compared to the 50,787 de-
It is useful to describe the MADGICS DIB catalog tections in the Gaia Full catalog on public spectra.
detections as a function of SNR. In the top of Figure Clearly, there are detections associated with several of
20, we show the cumulative detections as a function of the stellar lines identified in Figure 1 within the MADG-
the reported DIB SNR. This provides an idea of the ICS detections when goodness of fit cuts are not im-
number of detections remaining in the catalog for some posed. This is to be expected. When the star is severely
choice of quality cut using the DIB SNR. In the bottom mismodeled, dividing by the stellar model and finding
of Figure 20, we show the 2D histogram of the detec- the broad multiplicative absorption of the DIB is nearly
MADGICS 8621 Å DIB Catalog 21
Figure 19. (Top) Histogram of public Gaia RVS sources Figure 20. (Top) Cumulative counts of MADGICS DIB de-
with respect to stellar SNR. (Bottom) 2D histogram of pub- tections as a function of DIB SNR. (Bottom) 2D histogram
lic Gaia RVS sources with respect to stellar SNR and the of MADGICS DIB detections as a function of DIB and stellar
mean normalized flux in the RVS spectrum. The color scale SNR. The color scale indicates density, where darker colors
indicates density, where darker colors indicate more sources, indicate more sources, and is a logarithmic stretch. A guide-
and is a logarithmic stretch. A guideline is shown at stel- line is shown at stellar SNR 70, which is the cutoff imposed
lar SNR 70, which is the cutoff imposed by the Gaia DIB by the Gaia DIB catalog.
catalog.
contributions from low EWDIB lines of sight that are
impossible. The question is then “How robust is the DIB present in the training data despite the cut on SFD <
identification to the stellar mismodeling?” The lower 0.05. However, we can easily show the impact of the
density of detections associated with stellar features in inverse, where we loosen the cut to SFD < 0.1 during
the MADGICS catalog panel of Figure 21 suggests that the construction of the stellar prior, thereby including
the MADGICS approach is also more robust (2), in ad- much dustier lines of sight that are likely to have more
dition to having the high-quality goodness of fit cut (1) contributions from low EWDIB DIBs. We then use this
demonstrated in Section 3.1. “dusty” stellar prior to detect DIBs on the synthetic
injection tests described in the main text, and show the
D. DUSTY STELLAR PRIOR
resulting change in the DIB catalog in Figure 22.
It is difficult to prove that a data-driven stellar model
(the stellar covariance herein) does not have any small
22 Saydjari et al.
Figure 21. Density of detections in the MADGICS (left) and Gaia (right) 8623Å DIB catalog as a function of λDIB and σDIB ,
without any cut imposed on the goodness of fit of the model to the overall spectrum. The Gaia catalog shown is the full DIB
sample, including DIBs identified on spectra that are not public. The color scale indicates density, where darker colors indicate
more detections, and is a relative logarithmic stretch for both catalogs, with the maximum rescaled by relative catalog sizes.
Guidelines for the central wavelength of the DIB and stellar lines are labeled and color coded between the plots.
Figure 22 shows the median change in EWDIB as a
function of the true EWDIB and true DIB SNR of the
injected DIBs. We take this difference between DIB de-
tections appearing in both the original and “dusty stellar
prior” DIB catalogs built from the injection tests. Re-
gions of parameter space where > 25% of detections in
the original catalog were not refound are colored green.
Only a very small number (557/1,000,000) of injections
were detected in the “dusty stellar prior” catalog, but
not the original and are excluded from this plot. Re-
gions of parameter space where no DIBs were detected
in either catalog are shown in light gray, and regions not
probed by the injection tests are shown in dark gray.
Over the majority of the detections, there is a < 0.005
Å reduction in the reported EWDIB . Just near the de-
tection threshold, a < 0.005 Å increase is observed. This
can be interpreted as a shift in the detection threshold
carrying with it the usual positive detection bias (see
Section 5.3). For moderate EWDIB (> 0.15 Å), there is
very little change in the detectability of the synthetic in-
jections, even at low DIB SNR. At low EWDIB (< 0.15
Figure 22. Median change in EWDIB as a function of Å), there is a clear increase in the true DIB SNR re-
EWDIB and DIB SNR after including dustier lines of sight in quired in order for an injected DIB to be detected. This
the training data for the stellar covariance. Green indicates makes sense as some small contribution of the DIB peak
> 25% of detections not found, gray indicates detections not is present in the stellar model, though averaged over the
found with the clean stellar covariance, and dark gray in- relative velocity distribution between the star and dust.
dicates no injection tests present. The guideline (magneta, However, almost all of the DIBs that are lost have a true
dashed) indicates the detection cut on estimated DIB SNR DIB SNR below the DIB SNR cut imposed in making
impose in making the MADGICS catalog. the catalog, and thus were only detected previously be-
cause they fluctuated high. Further, it is important to
MADGICS 8621 Å DIB Catalog 23
note the persistence of low EWDIB detections at higher we find a similar correlation when restricting to low ve-
DIB SNR. locity CO without any peak-finding, only making the
Resolving the question of the presence of DIBs in the two broad cuts outlined above (Figure 23). Comput-
Local Bubble is important. In this work, we show that ing vCO as the intensity-weighted velocity over all ve-
approaches with synthetic models can incorrectly iden- locity channels, we restrict to |vCO | < 15 km/s to re-
tify stellar features as DIBs. However, it is difficult to duce the multimodality of the CO and DIB distribu-
prove that a data-driven stellar model is completely free tions. Then, we find a slope of 0.83 ± 0.04 and an in-
of contributions from low EWDIB DIBs. Yet, Figure 22 tercept of −0.51 ± 0.3 km/s. Both of these values are
makes it clear that data-driven stellar models can solve consistent with those discussed in the text, though the
this question if we push to higher stellar SNR spectra, slope is slightly flatter and there is larger scatter due to
since we can then push confident DIB detections to lower the additional averaging in vCO . However, the scatter
LSR
EWDIB limits. of residuals/σ(vDIB ), or Z-scores, was 0.96, indicating
that the reported uncertainties well-describe the scatter
E. DUST-CO CORRELATION in the residuals. Thus, we conclude the peak-finding to
determine vCO does not significantly change the conclu-
To account for the clumpiness of CO, we find the near- sions and it helps illustrate that strong correlations are
est (in velocity) peak in CO to the DIB peak before cal- present for high velocity DIBs and CO after accounting
culating the intensity-weighted mean CO velocity vCO for the multimodality.
that corresponds to that DIB. First we restrict to only
DIBs within the footprint of the Dame et al. 2001 CO F. PURITY AND COMPLETENESS
survey (3602). Further, we restrict to DIBs such that
there is at least some detection of CO within 1σ of the Combining injection tests with a choice for what a
DIB velocity (2272). Then, we find the nearest con- “good” detection is allows an estimate of the complete-
nected set of velocity channels with CO detections above ness of the catalog (Figure 24). Here, we take an arbi-
threshold, where the threshold starts at zero, and the trary choice for good detections to be |∆αDIB |/αDIB <
threshold increases in steps of 0.05 until the connected 0.2, |∆λDIB | < 2.5 Å, |∆σDIB |/σDIB < 0.2, where the
region (island) is less than 9 velocity pixels wide. After ∆ indicates differences with respect to the ground truth
this stopping condition is reached, we take the intensity values for the injected DIB profile. We show this com-
weighted average over the island to obtain vCO . pleteness as a function of both EWDIB and σDIB to illus-
While such a peak-finding algorithm will necessarily trate that it is more difficult to detect wider DIBs given
overestimate the correlation between the CO and DIBs, the same EWDIB . For EWDIB ≥ 0.2 Å, the catalog is
REFERENCES
Allende Prieto, C., Koesterke, L., Ludwig, H. G., Freytag, Buder, S., Sharma, S., Kos, J., et al. 2021, MNRAS, 506,
B., & Caffau, E. 2013, A&A, 550, A103, 150, doi: 10.1093/mnras/stab1242
doi: 10.1051/0004-6361/201220064 Campbell, E. K., Holz, M., Gerlich, D., & Maier, J. P.
Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., 2015, Nature, 523, 322, doi: 10.1038/nature14566
et al. 2013, A&A, 558, A33, Chen, J., & Revels, J. 2016, arXiv e-prints.
doi: 10.1051/0004-6361/201322068 https://arxiv.org/abs/1608.04295
Bailer-Jones, C. A. L., Rybizki, J., Fouesneau, M., Contursi, G., de Laverny, P., Recio-Blanco, A., & Palicio,
Demleitner, M., & Andrae, R. 2021, AJ, 161, 147, P. A. 2021a, A&A, 654, A130,
doi: 10.3847/1538-3881/abd806 doi: 10.1051/0004-6361/202140912
—. 2021b, GSP-spec line list, Version 21-Sep-2022 (last
Bailey, M., van Loon, J. T., Farhang, A., et al. 2016, A&A,
modified), Centre de Donnees astronomique de
585, A12, doi: 10.1051/0004-6361/201526656
Strasbourg (CDS),
Bezanson, J., Edelman, A., Karpinski, S., & Shah, V. B.
doi: https://doi.org/10.26093/cds/vizier.36540130
2017, SIAM review, 59, 65.
Dame, T. M., Hartmann, D., & Thaddeus, P. 2001, ApJ,
https://doi.org/10.1137/141000671
547, 792, doi: 10.1086/318388
Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science,
Edwards, S. A., & Leach, S. 1993, A&A, 272, 533
327, 977, doi: 10.1126/science.1185402 Edwards, S. A., & Leach, S. 1994, in American Institute of
Bovy, J., Allende Prieto, C., Beers, T. C., et al. 2012, ApJ, Physics Conference Series, Vol. 312, Molecules and
759, 131, doi: 10.1088/0004-637X/759/2/131 Grains in Space, ed. I. Nenner, 589, doi: 10.1063/1.46582
MADGICS 8621 Å DIB Catalog 25
Farhang, A., van Loon, J. T., Khosroshahi, H. G., Javadi, Recio-Blanco, A., de Laverny, P., Palicio, P. A., et al. 2022,
A., & Bailey, M. 2019, Nature Astronomy, 3, 922, arXiv e-prints, arXiv:2206.05541.
doi: 10.1038/s41550-019-0814-z https://arxiv.org/abs/2206.05541
Farhang, A., Khosroshahi, H. G., Javadi, A., et al. 2015, Reid, M. J., Menten, K. M., Brunthaler, A., et al. 2019,
ApJ, 800, 64, doi: 10.1088/0004-637X/800/1/64 ApJ, 885, 131, doi: 10.3847/1538-4357/ab4a11
Gaia Collaboration, Prusti, T., de Bruijne, J. H. J., et al. Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ,
2016, A&A, 595, A1, doi: 10.1051/0004-6361/201629272 500, 525, doi: 10.1086/305772
Galazutdinov, G. A., Musaev, F. A., Krelowski, J., & Schultheis, M., Zhao, H., Zwitter, T., et al. 2022, arXiv
Walker, G. A. H. 2000, PASP, 112, 648, e-prints, arXiv:2206.05536.
doi: 10.1086/316570 https://arxiv.org/abs/2206.05536
Green, G. M., Schlafly, E., Zucker, C., Speagle, J. S., & Seabroke, G., Sartoretti, P., & et al. 2022, in prep.
Finkbeiner, D. 2019, ApJ, 887, 93, Snow, T. P., Zukowski, D., & Massey, P. 2002, ApJ, 578,
doi: 10.3847/1538-4357/ab5362 877, doi: 10.1086/342660
Hacar, A., Alves, J., Burkert, A., & Goldsmith, P. 2016, Steinmetz, M., Zwitter, T., Siebert, A., et al. 2006, AJ, 132,
A&A, 591, A104, doi: 10.1051/0004-6361/201527319 1645, doi: 10.1086/506564
Hunter, J. D. 2007, Computing in Science and Engineering, Tchernyshyov, K., & Peek, J. E. G. 2017, AJ, 153, 8,
9, 90, doi: 10.1109/MCSE.2007.55 doi: 10.3847/1538-3881/153/1/8
Kollmeier, J. A., Zasowski, G., Rix, H.-W., et al. 2017, Tchernyshyov, K., Peek, J. E. G., & Zasowski, G. 2018, AJ,
arXiv e-prints, arXiv:1711.03234. 156, 248, doi: 10.3847/1538-3881/aae68d
https://arxiv.org/abs/1711.03234 The HDF Group. 1997-2022, Hierarchical Data Format,
Kos, J., & Zwitter, T. 2013, ApJ, 774, 72, version 5
doi: 10.1088/0004-637X/774/1/72 Tomasi, M., & Li, Z. 2021, Healpix.jl: Julia-only port of the
Krelowski, J. 1988, PASP, 100, 896, doi: 10.1086/132253 HEALPix library, 3.0. http://ascl.net/2109.028
Lai, T. S. Y., Witt, A. N., Alvarez, C., & Cami, J. 2020, Vallenari, A., Brown, A. G. A., Prusti, T., et al. 2022,
MNRAS, 492, 5853, doi: 10.1093/mnras/staa223 arXiv e-prints, arXiv:2208.00211.
Lan, T.-W., Ménard, B., & Zhu, G. 2015, MNRAS, 452, https://arxiv.org/abs/2208.00211
3629, doi: 10.1093/mnras/stv1519 van Hoof, P. A. M. 2018, Galaxies, 6, 63,
Luo, A. L., Zhao, Y.-H., Zhao, G., et al. 2015, Research in doi: 10.3390/galaxies6020063
Astronomy and Astrophysics, 15, 1095, van Loon, J. T., Bailey, M., Tatton, B. L., et al. 2013,
doi: 10.1088/1674-4527/15/8/002 A&A, 550, A108, doi: 10.1051/0004-6361/201220210
Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020,
2017, AJ, 154, 94, doi: 10.3847/1538-3881/aa784d Nature Methods, 17, 261, doi: 10.1038/s41592-019-0686-2
Munari, U., Tomasella, L., Fiorucci, M., et al. 2008, A&A, Walker, G. A. H., Bohlender, D. A., Maier, J. P., &
488, 969, doi: 10.1051/0004-6361:200810232 Campbell, E. K. 2015, ApJL, 812, L8,
Ochsenbein, F., Bauer, P., & Marcout, J. 2000, A&AS, 143, doi: 10.1088/2041-8205/812/1/L8
23, doi: 10.1051/aas:2000169 Welty, D. E., & Hobbs, L. M. 2001, ApJS, 133, 345,
doi: 10.1086/320354
Pelgrims, V. 2022, The shape of the shell of the Local
Woodbury, M. A. 1950, in Memorandum Rept. 42,
Bubble, V1, Harvard Dataverse,
Statistical Research Group (Princeton Univ.), 4
doi: 10.7910/DVN/RHPVNC
York, D. G., Adelman, J., Anderson, John E., J., et al.
Pelgrims, V., Ferrière, K., Boulanger, F., Lallement, R., &
2000, AJ, 120, 1579, doi: 10.1086/301513
Montier, L. 2020, A&A, 636, A17,
Zanolin, M., Vitale, S., & Makris, N. 2010, PhRvD, 81,
doi: 10.1051/0004-6361/201937157
124048, doi: 10.1103/PhysRevD.81.124048
Pence, W. D., Chiappetti, L., Page, C. G., Shaw, R. A., &
Zasowski, G., Ménard, B., Bizyaev, D., et al. 2015, ApJ,
Stobie, E. 2010, A&A, 524, A42,
798, 35, doi: 10.1088/0004-637X/798/1/35
doi: 10.1051/0004-6361/201015362
Zhao, H., Schultheis, M., Rojas-Arriagada, A., et al. 2021,
Perez, F., & Granger, B. E. 2007, Computing in Science
A&A, 654, A116, doi: 10.1051/0004-6361/202141128
and Engineering, 9, 21, doi: 10.1109/MCSE.2007.53
Zucker, C., Goodman, A. A., Alves, J., et al. 2022, Nature,
Portillo, S. K. N., Lee, B. C. G., Daylan, T., & Finkbeiner,
601, 334, doi: 10.1038/s41586-021-04286-5
D. P. 2017, AJ, 154, 132, doi: 10.3847/1538-3881/aa8565