Professional Documents
Culture Documents
BORN DIGITAL
BORN DIGITAL
October 2013
Copyright © 2013 by Council on Library and Information Resources. This work is made available under the terms
of the Creative Commons Attribution-ShareAlike 3.0 license, http://creativecommons.org/licenses/by-sa/3.0/.
Contents
About the Authors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi
A Word on Terminology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
6. Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
Figures
Figure 1: Components of the transfer process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Figure 2: Digital media with physical damage . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
iv
promoted the use of digital forensics with an article in the journal Nature:
“The future of saving our past.” In 2012, as project lead for the Personal
Digital Manuscripts Project at the British Library, he published a technology
watch paper for the Digital Preservation Coalition entitled Digital forensics
and preservation. He is presently serving on the Committee of the Section for
Archives and Technology of the Archives and Records Association of UK
& Ireland. Memberships have included the Library Committee of the Royal
Society and the Advisory Committee of the National Cataloguing Unit for the
Archives of Contemporary Scientists.
Nancy Kuhl is curator of poetry of the Yale Collection of American
Literature at the Beinecke Rare Book and Manuscript Library at Yale
University (http://beinecke.library.yale.edu/blog/YCAL). She is the author
of exhibition catalogs and collections of poetry, and is co-editor of Phylum
Press, a small poetry publisher (http://www.phylumpress.com).
Seth Shaw is assistant professor of archival studies at Clayton State
University. Previously, he served as the electronic records archivist for Duke
University Archives and was responsible for everything born-digital in both
the University Archives and special collections. He received his B.S. in infor-
mation systems from Brigham Young University–Idaho in 2005 and his M.S.
in information, archives and records management from the University of
Michigan’s School of Information in 2007.
Susan Thomas is digital archivist for the Bodleian Library’s Western
Manuscripts department, where she has responsibility for developing and
managing the Library’s capacity to curate, preserve, and provide access to
born-digital archives. Susan has been involved in digital archives work—pro-
jects, day-to-day internal and collaborative work—since 2005. Until recently,
she co-tutored the Digital Archiving module on the Masters programs of
the Centre for Archive and Information Studies at the University of Dundee.
She currently serves as an external examiner to the Archives and Records
Management program at University College London.
Acknowledgments
We would like to thank the directors of our respective institutions for their
guidance and support. Thank you also to Erin O’Meara and Kari R. Smith for
serving as our external reviewers, Kathleen Fitzpatrick at MediaCommons
Press, Kathlin Smith and Brian Leney at the Council on Library and
Information Resources, and everyone who contributed comments and feed-
back on this report.
vi
Preface
Until recently, digital media and files have been included in archival acquisi-
tions largely as an afterthought. Rare book and manuscript dealers may be
uncertain about how to handle born-digital materials or how to assign mon-
etary and cultural value to them in relation to analog materials. Repository
staff may not be prepared to manage incoming digital acquisitions, much
less serve as advocates for the physical and digital well-being of born-digital
materials throughout the acquisition process. And donors, nervous about the
contents of disks and e-mail messages from long ago, may be torn between
wanting to preserve and promote their lives and work on the one hand, while
also maintaining privacy for themselves, their families, and their friends on
the other hand. The idea for a report offering guidelines for born-digital ac-
quisitions arose from direct experience with these and other challenges.
In August 2011, a group of 10 archivists and curators—then at the Bei-
necke Rare Book and Manuscript Library, Yale University; the Bodleian
Library, University of Oxford; the British Library; the David M. Rubenstein
Rare Book & Manuscript Library, Duke University; the Manuscript, Archives,
and Rare Book Library (MARBL), Emory University; and the Harry Ransom
Center, The University of Texas at Austin—began working together to devel-
op recommendations for the acquisition and transfer of born-digital materials
to archival repositories. Our objective was to offer guidance to help ensure
that digital materials are handled and documented appropriately and arrive
at repositories in good condition. The target audience we had in mind in-
cluded rare book and manuscript dealers, donors, repository staff, and other
custodians in possession of digital media and files.
Four of the six institutions were each represented by a digital archivist
who worked directly with digital media and an acquisitions specialist who
was directly involved with collection development and acquisitions. The re-
maining two institutions, the Bodleian Library and the British Library, were
each represented by one person whose position encompassed both types
of expertise. We hoped that including people with both kinds of experience
would expand the perspective of the group and, in particular, encourage ar-
chivists and curators to talk with each other about born-digital acquisitions.
Because the project had no budget and included participants from the
United Kingdom and from different regions in the United States, we used
tools such as Vidyo (videoconferencing), Google Docs, and Dropbox to col-
laborate remotely. We also took advantage of separately funded travel to
conferences and other events to meet in person. We shared existing documen-
tation from our own institutions as we began framing guidelines for born-
digital acquisitions. Each cohort also contributed a short case study illustrat-
ing the current practices and challenges related to born-digital acquisitions
vii
A Word on Terminology
The authors of Born Digital: Guidance for Donors, Dealers, and Archival
Repositories fall into two general groups: archivists and special collections cu-
rators. We have tried to use terminology that will be clear to all segments of
our intended audience, but there remains an unavoidable professional bias.
Certain terms used throughout the report will mean different things to archi-
vists, dealers, curators, and donors.
In some instances, we have chosen to sacrifice accuracy for consistency.
For example, throughout the report we use the term donor to refer to the
person, family, organization, estate, or other entity that sells, gives, deposits,
loans, or otherwise transfers born-digital materials to an archival repository.
In reality, sometimes this person or entity will also be the creator of the digi-
tal records or will be selling the materials rather than donating them. As you
read the following pages, we invite you to substitute whatever language is
necessary to make the report’s recommendations relevant within your par-
ticular context.
The following are definitions for some of the terms used most frequently
in Born Digital. These and other definitions have been incorporated into the
main body of the report when necessary.
1. Introduction
Born Digital: Guidance for Donors, Dealers, and Archival Repositories of-
fers recommendations to help ensure the physical and intellectual
well-being of born-digital materials transferred from donors to ar-
chival repositories. The main body of the report surveys the primary
issues and concerns related to born-digital acquisitions and is intend-
ed for a broad audience with varying levels of interest and expertise,
including donors, dealers, and repository staff.
Each of the following sections of the report provides an over-
view of a key area of concern:
• Initial Collection Review outlines the considerations and ap-
proaches that inform interactions among repository staff, donors,
and dealers prior to acquisition.
• Privacy and Intellectual Property addresses ethical and practical
concerns related to intellectual property rights as well as those as-
sociated with private and sensitive information.
• Key Stages in Acquiring Digital Materials addresses acquisition
agreements and contracts, the transfer process, and initial han-
dling once the digital materials arrive at a repository.
• Post-Acquisition Review by the Repository focuses on staff as-
sessment of the condition and contents of digital media and files
after their arrival at a repository, as well as issues related to reten-
tion, disposal, and neglect.
2.1.3 Communication
Surveys and other pre-acquisition strategies can be supplemented by
conversations with donors about their past and present computing
environments. Any information that repository staff can learn about
a donor’s computing habits, including the environmental conditions
under which computer media have been stored in the past, will help
them contextualize and preserve the digital materials.
Donors and dealers should ask about a repository’s readiness
to receive and store digital media and files; strategies for long-term
stewardship of digital material; and policies regarding privacy, cap-
ture and storage methods, and security. If a donor relies on a tech-
nical specialist to manage digital files, it may be helpful to involve
that person or team in the collection review process. Guidelines for
describing and handling digital items prior to offering the materials
to a repository can help all parties anticipate problems and formulate
solutions well in advance of the actual transfer of materials. Com-
munication between repository staff and donors about the technical
nature and extent of the digital materials in a prospective acquisi-
tion, as well as about issues of privacy and confidentiality, access
and restrictions, and expectations regarding capture and transfer,
can forestall potential processing difficulties and ultimately enable
repositories to serve as better stewards of born-digital content.
collection of digital files and, if so, whether the disk image may need
to be amended to address privacy concerns. Further, donors may
wish to transfer disk images to a repository with the understand-
ing that although individual files may be extracted from them and
made public, the disk images themselves will serve as a master copy
that remains untouched and is restricted to staff use for preservation
purposes.
In some cases, especially if digital files come to a repository
through an estate, materials may be hidden behind an unknown
login or password. Content may also reside behind subscriber-only
paywalls or in a private intranet (especially for organizational re-
cords). Any acquisition agreement should designate whether a donor
allows repository staff to decrypt passwords and logins, or to recover
deleted files, and whether the donor or dealer grants ownership of
files recovered by these methods. Decryption and file recovery may
be the only way to gain access to some digital content. All parties
should be aware that repositories may discover materials not intend-
ed to be included in a collection (as indeed has long happened with
paper archives), such as files created by a user other than the donor
or items that fall outside the scope of the acquisition.
which repository staff members have handled the media as the ac-
quisition moves through different departments and what actions
they have taken.
• Consider what will be done with files that are transferred but do
not fit the scope of the acquisition agreement.
• Document the details of the acquisition with a written agreement
or contract.
• Determine how digital materials will be transferred to the
repository.
• If possible, establish direct communication with the donor or
estate.
• Establish protocols for how digital materials should be handled
and documented from their arrival at the repository until they
reach the digital archivist (or other appropriate person).
• Be prepared to research, develop, and test new capture methods
for novel acquisition scenarios.
6. Conclusions
The stewardship of born-digital archival collections promises noth-
ing if not routine encounters with the unexpected. Unfamiliar or
unannounced file formats, hardware, and additions to collections
seem to be reliable constants as repositories increasingly collect born-
digital archival content.
This report describes good practices that can help reduce archi-
val surprises. Conducting thorough and clear surveys, interviews,
and other types of assessment prior to acquisition can help reduce
the occurrence of unexpected large additions to collections, unfa-
miliar media and hardware, and unanticipated expansions to the
scope of an acquisition. Documented and well-formed acquisition
policy and practices may alleviate ambiguity about the details of
transferring born-digital materials, such as timing, packing and ship-
ping standards, frequency of accruals, and “rogue data” transferred
unintentionally or not fully addressed in the acquisition agreement
or contract. Furthermore, to enhance the quality of digital transfers
and accessions, dealers and repositories should work to improve
understanding and documentation of donor computing habits. Early
archival intervention in records and information management will
help shape the impact on archives of user and donor idiosyncrasies
around file management and data backup.
Born Digital: Guidance for Donors, Dealers, and Archival Repositories 19
APPENDIX A
Potential Staffing Activities for the Repository
Activities undertaken by staff represent the largest cost element to a repository acquiring born-digital
archives. There can be many steps from initial contact to first capture, and the intensity of activity in each
may vary from collection to collection.
APPENDIX B
List of Resources and Related Projects
• Erway, Ricky. 2012. You’ve Got to Walk Before You Can Run: First
Steps for Managing Born-Digital Content Received on Physical Media.
Dublin, Ohio: OCLC Research.
• Erway, Ricky. 2012. Swatting the Long Tail of Digital Media: A Call
for Collaboration. Dublin, Ohio: OCLC Research.
• Barrera-Gomez, Julianna, and Ricky Erway. 2013. Walk this Way:
Detailed Steps for Transferring Born-Digital Content from Media You
Can Read In-house. Dublin, Ohio: OCLC Research.
Walker, Rob. 2011. Cyberspace When You’re Dead. New York Times
Magazine. Available at
http://www.nytimes.com/2011/01/09/magazine/09Immortality-t.
html?pagewanted=all&_r=0.
This article presents several examples of how families and
friends have handled an individual’s digital personal effects after he
or she died. Walker provides an overview of the key concerns related
to digital estate planning for oneself and the task of managing some-
one else’s digital legacy after death. Information includes
• case studies
• examples of digital estate management services and tools
24
APPENDIX C
Preparing for the Unexpected:
Recommendations
Donors, dealers, and repositories can anticipate and prepare for the complexities of transferring, manag-
ing, and preserving born-digital and hybrid collections. The following table organizes select recommenda-
tions from the report across the categories of policy, collaboration, preparation, and documentation.
• Know the circumstances under which the material was working environment with the need to respect donor
created. Was it part of your work? Your research? Your privacy and wishes.
personal correspondence? • Develop strategies to manage restriction, redaction,
and discovery of sensitive material that may have been
overlooked in the initial review.
• Ask repositories for guidance and documentation. • Prior to acquisition, gather information about archives
• If possible, establish direct communication with the containing born-digital content.
COLLABORATION
repository staff to facilitate understanding. • Share documentation with donors and dealers.
• Discuss the overall process, timetable, and likely outcomes • Clarify expectations regarding digital preservation and
with repository staff. access.
• Inform the repository of the presence of legally protected
private files, such as medical records, and especially
sensitive types of information, such as Social Security
numbers.
• Involve technical or legal support people in the process, if
needed or suggested by the repository staff.
• Consider the amount of time available to understand, and • Consider their potential future use when assessing the
contribute fully, to the process of transferring the digital value of disk images, deleted files, and automatically
material. saved files.
• Decide which content you want to transfer to the • Assess the collection’s location and complexity, need
repository. for ongoing communication, and possibility of remote
PREPARATION
• Flag areas of concern for repository staff. the digital files to a different repository.
• Carefully review all agreements. • Develop and distribute guidelines for the transfer of
digital files.
• Clearly define which materials are to be included in the
acquisition.
• Document how to handle sensitive information included
in the collection.
• Document the handling of passwords, encryption, and
deleted content.
• Document the type of access that users may have to
processed collections.
25
APPENDIX D
Checklist of Recommendations for
Donors and Dealers
❑ If offering digital media to a repository, determine whether all files on the media can be
captured and made accessible, or whether only certain types of files are accessible.
❑ Be prepared for the repository to retain the original digital media unless the acquisition
agreement indicates otherwise.
❑ Determine whether limited, specific files or information need to be restricted or redacted and
when those restrictions will expire.
❑ Consider seeking advice from a legal professional when working with the repository on a
contract or agreement.
❑ Consider involving a technical specialist in conversations with repository staff about what
guidelines to follow when copying and transferring media and files.
❑ Be prepared to have ongoing communications with the repository during the capture and
processing of digital materials.
❑ Determine whether computer equipment no longer used by the donor can be offered as part of
the acquisition.
❑ Consider writing a personal computing history to provide context for the digital media and
computers in the acquisition.
❑ Seek guidelines from the repository for shipping of hardware or transfer of digital files.
❑ Document the ways in which digital media and files have been stored, accessed, and
transported prior to their arrival at, or collection by, the repository.
❑ Be prepared for electronic copying and transfer to take a significant amount of time, and
for repository staff to develop new capture techniques to accommodate novel acquisition
scenarios.
APPENDIX E
Checklist of Recommendations for
Repositories
❑ Consider potential future use and access mechanisms when contemplating the value of disk
images, deleted files, and automatically saved files.
❑ Make sure donors and dealers are aware of the different kinds of deleted information that may
be present in their digital materials.
❑ Be realistic about restrictions, redaction, and the potential for sensitive material to be missed in
an initial review, considering
• legal restrictions
• donor requests for restrictions
• third-party restrictions (e.g., oral histories without permission forms)
• technical constraints (e.g., obsolete formats, corrupted files, other access problems)
• use restrictions (e.g., need to ensure authenticity and appropriate use by patrons)