Professional Documents
Culture Documents
ContentServer PDF
ContentServer PDF
DOI 10.1007/s00799-012-0098-8
Received: 30 September 2011 / Revised: 14 August 2012 / Accepted: 3 September 2012 / Published online: 30 September 2012
© Springer-Verlag (outside the USA) 2012
Abstract In this article, we present a model based on Keywords Annotation · Medieval manuscript ·
the principles of Linked Data that can be used to describe Digital facsimile · Linked Data · Data model
the interrelationships of images, texts and other resources
to facilitate the interoperability of repositories of medieval 1 Introduction
manuscripts or other culturally important handwritten doc-
uments. The model is designed from a set of requirements There are many repositories and Digital Libraries that main-
derived from the real world use cases of some of the largest tain digitized images of medieval manuscript pages or other
digitized medieval content holders, and instantiations of the historically important handwritten documents. These images
model are intended as the description to be provided as input are often the only way in which scholars and students can
to collection-independent page turning and scholarly presen- interact with the material, due to the expense of travel or the
tation interfaces. A canvas painting paradigm, such as in PDF fragility of the original. Use of a digital surrogate comprised
and SVG, was selected due to the lack of a one to one correla- of the images and other resources increases the likelihood
tion between image and page, and to fulfill complex require- of its persistence, and as interactions with the physical copy
ments such as when the full text of a page is known, but only decrease its usable lifetime, the use of surrogates is attrac-
fragments of the physical object remain. The model is imple- tive to the owning institution as well as to humanities travel
mented using technologies such as OAI-ORE Aggregations budget managers. Therefore, it is essential that the digital
and Open Annotations, as the fundamental building blocks surrogate be as rich an experience as possible for the scholar,
of emerging Linked Digital Libraries. The model and imple- with access to the existing scholarship about the manuscript,
mentation are evaluated through prototypes of both content as discussed by Audenaert and Furuta [1]: the surrogate for
providing and consuming applications. Although the system the physical object is the humanist’s primary research data.
was designed from requirements drawn from the medieval Institutions holding medieval manuscripts have long
manuscript domain, it is applicable to any layout-oriented known the value of digitization and millions of grant and
presentation of images of text. industry dollars have been spent generating images of decay-
ing physical pages, yet, according to an estimate of Dr
Christopher de Hamel, Fellow Librarian of Corpus Christi
College, Cambridge, less than 1 % of existing medieval doc-
uments have been digitized to date. While the digitized man-
R. Sanderson (B) · H. Van de Sompel uscripts are unique, much of the effort to display the digi-
Research Library, Los Alamos National Laboratory,
Los Alamos, USA
tized material has been duplicated across institutions, with
e-mail: azaroth42@gmail.com each recreating very similar basic page-turning applications;
the only differences being for institutional branding and the
B. Albritton seemingly unique complexities of their documents. Further,
Stanford University, Stanford, CA, USA
practically all of the presentation effort has been used for
R. Schwemmer navigation within the institutional silos of images and texts,
e-codices, Fribourg, Switzerland rather than to implement cross-collection capabilities.
123
4 R. Sanderson et al.
At a series of meetings1 of content providers, scholars and interoperability between repositories, tools, services and
technologists from organizations such as the British Library, presentation systems. The primary domain of use is medieval
the National Library of France, Google Freebase, the Uni- manuscripts that have had their pages individually digitized
versity of Oxford, and the authors’ institutions, it was recog- and their text transcribed; the physical item is the artifact of
nized that to reduce the duplication of effort, a single shared interest, rather than the work in the FRBR sense [16]. Sev-
model for description of manuscripts was necessary. Such eral basic requirements were derived from this domain and
a model could be instantiated for various digital manuscript then expanded upon via the examination of more complex
collections and provided to a conforming display application. use cases, which were beyond the capabilities of existing
Furthermore, the “unique” rendering complexities of many systems. These extended requirements inform the design of
institutions’ documents were, in fact, shared to a very large the model, and therefore will be discussed in detail.
degree across many different collections and hence such a Requirements were identified in four main areas:
display application would require minimal, if any, customiza-
tion beyond branding. If this shared data model was at the 1. Images and their relationships with the physical object
same time sufficiently simple to allow for comprehension and 2. Texts and their relationships with the images
implementation, and sufficiently expressive to facilitate the 3. Sequencing of the images and texts
description of the multi-structured documents, then access 4. Rendering of the images and texts
to the digital surrogates would be greatly improved and the
duplicated time and effort could be rededicated to further dig- 2.1 Image requirements
itization or more complete descriptions and transcriptions.
A second goal identified at the meetings was to extend While in the most basic case there is exactly one image per
the notion of interoperability between manuscript reposito- manuscript page, multiple images will often exist. Those
ries from sharing of their resources to seamless integration images may differ in color, resolution, lighting conditions
between them. While displaying an image is a common base- or scale due to the positioning of either the page or the cam-
line capability, the experience of the scholar can be greatly era during digitization, and rectification through dewarping
enriched by involving materials from multiple repositories. transformations should be possible, as discussed by Baumann
Other repositories may contain the transcription of the text, and Seales [5]. Hence, the model must contain sufficient
processing services may be able to discover locations within information to determine the most appropriate image for the
the image of the transcribed text, and it should be possible user automatically, or at least be sufficiently well described to
to integrate this data into the display. Further value would allow the user to select the correct image for their purposes.
be added by links to Digital Libraries containing publica- To avoid unnecessary duplication of information, the model
tions regarding the manuscript or related material, scientific must also provide a means to avoid attaching information
data about the subject matter of the manuscript, or scholarly directly to any single image if it also pertains to other images.
annotations. Such capabilities would yield a coherent land- Images may also exist that depict only parts of the manu-
scape of interconnected systems, rather than the current set script page, such as very high-resolution images of beautiful
of disparate content silos. illuminations or decorated initial letters. In these cases, the
This article describes the steps taken towards meeting entire page may be left undigitized, only available at a much
these goals and, in Sect. 2, details the use cases and require- lower resolution, or only in black and white. This is espe-
ments that were generated from the medieval manuscript cially true of older digitization projects where microfilms
domain. The background work is described in Sect. 3, and of the non-illuminated pages are available. From a modeling
the abstract SharedCanvas model in Sect. 4. The technolo- perspective, the ability to connect the high-quality image with
gies used to instantiate the model are discussed in Sect. 5, and the appropriate part of the page in the full image is important.
experiments evaluating both the expressiveness for describ- The converse is also true, where part of the image depicts
ing manuscripts and the investment required for implemen- the entire page. Depending on the digitization workflow, the
tation are described in Sect. 6. image may depict more than just the page, such as parts of
the scanning bed or calibration tools such as color strips or
rulers, which would not be desirable to show to all users. In
2 Requirements and use cases this case, it is necessary for the model to be able to describe
the area within the image that depicts only the page.
The goal of this research is to provide a standardized descrip- A single image may depict multiple pages, such as an
tion of the digital resources that are surrogates for culturally image of an open spread of two pages. If these two pages
important, primarily text-bearing, physical objects to enable have pictures or text that cross between them (as in Fig. 1), it
is important to have mappings from the parts of the image to
1 See http://dmstech.group.stanford.edu/ the page, and a method for describing the spread as a whole.
123
SharedCanvas 5
123
6 R. Sanderson et al.
• Part Page/Whole Text: A transcription of a column, line, are collected together in quires (sets of normally 8 leaves),
word or character, perhaps as an annotation or note. The and the location of the boundaries between these is important
area on the page may be rectangular, but warping and information to scholars. Textual sections such as chapters or
decorative text also provides non-rectangular use cases verses are equally important for the humanist interested in
for text. particular parts of the text. Other features, such as the range
• Part Page/Part Text: Alignment of part of a larger tran- of pages at the beginning or end of a manuscript that do
scription to a part of the page at some level of granularity, not contain any text, are also important for navigation and
including area of the entire text, column, line, word or display of an automatically generated table of contents. To
character. enable the description of sections at any granularity, these
• Whole Page/No Text: Unfortunately, the most common sub-lists must be able to include parts of a page, since textual
scenario as very few manuscripts have been fully tran- and other features do not necessarily align with page breaks.
scribed. When the text is transcribed line by line, it is important
to be able to explicitly describe the reading order of those
In the case where an image of the page is not available, lines. Visual clues, such as size, location and color of the
the text may still be known from other copies, and this writing, may make the order clear for human readers, but
information should be available to the reader without a are difficult to interpret for a machine due to writing in the
depiction. margins (marginalia), writing between regular lines (inter-
Among the most challenging textual use cases are stitial text), decorated initials and the scribal tricks used to
palimpsests, manuscripts where one or more texts have been manually justify text into columns. The ability to express the
erased and the pages reused for another text. The original correct sequence of textual resources is just as important as
texts can be recovered using techniques such as multispec- the order in which to display the pages.
tral imaging as in the case of the Archimedes Palimpsest,
fully described by Reviel Netz and William Noel [25]. Some 2.4 Rendering requirements
images may therefore depict two or more completely sep-
arate texts, often perpendicular to each other. The identity Many of the rendering requirements have already been dis-
of each text is important, and could require rotation of either cussed with respect to the Images, Texts and Sequencing.
the image or rendered text for verisimilitude. The model must Further requirements include the ability to create and dis-
record this information and not assume a one to one relation- play scholarly annotations that discuss the images and text
ship between page or image and text, nor any mandatory at a very fine level of granularity.
rotational alignment. If the manuscript leaves were rebound A requirement expressed by many scholars is the abil-
when the second text was written, the first text would also ity to pan and zoom into an image to investigate minute
need a different page order. This brings us to page and text details that might give a clue as to the provenance of the
ordering requirements. manuscript, the identity of the scribe(s) or illuminator(s),
or just to assist in transcribing a particularly difficult pas-
2.3 Sequencing requirements sage. With the alignment of other resources such as the tran-
scribed text, this requires the rendering environment to be
Most page turning applications assume a single correct order able to maintain the alignments correctly at multiple res-
for the pages, yet many manuscripts have been disassembled olutions and regardless of the part of the facsimile being
and rebound over the centuries by well-meaning curators, displayed.
and intentionally or not, the page order has changed over The creation and maintenance of the manuscript descrip-
time. It should be possible to reconstruct the order as it was tion should be possible in a highly distributed and collabora-
at a particular point in time, without duplicating all of the tive environment, to enable the sharing of content and exper-
resources. tise between different communities and individuals. The ren-
The presentation of alternate paths within the same order dering application must be prepared to consume and display
is also an important use case when, as per Fig. 1, a spread resources from across many locations, not limited to a single
exists as a single image but also as separate pages. An ani- file or content silo as is the case currently.
mated page turning application, for example, should not try Although the discussion is in terms of page, image and
to display the image of the spread as a single page, nor should text, there are many other sorts of resource that would be
it animate the turn using the entire image. Yet for scholars, relevant, and bear the same properties as discussed above.
access to the image of the full spread is important to get as For example, a manuscript containing a musical score would
accurate a depiction as possible. benefit from having its score transcribed, perhaps in to Music
The description of subsets of pages from the ordered list is XML [26] or MEI [29], or an audio file with a performance of
also important. In the manuscript construction, process pages the music associated with the appropriate parts of the pages.
123
SharedCanvas 7
This enrichment of the original manuscript is only possible multiple versions and overlapping hierarchies, and Rehbein
with a digital facsimile, providing another motivating factor [28] considers the problem from the change of the text over
for scholars and the lay-person alike. time, as a medieval codex of law is updated over many years
Visualizing these resources and the relationships between as the laws of the town were revised.
them in an innovative way that promotes scholarship is an
interesting research challenge for future work. 3.3 Sequencing
123
8 R. Sanderson et al.
depicted with a blue dotted line within the Canvas. The “Text
Anno” node represents a Text Annotation that associates the
transcribed text with the region of the Canvas in which the
text is located, potentially to a curve rather than straight line.
The Image or Text itself may also have Segment Information
if only part of a larger resource is needed, such as a section
of a TEI transcription. A non-rectangular manuscript would
use this approach to paint only to the appropriate segments
of the Canvas.
The body of an annotation may also be a set of equiv-
alent resources from which a choice is made by the pre-
sentation application. A Choice set could then contain
all the equivalent images of a page, to be applied to a
Canvas, at different sizes, different color depths and so
forth. The alignment of these multiple images is enabled
by the independent coordinate system of the Canvas. The
same construction would allow for a choice of multi-
Fig. 3 Canvas, image and text; MS fr. 190/1 [8] ple Texts to be associated with a specific line segment.
The application would then select the appropriate text
need to be repeated for each of the images, some of which
based on properties such as author or the edition it is
may not be known by the system making the connection.
derived from, or a default as specified by the descrip-
For fragments, it must be possible to create annotations that
tion and present the other options to the user to choose
reference both digitized and undigitized sections of the man-
from.
uscript.
Scholarly annotations can also be applied to any of the
resources, as appropriate. If the annotation should be dis-
4.1 Image and text played regardless of the images and texts being used to ren-
der the Canvas, then it should target the Canvas. On the
Following the lead from canvas-based layout systems, the other hand, a criticism of a particular transcription should be
SharedCanvas model starts with a blank Canvas to be drawn attached to the transcription itself, so that if the transcription
on, which stands for a page in the manuscript. In Fig. 3 the is replaced, the annotation would no longer be displayed.
Canvas is depicted as a rectangle with a dashed black border. These annotations can come from any source and pull in
The Canvas has its own dimensions or aspect ratio that may or external resources that otherwise would not have been known
may not be the same as any image, however, the top left hand about.
corner of the canvas is chosen to correspond to the top left The Type of annotation is used to convey the expected
hand corner of a bounding box around the manuscript page, presentation behavior to the presentation system. There are
and similarly for the bottom right hand corners of canvas and two major types of annotation in this regard, ContentAn-
page. The most common case is a rectangular page, however, notation and CommentAnnotation. The former associates
non-rectangular manuscripts exist such as the heart-shaped content to be rendered as part of the digital facsimile with
chansonnier with the shelfmark BNF Rothschild MS 2973 the appropriate part of the Canvas, whereas the latter is a
[37]. comment or discussion about the facsimile or object that
Images and texts are then overlaid on top of the blank it represents. The different classes allow for the easy dis-
Canvas using a stand-off Annotation Paradigm, following the tinction between annotations that overlay images or tran-
approach taken by Brugman, Bauer and others. Using anno- scriptions on the canvas and more traditional scholarly
tations to paint the canvas, the number of technology depen- annotations. By recognizing the differences and similarities
dencies is reduced, as the parsing and rendering of scholarly between these types of annotation, the display system should
annotations is also required. For example, the body of an require less implementation effort than if the layout were
Image Annotation is the Image and the target is the Canvas. done in a completely different method from regular annota-
In Fig. 3, the “Img Anno” circle represents an annotation tions.
of this type that associates the Image resource with the full Areas, similar to a Canvas in that they have dimensions,
Canvas. may be defined that allow multiple annotations of any type
To paint either an image or text on the appropriate region to be grouped together into a logical unit and then further
of the Canvas, Segment Information is used. In the example, operated upon. These Zones are the targets of the Anno-
the information records the location of the bounding box tations, and are then painted onto one or more Canvases
123
SharedCanvas 9
4.2 Sequencing
123
10 R. Sanderson et al.
instantiated using specific technologies. Although most An example of Open Annotation where a rectangular sec-
manuscript transcription and description work has taken tion of an image should be overlaid on top of a Canvas, in
place to date using XML, and notably with the TEI, METS the relatively readable Turtle [6] syntax:
[27] and ALTO [3] schemas, many of the use cases require
a graph rather than an XML tree. To permit the distributed
creation and use of the model simultaneously by multiple
tools and repositories, an approach that follows the archi-
tecture of the web [19] is necessary. Linked Open Data [7]
using RDF [21] fits these requirements for a web-centric
graph.
123
SharedCanvas 11
6 Experimentation
Fig. 6 Multi-classed list, Aggregation, Sequence resource 6.1 Image and text layout
123
12 R. Sanderson et al.
123
SharedCanvas 13
6.2 Sequencing Here, Canvases representing the missing leaf are added to
the Sequence that describes the manuscript’s original, rather
6.2.1 Parker CCC 286 than current, state. In addition, some content is known about
the verso side of the missing page—it contained an illustrated
Many examples exist of manuscripts that have had pages frontispiece to the Book of Matthew—which can be attached
physically removed or destroyed at some point in their his- to the Canvas through an annotation.
tory. For example, Parker CCC 286 [36] is missing a leaf This modeling is depicted in Fig. 11. There are two
between the current second and third pages. To properly rep- Sequences, “286 Curr” that represents the current state of
resent this manuscript in its original state, the fact that the the manuscript, and “286 Orig” that represents the origi-
page has been physically removed at some point in time must nal state with the additional leaf. The missing leaf has two
be made explicit. Canvases, for recto and verso-labeled Canvas 2a and 2b,
123
14 R. Sanderson et al.
123
SharedCanvas 15
123
16 R. Sanderson et al.
meetings, and the institutions that own the manuscripts for allowing 19. Jacobs, I., Walsh, N.: Architecture of the World Wide Web, vol.
their use in this publication, including Christopher De Hamel, Don- 1. Technical Report, W3C Recommendation, W3C, 15 December
nelly Fellow Librarian, and the Master and Fellows of Corpus Christi 2004 (2004)
College, Cambridge. 20. Liesaputra, V., Witten, I.: Seeking information in realistic books:
a user study. In: Proceedings of the 8th Joint Conference on Digital
Libraries, USA (2008)
21. Manola, F., Miller, E., (eds): RDF Primer, W3C Recommendation,
References W3C, 10 February 2004 (2004)
22. Marshall, C., Bly, S.: Turning the page on navigation. In: Proceed-
1. Audenaert, N., Furuta, R.: What humanists want: how scholars ings of 5th Joint Conference on Digital Libraries, USA (2005)
use source materials. In: Proceedings of 10th Joint Conference on 23. Moap, G.: Lancelot du Lac, c. 1470. Held at: Bibliotheque
Digital Libraries, Australia (2010) Nationale de France, Shelfmark: Fonds Francais, pp. 113–116.
2. Baechler, M., Ingold, R.: Medieval manuscript layout model. http://gallica.bnf.fr/ark:/12148/btv1b60000903
In: Proceedings of ACM Symposium on Document Engineering, 24. Murer, H.: Diuae Virg: Mariae Sintlacisaugia, c. 1627. Held at
England, pp. 275–278 (2010) Frauenfeld, Kantonsbibliothek Thurgau, Shelfmark Y 112. doi:10.
3. Bauer, J., et al. (eds.): ALTO Technical Metadata for Optical Char- 5076/e-codices-kbt-y112
acter Recognition, Library of Congress (2010). http://www.loc. 25. Netz, R., Noel, W.: The Archimedes Codex: Revealing the Blue-
gov/standards/alto print for Modern Science, Phoenix (2008). ISBN 0753823721
4. Bauer, P., Hernath, Z.: A data model supporting conflict resolving 26. Recordare MusicXML 3.0 Specification. http://www.recordare.
in cooperative text editions-HypereiDoc. In: Proceedings of the com/musicxml/specification. Accessed 29 Sep 2011 (15:00:00
First International Conference on Networked Digital Technologies, MST)
Czech Republic, pp. 476–481 (2009) 27. METS Editorial Board.: METS: Metadata Encoding & Transmis-
5. Baumann, R., Seales, W.B.: Robust registration of manuscript sion Standard. http://www.loc.gov/standards/mets/
images. In: Proceedings of 9th Joint Conference on Digital 28. Rehbein, M.: Reconstructing the textual evolution of a medieval
Libraries, USA, pp. 263–266 (2009) manuscript. Lit. Linguist. Comput. 24(3), 319–327 (2009). doi:10.
6. Beckett, D., Berners-Lee, T.: Turtle-Terse RDF Triple Language, 1093/llc/fqp020
W3C Team Submission, 14 January 2008 (2008) 29. Roland, P.: The Music Encoding Initiative, MEI. In: Proceedings of
7. Bizer, C., Cyganiak, R., Heath, T.: How to publish linked the First International Conference on Musical Applications Using
data on the Web (2007). http://sites.wiwiss.fu-berlin.de/pub/ XML (2002)
LinkedDataTutorial/ 30. Sanderson, R.: Linking past and future: an application of dynamic
8. Boccace, J.: Des cas des nobles hommes et femmes, c. 1410, Paris. HTML for medieval manuscript editions. Ph.D. Thesis, University
Held at Bibliotheque de Geneve, Geneva. Shelfmark: Ms. fr. 190/1. of Liverpool (2003)
doi:10.5076/e-codices-bge-ft0190-1 31. Sanderson, R., Ciccarese, P., Van de Sompel, H., (eds): Open Anno-
9. Brugman, H., Malaise, V., Hollink, L.: A common multimedia tation Core Data Model, W3C Community Draft, 09 May 2012.
annotation framework for cross linking cultural heritage digital http://www.openannotation.org/spec/core/ (2012)
collections. In: Proceedings of the 6th International Conference of 32. Sanderson, R., Albritton, B., (eds): Shared Canvas Data Model,
Language Resources and Evaluation, Morocco (2008) June 01 2012. http://www.shared-canvas.org/datamodel/spec/
10. Bulacu, M., et al.: Layout analysis of handwritten historical docu- (2012)
ments for searching the archive of the Cabinet of the Dutch Queen. 33. Schmidt, D., Colomb, R.: A data structure for representing multi-
In: Proceedings of the 9th International Conference on Document version texts online. Int. J. Hum. Comput. Stud. 67, 497–514
Analysis and Recognition, Brazil, pp. 357–361 (2007) (2009). doi:10.1016/j.ijhcs.2009.20.001
11. Clark, J.: XSL Transformation (XSLT) Version 1.0, W3C Recom- 34. TEI Consortium (eds): TEI P5: guidelines for electronic text encod-
mendation, 16 November 1999, W3C (1999) ing and interchange, TEI Consortium, November 2007. http://
12. Dahlstrom, E., et al. (eds): Scalable Vectore Graphics (SVG) 1.1, www.tei-c.org/Guidelines/P5/ (2007)
W3C Working Draft, 22 June 2010, W3C (2010) 35. Troncy, R., Mannens, E., (eds): Media Fragments URI 1.0, W3C
13. Doumat, R., Egyed-Zsigmond, E., Pinon, J.: Digitized ancient doc- Working Draft, W3C, 17 December 2009 (2009)
uments.. what’s next?, pp. 195–210. INFORSID, France (2009) 36. Unknown Gospels of St. Augustine c. 600 Held at Parker Library,
14. Froissart, J.: Chroniques, c. 1400. Held at Morgan Library, New Corpus Christi College, Cambridge. Shelfmark: CCCC MS 286
York City, NY. Shelfmark: M.804 37. Unknown Chansonnier Cordiforme c. 1475. Held at Bibliotheque
15. Hickson, I.: HTML5. A vocabulary and associated APIs for HTML Nationale de France, Shelfmark: BNF Rothschild MS 2973
and XHTML, W3C Working Draft, W3C, 19 October 2010 (2010) 38. Van de Sompel, H., Lagoze, C., et al.: Adding eScience assets to the
16. IFLA Study Group on FRBR: Functional requirements for biblio- Data Web, linked data on the Web Workshop. In: 18th International
graphic records. Final Report, International Federation of Library World Wide Web Conference, Spain (2009)
Associations and Institutions (2009) 39. Vincent of Beauvais, Speculum Historiale, c. 1325. Held at Corpus
17. von Arx, I.: Veterum Fragmentorum Manuscriptis Codicibus Christi College, Cambridge, Parker Library, Shelfmark CCCC MS
detractorum collectio Tom. I., 1822, Held at St. Gallen, 008. http://purl.stanford.edu/cv176gb0028
Stiftsbibliothek, Shelfmark Cod. Sang. 1394. doi:10.5076/
e-codices-csg-1394
18. Iso, TC 171/SC 2: ISO 32000–1:2008 Document Management-
Portable Document Format-Part 1: PDF 1.7. http://www.iso.org/
iso/catalogue_detail.htm?csnumber=51502 (2008)
123
Copyright of International Journal on Digital Libraries is the property of Springer Science & Business Media
B.V. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright
holder's express written permission. However, users may print, download, or email articles for individual use.