Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 3

Batch importing

The basic concept behind the DSpace simple archive format is to create an archive, which is directory
full of items, with a subdirectory per item. Each item directory contains a file for the item's descriptive
metadata, the files that make up the item and a "contents" textfile listing the items to be imported.

archive_directory/
item_000/
dublin_core.xml
contents
file_1
file_2
item_001/
dublin_core.xml
contents
file_1
etc...

The dublin_core.xml file has the following format, where each Dublin Core element has its own entry
within a <dcvalue> tagset. A full list of the Dublin Core elements available in DSpace can be found in
“Dspace-dc-overview.pdf” (http://www.scribd.com/doc/15284184/Dspace-Dublin-Core-elements-
Overview). There are currently three tag elements available in the <dcvalue> tagset:

• <element> - the Dublin Core element


• <qualifier> - the element's qualifier
• <language> - (optional)ISO language code for element

Example:
<dublin_core>
<dcvalue qualifier="none" element="title">Stereoscopic Photographs of the Franklin Relics, No.
2</dcvalue>
<dcvalue qualifier="author" element="contributor">Cheyne, John Powles</dcvalue>
<dcvalue qualifier="created" element="date">2008</dcvalue>
<dcvalue qualifier="issued" element="date">1861</dcvalue>
<dcvalue qualifier="provenance" element="description">Lefoy, Jessie, bequest, 1941</dcvalue>
<dcvalue qualifier="medium" element="format">print, cardboard</dcvalue>
<dcvalue qualifier="none" element="subject">sailing ships</dcvalue>
<dcvalue qualifier="none" element="subject">equipment</dcvalue>
</dublin_core>
The Importing command
<Drive>:\<dspace install dir>\bin>dsrun org.dspace.app.itemimport.ItemImport -h

Using DSpace installation in: <Drive>:\<dspace install dir>

usage: ItemImport

-t,--test test run - do not actually import items

-R,--resume resume a failed import (add only)

-a,--add add items to DSpace

-c,--collection destination collection(s) Handle or database ID

-d,--delete delete items listed in mapfile

-e,--eperson email of eperson doing importing

-h,--help help

-m,--mapfile mapfile items in mapfile

-p,--template apply template

-r,--replace replace items in mapfile

-s,--source source of items (directory)

-w,--workflow send submission through collection's workflow

adding items: ItemImport -a -e eperson -c collection -s sourcedir -m mapfile

replacing items: ItemImport -r -e eperson -c collection -s sourcedir -m mapfile

deleting items: ItemImport -d -e eperson -m mapfile

If multiple collections are specified, the first collection will be the one that owns the item.
The network batch transfer system

It is possible to transfer the batch data to the repository over the network. This secure service uses the
SFTP protocol, for which native clients exist in Linux and Mac OS X, as well as many free clients for
Windows, like WinSCP.

You might also like