Professional Documents
Culture Documents
Chino Inconsistencias en Catalogacion
Chino Inconsistencias en Catalogacion
Chino Inconsistencias en Catalogacion
To cite this article: Yue Li (2004) Consistency versus Inconsistency: Issues in Chinese Cataloging in OCLC, Cataloging &
Classification Quarterly, 38:2, 17-31, DOI: 10.1300/J104v38n02_04
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained
in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no
representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and
are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and
should be independently verified with primary sources of information. Taylor and Francis shall not be liable for
any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of
the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://
www.tandfonline.com/page/terms-and-conditions
Consistency versus Inconsistency:
Issues in Chinese Cataloging
in OCLC
Yue Li
Downloaded by [University of North Texas] at 01:50 27 November 2014
Yue Li, MLS, MA, is Assistant Librarian, George A. Smathers Libraries, P.O.
Box 117001, University of Florida, Gainesville, FL 32611 (E-mail: yueli@mail.
uflib.ufl.edu).
The author wishes to thank John Van Hook and Gerald Langford for reading a
draft of this article and Betsy Simpson and Tatiana Barr for their helpful comments.
Cataloging Classification Quarterly, Vol. 38(2) 2004
http://www.haworthpress.com/web/CCQ
2004 by The Haworth Press, Inc. All rights reserved.
Digital Object Identifier: 10.1300/J104v38n02_04 17
18 CATALOGING & CLASSIFICATION QUARTERLY
INTRODUCTION
April 10, 1991 in New Orleans, Louisiana. The group members now come not
only from North America but also from Australia, Mainland China, Taiwan,
Hong Kong, and many other countries and regions. After years of effort by the
Library of Congress, OCLC, and OCLC CJK Users Group, OLCC CJK
Cataloging is better serving a wide range of users all over the world and
playing an important role in information globalization.
Chinese cataloging is one of the major components of the OCLC CJK initia-
tive. In October 2000, the Library of Congress, OCLC and the Research Librar-
ies Information Network (RLIN) joined efforts to convert Chinese vernacular
romanization from Wade-Giles romanization to pinyin romanization in order to
conform to Chinese language transliteration practices in most parts of the world
and assure greater consistency. Chinese catalogers have received trainings and
they are equipped to create records using AACR2, the LCRIs, and the LC Pinyin
Romanization Guideline. Chinese cataloging in OCLC CJK is getting more and
more standardized and upgraded as a result. Up to now, there have been many
publications about the Library of Congress pinyin rules and various pinyin proj-
ects to upgrade bibliographic records, the national name and subject authority
files. However, there are very few publications focusing on some specific unre-
solved issues related to pinyin romanization, vernacular application, field coding,
and other aspects of Chinese records in OCLC. These issues often lead to incon-
sistencies in cataloging Chinese materials, though the projects like Pinyin Con-
version, Manual Review, and Pinyin Clean-up have been completed. In this
article, eight most commonly encountered issues and inconsistent practices in
Chinese cataloging in OCLC CJK are discussed. Examples from Chinese rec-
ords in WorldCat are used to demonstrate the problems they raise. After the dis-
cussion, it is hoped that these inconsistent practices can be recognized and
avoided in the future.
ISSUE ONE:
INCONSISTENCY IN ROMANIZING CHINESE VERNACULAR
$c 1989.
#13364665 260 Taibei : $b Lian he bao she : $b Zong jing xiao Lian jing chu ban
#25471437 260 [Taipei] : $b Zhonghua min guo qiao wu wei yuan hui, $c Min
guo 80 [1991]
TABLE 1. Search Results by Title “vt ⇥” in OCLC CJK from 1998-2001
ISSUE TWO:
INCOMPATIBLE SPACING IN PINYIN AND VERNACULAR FIELDS
vernacular scripts. The vernacular search and display provide faster and more
precise information retrieval and more accurate access points for both the
users and the librarians since one romanization will usually correspond to
many characters. For example, a search on pinyin “da” in OCLC CJK brings
out 28 characters and another search on “yi” brings out as many as 231 char-
acters. Furthermore, cataloging with vernacular characters greatly promotes
the globalization of information as well and better serves a wider range of
information seekers, both in North America and other parts of the world.
Downloaded by [University of North Texas] at 01:50 27 November 2014
Therefore, CJK catalogers are encouraged to catalog CJK materials with ver-
nacular characters in order to assist more effective and user-friendly retrieval.
For pinyin romanization, the guideline is to “separate the romanization of each
Chinese character with a space. This includes corporate names, terms of ad-
dress, and titles of royalty, separation of syllables of general, non-specific
geographic terms,” (2) although there was once a heated debate and dis-
agreement on this decision among the East Asian Studies community in
North America. Nevertheless, this guideline has been passed and imple-
mented.
Nonetheless, there remains the issue of whether aggregation or the logic
group is needed to facilitate better understanding of the information in Chi-
nese vernacular. Clearly, vernacular aggregation has had its own advantages
in that it makes it easier for readers to read and understand. Sometimes, read-
ing Chinese characters without spaces can be confusing, especially for the
classic Chinese literature, where distracting ambiguities occur when there are
no vernacular separations and appropriate punctuation as indicated in the
following example:
in OCLC like,
245 00 Zhongguo yin yue wen hua da guan / $c Jing Jing, Guan Jianhua, Qing
The consistent and common practice is without any space between characters:
Some records even use wrong and inappropriate aggregation, which causes
more confusion and inconsistency in practice. For example:
#46859171
# 46859171
260 Beijing: $b Ren min chu ban she : $b Xin hua shu dian jing xiao, $c 1999.
# 44721513
020 7810580590
100 1 ∾Б♢.
It can be seen that this book is published by “Shanghai da xue chu ban
she” (Shanghai University Press), which is not in Beijing. Common sense
should have told the cataloger that the field 260 had to be in error. If in
doubt, some research or inquiry should be done before the information is
used in a record. Look at another example:
24 CATALOGING & CLASSIFICATION QUARTERLY
#41342142
700 1 䌒ᑇߍ.
Many Chinese characters can have more than one pronunciation; “ߍ” is
one of them, which can be pronounced as “ao” and “wa.” It is difficult to get it
right just by looking up the character in a dictionary; for example, “䌒ᑇߍ” is
a very famous contemporary novelist in mainland China. The character “ߍ”
here is pronounced as “wa” rather than “ao.” It is pronounced the same as
“⌐” and it is often used as a place name. (5) By the way, mistakes of roman-
izing Chinese characters in the example is not corrected in this article. Cata-
logers should have access to authority files, which can help eliminate such
errors. When a bibliographical record is created before the author’s authority
record is established, knowledge of Chinese culture, literature, history, and
geography is essential to avoid making such mistakes.
ISSUE FOUR:
U.S. DOLLARS vs. CHINESE YUAN IN FIELD 020
According to the volume itself, its true price is 40.00 Chinese Ren Min Bi
Yuan. The Library of Congress practice should be:
Yue Li 25
According to the book in hand, the price of this book is 30.00 Chinese
Ren Min Bi Yuan and the price of 15 dollars could not be found anywhere in
the book. This coding does not agree with the Library of Congress practice
rule either.
4. 041 0 chieng 9
First of all, the books in this series were published in Chinese before. Re-
cently, these works were republished in Chinese with English translation, so
the first indicator will be “1” (“Item is or includes a translation”). This series
contains both Chinese and the English translation; therefore, subfield “a” will
be “chieng.” Also, the original language is Chinese; therefore, subfield “h”
will be “chi” (“Language code of original and/or intermediate translations of
text”). (6) Thus, number two is the correct recording.
26 CATALOGING & CLASSIFICATION QUARTERLY
Some items are originally published as the bilingual text and the typo-
graphical predominance on the title page is often reflected in the record. In
this case, the readership of the item should govern what the fixed field cod-
ing should be, “eng” or “chi,” and the 041 coding: 041 0 $a engchi. Catalog-
ers should be very careful in applying the correct coding in the future.
ISSUE SIX:
FAULTY INPUT OF CHINESE LANGUAGE SCRIPTS
Downloaded by [University of North Texas] at 01:50 27 November 2014
Generally speaking, there are two ways of writing Chinese vernacular: the
traditional form and the simplified form. When catalogers create Chinese
records with vernacular, they are required to record exactly what they see
from the item in hand. If the item uses traditional form, it should be recorded
with traditional form, and if the material adopts the simplified form, the sim-
plified one should be used. If there is a combination of both traditional and
simplified forms, catalogers should record them exactly the way the vernacu-
lar appears in the item. AACR2 does not allow catalogers to use either form
at will. Chinese catalogers familiar with the traditional form need to get used
to the simplified one, while those who are not familiar with the traditional
one need to study it as well. The following records, which have violated
these guidelines, are examples frequently encountered in OCLC CJK Cata-
loging:
# 50691433
245 00 Taiwan xin wen xue si chao shi gang / $c zhu bian L㫞u Zhenghui, Zhao Xiaqiu.
A check of the book shows that the publisher used the simplified Chinese
vernacular for “kun lun” and therefore it is inappropriate to record the pub-
lisher by using the traditional form in field 260. The correct recording of the
publisher should be in the simplified form:
In the next example, the original book used mixed forms with traditional and
simplified forms, the record, however, encoded all in the simplified form:
Yue Li 27
#48625003
245 00 ᄨᄺк⬏.
710 2 ЁᄨᄺӮ.
Downloaded by [University of North Texas] at 01:50 27 November 2014
245 00 ᄨᄺ⬿.
On Monday, May 5th, 2003, the first LC authority record with Chinese
vernacular was posted to the Council of East Asian Libraries (CEAL)
listserv. The record, the first LC authority record created using HKCAN as a
source, was created on February 3, 2003 and was acknowledged by the Li-
brary of Congress. (7) The problem mentioned above also exists in this rec-
ord, which used traditional vernacular form for the simplified one in the
original. Therefore, there still seems no consensus on this issue now even in
the very high level of administration concerned.
ISSUE SEVEN:
TRANSLATION OF NON-CHINESE CHARACTERS
IN THE PARALLEL DESCRIPTION FIELDS
Currently, more and more Chinese records for different materials and in
different formats contain Chinese translations of English data, such as the
GMDs “electronic resource” and “videorecording,” in 880 parallel vernacu-
lar data fields. For example:
#50857336
#47774066
245 00 Fa xian cao yuan $h [videorecording] : $b Chengjisihan jue qi de mi mi /
$c dao yan Han Yali.
This issue also crosses up in the field 310 (serials frequency) and in many
sorts of 5xx notes, such as those beginning with “Sponsored by.” MARC21
specifies that field 880 contain “the fully content-designated representation,
in a different script, of another field in the same record. In other words, 880
should be the same words in a different script.” (8) There is no provision for
translating data from a corresponding field into field 880. Chinese vernacular
characters ought to be present only to represent corresponding romanized
Chinese words, and those characters are mostly data transcribed from the
Downloaded by [University of North Texas] at 01:50 27 November 2014
item itself. “The perspective of OCLC quality control staff is that translating
data in field 880 is incorrect and results in a record which is a hybrid of two
languages of cataloging.” (9) Therefore, it is not acceptable to translate the
data in field 880. This issue was strongly restated recently in the letter to
OCLC CJK Users Group listserv by Hisako Kotaka, OCLC Senior Product
Manager: “OCLC prohibits users to create hybrid CJK records in WorldCat
by linking the English version notes to CJK vernacular script notes.” (10)
ISSUE EIGHT:
LACK OF TIMELY MAINTENANCE BY MEMBER LIBRARIES
As OCLC CJK contributors, librarians have been doing a great job over
all in cataloging as carefully as possible to guarantee the accuracy of infor-
mation and quality of the records. However, it is very difficult to reach the
goal of being one hundred percent correct. This raises the issue of maintain-
ing the high quality of the OCLC database. Many libraries, institutions, and
users may use the records in the OCLC database. They all should feel
obliged to promote the accuracy and safeguard the quality of the Chinese
records, not only the creators of the original records but everyone who uses
the database subsequently. Every use of a record should be regarded as a
good opportunity for review of the record. Whenever and wherever problems
or mistakes are found, they should be corrected or reported. Look at the fol-
lowing record:
020 7532607410
049 FUGG
245 00 Li dai ming ren bing cheng ci dian / $c [bian zhu zhe Long Qian'an,
250 Di 1 ban.
250 1 ⠜.
700 1 啭┯ᒉ.
700 1 ᴢᇣᵒ.
700 1 咗ᯣ.
987 PINYIN $b OU $d c
This record was created and updated holdings in OCLC CJK on May 15,
2002 and its physical description field 300 was missing. Later, twelve librar-
ies or institutions copied this record and added it to their own local catalogs
without reporting the error or doing any correction on the record. At least
some of the libraries noticed the missing 300 field because it was added in
their own OPAC. It was not until April 7, 2003 that the Hong Kong Univer-
sity of Science and Technology (HNK) corrected the record. The need for
record maintenance cannot be emphasized enough within the OCLC CJK
Users Group as well as to the catalogers using RLIN and the Library of Con-
gress catalog.
CONCLUSION
1,067,770 are Chinese records. This number is increasing every day. (11) As
can be seen, Chinese records make up almost half of the CJK records in
WorldCat and serve users all over the world. This article has addressed eight
issues that related to pinyin romanization, to the process of pinyin conversion,
and to other aspects of Chinese cataloging in OCLC CJK. The romanization of
“⇥” is a typical example of the problem related to the romanization and
conversion of personal names, place names, corporate bodies, and other entities.
In this case, it would be desirable for the Library of Congress to correct the
Downloaded by [University of North Texas] at 01:50 27 November 2014
REFERENCES
1. The Library of Congress, “OCLC, RLIN. Pinyin Conversion Project” http://
www.loc.gov/catdir/pinyin/timeline.html (June 2, 2003).
2. The Library of Congress, “ALA-LC Romanization Tables” http://www.
loc.gov/catdir/cpso/romanization/chinese.pdf (June 2, 2003).
3. Cathy Yang. “Cataloging Tips on Pinyin,” The eastlib mailing list of the Coun-
cil on East Asian Libraries (May 28, 2002).
4. The Library of Congress. “The Library of Congress Pinyin Conversion Project,
New Chinese Romanization Guidelines” http://www.loc.gov/catdir/pinyin/romcover.
html (Jan. 7, 2004).
Yue Li 31
5. Ci Hai Bian Ji Wei Yuan Hui. “Ci hai,” Shanghai: Shanghai ci shu chu ban she,
1999.
6. Online Computer Library Center. “OCLC Bibliographic Formats and Stan-
dards” http://www.oclc.org/bibformats/en,/ (June 2, 2003).
7. Lydia C. Hsieh. “First LC Authority Record Created Using HKCAN as a
Source” The eastlib mailing list of the Council on East Asian Libraries, sent by
Abraham Yu (May 05, 2003).
8. The Library of Congress. “MARC 21 Concise Bibliographic: Holdings, Location,
Alternate Graphics, etc. Fields (841-88X),” http://www.loc.gov/marc/bibliographic/
ecbdhold.html#mrcb880 (June 2, 2003).
Downloaded by [University of North Texas] at 01:50 27 November 2014