Table of Contents Enhancement of the Catalog INTRODUCTION As patron demand for information continues to grow, table-of-contents (TOC) enhancement of catalog records is becoming an increasingly common method of expanding access to previously "hidden" items in monographic collections and providing an in-depth view of library holdings. The vendor who initiated the distribution of TOC data in 1992 (Blackwell's) has recently extended the availability of its records to RLG, WLN, and the local-system vendor Innovative Interfaces. In addition, a few other vendors are probing the market. This report is intended to provide the library's senior management with the information necessary to make a decision on implementing TOC enhancement of Cornell's OPAC. It presents a profile of each vendor's product and service, a comparison of how well each would suit Cornell's particular environment and needs, and the considerations and concerns of those units or elements of CUL that would be most affected by enhancement of the catalog. It also discusses alternatives, contingencies, and questions of timing. Information on the vendors' services originates from direct conversations with the vendors' representatives and from their Web sites. Local input has come from speaking with staff in various departments in O/K/U, Mann, and central library administration. The organizations contacted that provided information based on their own experience include Columbia University; the University of California, Santa Barbara; the University of Pennsylvania; the University of Arizona at Tucson, and OhioLINK. REASONS FOR CONSIDERING TOC ENHANCEMENT
TOC SERVICES AVAILABLE Blackwell's
Yankee Book Peddler
WLN
RLIN
PALINET
Primary Source Media Chadwyck-Healy
Innovative Interfaces
PUBLIC SERVICES CONSIDERATIONS
TECHNICAL SERVICES CONSIDERATIONS
NOTIS (LMS) CONSIDERATIONS
SUITABILITY OF INDIVIDUAL VENDORS FOR CUL The following tables summarize the advantages and disadvantages of each vendor's services for CUL. The vendor profiles attached provide full descriptions of what each has to offer. BLACKWELL'S Blackwell's has mounted a detailed presentation of its services, including sample MARC records (illustrations 3-5), on its Web site. Advantages
Disadvantages
YANKEE BOOK PEDDLER Yankee is currently building a TOC file from records provided by PALINET and intends to test its product this winter. Advantages
Disadvantages
WLN WLN is now in the process of setting up a service to deliver Blackwell's records. The tapes are ready to load, and workflow is being determined. Advantages
Disadvantages
RLIN RLIN is supplying Blackwell's enhanced contents (505) fields with the content summaries (520) fields. Advantages
Disadvantages
PALINET PALINET is not a vendor itself, but is providing scanning and record formatting for Yankee. It would be interested in an agreement to do the same for CUL if the library wanted anything done retrospectively or TOC data for any special projects, such as items being transferred to the Annex. It provides all 3 levels of enhancement. PRIMARY SOURCE MEDIA, CHADWYCK-HEALY, INNOVATIVE INTERFACES These vendors are not suitable to Cornell's purposes. Primary Source Media and Chadwyck-Healy offer CD-ROM products, rather than additional search access to the OPAC. Innovative Interfaces has developed its service to function only in conjunction with the INNOPAC library management system. For the sake of completion, their descriptions are included among the vendor profiles attached. COST ANALYSIS
Blackwell's, WLN. Vendor charges are $.75 per record (20,000 x $.75 = $15,000). RLIN. The vendor charge is $.26 per ISBN numeric search for the record plus $.50 per record "PUT" (20,000 x $.76 = $15,200). Estimating that CTS staff could search and PUT a record every 2.5 minutes if the PUT function were working at normal speed, it would take 833.33 staff hours (21.37 weeks) to enhance 20,000 records. Yankee. The vendor charge is $.75 per record, and according to Scott Wicks, Cornell receives ca. 17,000 items per year from Yankee. Since Columbia has found that about 70% of its eligible records receive TOC data, the total cost here would be for only 11,900 records rather than 20,000 (11,900 x $.75 = $8,912) if we applied the same percentage to Cornell. ALTERNATIVE OR COMPLEMENTARY SERVICES RLIN Eureka Patrons already have access to Blackwell's records through Eureka, although there is no hook to holdings. Users must either check the locations in the RLIN bibliographic file to see if there are CUL holdings or search the main work in NOTIS. The University of Pennsylvania has created a direct link to the Eureka TOC file from the Web page listing its online databases ("Blackwell's Tables of Contents (essays and chapters)," labeled *NEW*). CUL might want to consider a similar device to increase public awareness of this resource. Palinet PALINET is currently producing between 100 and 200 TOC records per day from photocopies of contents sent by Yankee. That requires a staff of six, including a supervisor. As previously mentioned, they would be interested in special projects for Cornell. In-house scanning No academic libraries have been located that produce their own TOC records locally via optical scanning in the large quantities that are available through vendors. However, for the past three years Mann Library has been scanning TOC for the items on its "New Book Shelf" and has built up a file of several thousand images stored on a server. The intent is different from that of vendors, who are providing additional keyword or index terms for the OPAC. Mann's purpose is to give users a feel for the content--sometimes an image from the jacket is also scanned in. A link easily added to the catalog record (a URL in an 856 field) would allow users to retrieve the TOC image with a click of the mouse. This process would work with any "webbed" catalog and could possibly be applied to other CUL special collections. The service is accessed through the Mann Gateway: click on "What's New" and then "New Book Shelf" to bring up the page with the current "bibliography" of new books. Select any title to bring up its TOC image. The bibliography is updated every two weeks along with the physical books on the shelf as part of the regular workflow. Briefly, the process of creating the service (a joint project of Mann's Technical Services and Circulation staff) proceeds as follows:
CONTINGENCIES, CONSTRAINTS OPAC display As outlined in Storage Capability, CUL would have the option of taking TOC data at any of three levels: basic 505 fields, enhanced 505 fields, or local 9XX fields. (The vendor profile for Blackwell's includes technical descriptions distinguishing the three levels.) The two 505 options display the contents in the same familiar, block format with which patrons are well acquainted. In the 9XX field format, each item is clearly listed in a separate field. However, the elements in the subfields, such as title, author, author's name in reverse order, and page number are not separated by consistent punctuation, because those fields are not primarily intended for public view, but to recreate a facsimile display of the table of contents, which lacks internal punctuation. The display is certainly usable, and patrons would be able to discern the elements of information they are seeking, but it is not emanently readable, and the appearance is unpolished. Restricted Export of Blackwell's Data Any data from Blackwell's (including files obtained through WLN or RLIN) may not be exported. Therefore all contents (505) and content summaries (520) fields, no matter what their origin, would have to be flagged not to export. George Kozak has suggested a possible solution of having the vendor add a 988 field to TOC-enhanced records to control the bib-out process. Another alternative would be to take Blackwell's data into local (9XX) fields. The local fields could then be flagged not to export, leaving the contents (505) fields that were obtained from all other sources free to export on bib-out. CIP Records CTS Acquisitions is contracting with Yankee to provide cataloged records for full LC and un-upgraded CIP received from them. As explained under Integration with Repeat Search Processing, the CIP records will be upgraded when they are later run through OCLC Repeat Search. If the Repeat Search overlay occurred after TOC enrichment, the TOC data would be lost. Thus LTD would need to look for a satisfactory solution to issues of timing and integration with Repeat Search processing. RLIN Records The RLIN option would require CTS catalogers' time to search and PUT the records and thus necessitate revising the workflow to incorporate the process. It would also increase the current level of RLIN searching by CTS. LTD Priorities Establishing an appropriate time for implementation would depend on LTD staffing and its priorities (see Timing of Implementation, below). TIMING OF IMPLEMENTATION Since the currently scheduled date for bringing up a new LMS is summer 1999, CUL might well consider implementing the TOC enhancement before the library becomes heavily involved in planning for that major event. The TOC data would then migrate to the new system as part of the overall process. The introduction of the new Library Gateway will undoubtedly take a considerable amount of LTD's attention as of January 1998, as well as the Annex Inventory Control System and NOTIS version 6.4, but when those have been fully implemented, the time would be appropriate for initiating TOC enhancement. RECOMMENDATION Contracting with WLN seems to be the most advantageous and efficient option. WLN would then provide TOC data from Blackwell's (the vendor with the most numerous records and the most experience) in conjunction with CUL's authorities processing, thus eliminating the need for dealing with an additional TOC vendor. Taking 505 contents fields at this point, because of NOTIS limitations such as poor display of the 9XX fields, would be to impose limitations on the new LMS, because the two types of 505 fields contain either no, or fewer, designators for additional search capabilities or for a facsimile display of the contents. On the other hand, taking the data into local 9XX fields, which would offer more flexibility in the future, would in the interim present an OPAC display that some public services staff might find unacceptable. Because of the impact it would have on the OPAC display, it is important that the public services sector have the opportunity to discuss the form in which the data are acquired if TOC enhancement is implemented. Experimenting with a small test database before committing to a particular format could simplify the decision. If the project is undertaken, it will require close collaboration with LTD to overcome the difficulty presented by the restriction on the export of TOC data and to devise a schedule that coordinates the timing of TOC enhancement and Repeat Search processing. APPENDIX: VENDOR PROFILES Blackwell's Service: Since spring 1992 in the case of U.S./Canadian editions of English-language titles, and since fall 1995 in the case of U.K./European editions of English-language titles, Blackwell's has been electronically scanning and capturing tables of contents for most books covered by its Approval and New Titles Services; descriptive summaries and author affiliations, from book jackets, have been captured for many of the same titles. Blackwell's focuses on high-distribution monographic titles published by university, sci-tech, trade, and specialty publishers, whose titles are of interest to academic institutions. Approximately 800 records are created per week. Contents: Tables of contents that do not contribute to an understanding of works are excluded, e.g., tables of contents from dictionaries, novels, and travel guides. Blackwell's editors also determine the suitability of descriptive summaries (520 field). Summaries are concise and to the point. Optional Fields Used for Enhancement Libraries have the option of receiving TOC data in one of three formats: as a basic 505 contents field, as an enhanced 505 contents field, or as chapter-level 9XX fields, which some local systems can display as an online table-of-contents facsimile.
Search options: The basic 505 field allows only keyword searching. The enhanced 505 field provides keyword and title searching. The 9XX fields allow full searching, and the inverted form of the author's name makes possible the indexing of chapter-level authors along with monograph-level authors in the OPAC. The 9XX fields allow authority control. Comment: Blackwell's processes MARC data submitted on 9-track tapes, IBM 3480 cartridge tapes, IBM compatible diskettes, or via Internet/FTP. Records may be processed for enrichment weekly, monthly, quarterly, annually, or on an irregular basis. Blackwell's requires an agreement that its TOC data not be exported. Price: $.75 per record. Contact: Jeff Calcagno (800-547-6426) There is much more specific data available for Blackwell's service that is not included in this profile, including samples of each type of enhanced record, at its Web page. YANKEE BOOK PEDDLER Service: TOC will initially be provided as part of the cataloging for current materials only and limited to its approval customers. When the service begins, the file will contain 6 months of TOC data for 1997 imprints. In full production it hopes to add 1,000 records per week. Yankee delivers the contracted level of service when a book is shipped to a library. It does not provide subsequent record upgrade services (such as Blackwell does from a basic or enhanced 505 to 970 fields at no extra charge). Contents: Yankee will provide the equivalents of Blackwell's "basic 505," "enhanced 505," and "chapter level 970 fields" (cf. Blackwell's profile for descriptions). If Yankee doesn't have to do any additional programming, it will provide the 970 fields with the authors coded in inverted form. (Cf. the PALINET profile, which indicates that PALINET has included that in the Yankee template.) There are no contents summaries (520 fields). Search options: The basic 505 field allows only keyword searching. The enhanced 505 field provides keyword and title searching. The 9XX fields allow full searching, and the inverted form of the author's name makes possible the indexing of chapter-level authors along with monograph-level authors in the OPAC. The 9XX fields allow authority control. Comment: Yankee has received the data from PALINET, and the software has been tested, but the distribution mechanism will not be tried until this winter. In general, receipt of data from the supplier has been slower than expected. Price: $.75 per record. Contact: Judy McQueen (800-258-3774). WLN Service: Blackwell's TOC and summary records (520 field) for English-language titles published since April 1992 (currently 142,000 records; about 35,000 per year added in weekly batches of 800). Contents: See Blackwell's profile for descriptions. Search options: The basic 505 field allows only keyword searching. The enhanced 505 field provides keyword and title searching. The 9XX fields allow full searching, and the inverted form of the author's name makes possible the indexing of chapter-level authors along with monograph-level authors in the OPAC. The 9XX fields allow authority control. Comment: WLN has the Blackwell's file; it just needs to be loaded and the workflow established. WLN would handle authorities processing and TOC enhancement so that it would appear seamless to CUL. Blackwell's requires an agreement that its TOC data not be exported. WLN is also dealing with Yankee Book Peddler and intends to mount its file when Yankee markets its product (cf. Yankee Book Peddler profile). Price: $.75 per record. Contact: Sally Smith (800-DIAL WLN; ssmith@wln.com) RLIN Service: Blackwell's TOC and summary records (520 field) for English-language titles published since April 1992 (currently 142,000 records; about 35,000 per year added in weekly batches). Contents: RLIN provides the Blackwell "enhanced 505" field (cf. the Blackwell profile), which provides 3 internal subfields: labels, such as pt., sec.; author statement in natural order; and title. If a record also has a summary field, that must also be taken. There is no way to export just the 505 from RLIN's records. Search options: In the RLIN file the contents are intended to be searchable by exact author, keywords from the author, exact title, keywords in the title, and keywords in the summary fields. Currently the searching is not working properly. In NOTIS, searching would be by keyword only. There would be no authority control. Comment: There is no tape processing. Records must be identified via an individual search by the cataloger and manually "PUT" into NOTIS. Price: RLIN TOC data is stored in a file separate from the bibliographic file. The searcher locates the record by a search for the item's ISBN. $.26 for the search plus $.50 per record "PUT" = $.76 per record. Contact: RLG Information Center (800-537-7546; bl.ric@rlg.org) PALINET Service: PALINET has an exclusive contract to do the scanning and tagging of TOC data for Yankee Book Peddler. In regular production since May, by October it had produced 12,152 records from title pages that Yankee had sent. It hopes to produce 30,000 to 40,000 by the end of the year (500 to 1,000 per week). Contents: The template developed by PALINET provides internal subfields of the contents that include the inverted form of the author's name, authors' affiliations, translators, and compilers. From the representative's description, it seems to be on a par with Blackwell's records (cf. Blackwell's profile). The quality of the service itself, however, cannot be judged yet because PALINET has no track record. The Yankee representative reported that as of 12/1/97, PALINET is behind in production. Search options: The basic 505 field allows only keyword searching. The enhanced 505 field provides keyword and title searching. The 9XX fields allow full searching, and the inverted form of the author's name makes possible the indexing of chapter-level authors along with monograph-level authors in the OPAC. The 9XX fields allow authority control. Comment: PALINET could do for CUL exactly what it does for Yankee if CUL wanted any retrospective or special collection TOC enhancement (e.g., scan contents of materials going to the Annex). A possible alternative to an in-house operation? The contact was eager to tailor to Cornell's needs Contact: Jim Rush (215-382-7031; rush@palinet.org) PRIMARY SOURCE MEDIA Service: Bookscope, a CD-ROM containing 350,000 titles since 1989. Updated monthly; over 50,000 titles added per year. Records are supplied by Blackwell's. Contents: 70% of the records contain TOC information. In addition, the database provides: 1. detailed publisher information, including e-mail and Internet addresses; 2. book reviews; 3. summaries; 4. authors' biographies and affiliations; 5. conference proceedings titles. Search options: Keyword; exact phrase; author; title; subject, and ISBN--or a combination of fields; Boolean operators. Price: $595 Comment: Windows-based. Available for a 30-day preview. Contact: Robert Guarino (800-444-0799) CHADWYCK-HEALY Service: IDIOM, a CD-ROM containing 5,000 scholarly volumes in 15 key subject areas in the humanities and social sciences since 1994 (the history index goes back to 1984). Updated yearly; over 2,000 titles added per year. Contents: The core collection is the annual list "Outstanding Academic Books" in Choice. Books for College Libraries is used to identify additional relevant texts. Search options: Keyword; exact phrase; author; title; subject, or a combination of terms. Price: $595 for the backfile and for each update. Comment: Original plans called for a program of retrospective subject releases. The product has not sold well, and updates are currently on hold. Contact: Cheryl Crosby (800-610-9093) INNOVATIVE INTERFACES INC. Service: TOC records are automatically identified by Blackwell's and delivered over the Internet on a monthly basis. Search options: The basic 505 field allows only keyword searching. The enhanced 505 field provides keyword and title searching. The 9XX fields allow full searching, and the inverted form of the author's name makes possible the indexing of chapter-level authors along with monograph-level authors in the OPAC. The 9XX fields allow authority control and can display on the OPAC as a table-of-contents facsimile. Comment: Works only on the INNOPAC library management system. Contact: Anne Carson (800-878-6600, ext. 5115) NOTES 1. Claus Poulsen, "Tables of Contents in Library Catalogs: A Quantitative Examination of Analytic Catalogs," Library Resources & Technical Services 40:2 (April 1996): 133-138. (Top) 2. Cecily Johns, "Authority Control and Record Enrichment Outsourcing at the University of California, Santa Barbara," Outsourcing Library Technical Services Operations. Ed., Karen A. Wilson, Marylou Clover (Chicago: American Library Association, 1997). (Top) | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
