load intelex past masters

Upload: dosithee

Post on 14-Apr-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/30/2019 Load Intelex Past Masters

    1/3

    LOAD INTELEX PAST MASTERS 9/1/11spg, rev. 3/18/13

    The Past Masters collections are either complete or very slow to change. There have been no updates since the initialload in 2010. Continue to check a couple of times a year.

    Each file has individual records plus a record for the set:

    1) British Philosophy: 1600-1900 (63 records)2) The Collected Works of John Dewey, 1882-1953 (first release) not on website, supplied separately (38)3) The Continental Rationalists (6)4) Immanuel Kant: Gesammelte Schriften (Akademie-Ausgabe), I-XXIII aka Haukptwerke not on website, supplied

    separately (20)5) Political Philosophy: Machiavelli to Mill (19)

    -- 146 total as of 2/8/12

    Its not clear how one can easily determine if there are new/updated records. For those sets whose records are on thewebsite you can check the site periodically, download the file of MARC records, and compare the number against whatyou know to be in the catalog.

    Go to: http://www.nlx.com/collections

    Double-click the desired collectionRight-click MARC Records and Save Target As to the InteLex Past Masters folder

    Double-click the zip file to open it. Then delete the zip file.

    Within the folder for the set there is a separate folder for each MARC record. Consolidate them using MarcEditMARCJoin:

    Source File: This is the file where the data will be saved. Give it the original file name followed by joined, e.g.britphil joined.

    File(s) to Join: This is the file or list of files to join to the destination file. Highlight all the files. This field has acharacter limit of 15000 characters so if the files selected are longer then that character buffer nothing will bereturned.

    The record numbered 00 in the URL (its usually the second record) is for the entire set.

    We may already have a record for the entire set that was entered manually. Delete that record from the catalog (andOCLC if needed) prior to batch loading.

    The records have MARC-8 character encoding

    Check to see if there are formats other than books by clicking Reports--Material Type Report. If other text formats, useitemtype ELECTRONIC TEXT.

    003s are the same among all the files: VaCvNLX, which is fine.

    The 001s are not unique, in fact every record in a set has the same code. ForBritish Philosophy: 1600-1900itsPM00205. Create unique 001s using the code for the set combined with the resource number in the URL. For example:

    001: PM00205URL: http://pm.nlx.com/xtf/view?docId=britphil/britphil.02.xmlUnique 001: PM002057-02

    ForThe Rationalists the only record that has an 003 and 001 is the record for the set. Add the 003 and 001 to all therecords. You can create unique 001s manually as the number of records so far is small.

    To create unique 001s in batch:

    1) Copy 856 to 956. Retain the original 856.

    2) Do Find--Replace to get rid of most of the 956:

    http://www.nlx.com/collectionshttp://www.nlx.com/collectionshttp://www.nlx.com/collections
  • 7/30/2019 Load Intelex Past Masters

    2/3

    Find: =956 40$uhttp://pm.nlx.com/xtf/view?docId=britphil/britphil.Replace: =956 40$u

    That leaves: =956 40$u02.xml

    3) Do another Find--Replace to get rid of the .xml:

    Find: =956 40(.*)\.xmlReplace: =956 40$1Choose: Use regular expressions. Click Replace All.

    The (.*) searches for any char, repeated 0 or more times, and saved in a "capture group." The\.xml saysyou want one example of a real character--in this case .xml--at the end of the string to be saved in thecapture group.

    In the replace field, we start with the same prefix =956 40 followed by the contents of the first capturegroup $1 and with no additional characters at the end of the string.

    That leaves: =956 40$u02

    4) Using Swap Field, append the contents of 956 |u to the end of the 001:

    Choose: Add to existing field. Click Process.

    That leaves: =001 PM002057$02

    5) Replace the $ with a hypen:

    Find: =001 PM002057$Replace: =001 PM002057-

    That leaves: =001 PM002057-02

    You are done! (No doubt there is a more efficient way to do this.)

    There are no 035s

    245 fields have the set name followed by the title, e.g. British philosophy 1600-1900. Francis Bacon. Novum Organum ..Leave these as is. Do not create a set-specific 830.

    One set had an odd 250 that needed fixing:

    Find: =250 ##$aElectronic edition.Replace: =250 \\$aElectronic edition.

    Delete any 506s.

  • 7/30/2019 Load Intelex Past Masters

    3/3

    Use Find--Replace to add a period at the end of 830 _0 |a Past masters.Do not trace the set name as it appears at the beginning of the 245

    The URL is a bit long. Use Find--Replace to insert |z Click to access before |u.

    Bulk import rule: EB035Operator ID: PASTMAST

    For the record for the entire set, add this note to the MFHD 852: |z Individual titles also cataloged separately

    Some sets have generated load errors. Follow up in the err.imp file.