跳到主要內容

Source reference:http://cunycataloging.pbworks.com/w/page/25410674/Exporting%20data%20FROM%20MarcEdit

Exporting data FROM MarcEdit

Page history last edited by mberger@... 3 years ago
Needing to get data for a batch of ebooks to set them up for possible targets in SFX, I had to learn how to export from MarcEdit. There is a little tutorial from Terry on YouTube which is good for the basics but didn't help me untangle the multiple 020s in my records that needed to be separated into discrete fields.

Update your MarcEdit to version 5.2. This version has more options for export.

Exporting the MARC data
1. Start with a MARC file, not a .mrk file.
2. On the main MARCEdit menu, choose TOOLS
3. EXPORT TAB DELIMITED RECORDS
4. Set the file paths
     a. first folder=your MARC file
     b. second folder=destination file and folder. Select TAB DELIMITED .txt format!
5. Set field delimiter as TAB
6. click on "next"
7. Define your fields
     a. check off "normalize field data"--this gets rid of indicators and other stuff you don't need in your spreadsheet
     b. check off MARC since you are working with a MARC file
     c. select your fields using the drop down menu. Click on "add field" to add them. You may want to use only $a of 245. It is easier to edit your spreadsheet if you only have the main part of the title. If you do not indicate which subfield, $a is presumed. Make sure the subfields you want to export appear in the main box below the text "Define Fields/Subfields to Export."
8. MarcEdit will generate a message that the data has been exported.

Wrangling the data into Excel
1. Start Excel or your spreadsheet program of choice
2. Select "open" and open your data file. It should have a .txt file extension.
3. The text import wizard will prompt you to indicate if the file is delimited or fixed width. Choose "delimited" and click on "next"  if you do not need to deal with multiples of the same field, e.g. 020 or 500. This is the buggy part of working with the data. The utility in MarcEdit smooshes multiples of the same MARC field together.
4. Select "tab" as your delimiter and then click "next"
5. If you are working with numerical data like ISBNs, format that column as TEXT so that the data doesn't get read as a formula in Excel. Click "finish"
6. Open file. Edit. Use "replace" to clean up data if you need to remove quote marks and other punctuation that came over in the export. Save as Excel spreadsheet.
7. If you need to separate multiple 020s, 500s, etc.: 
      a. instead of opening the file in Excel as "delimited," choose "fixed width."
     b. Create link breaks as needed to separate your fields that repeat so they go into discrete columns.
     c. Format text as needed (see step 5 above). Select "finish"
     d. Clean up data as described in step 6 above.
tip: it works best to export non-repeating fields into one file and then export the repeating fields into another field and then merge the data after it has been fully worked on in Excel.

留言

這個網誌中的熱門文章

Regex - Converting formatted content notes (505) to unformatted

This is the question asked in the MarcEdit Listserv, and answered and solved by MarcEdit experts and librarians. Question: Converting formatted content notes (505) to unformatted. =505 00$tYour love$g(4:17) --$tThe driver /$r(featuring Dierks Bentley and Eric Pasley)$g(4:34) --$tDancing around it$g(4:38) --$tSouthern accents /$r(featuring Stevie Nicks)$g(4:15) --$tLonely girl$g(2:59) --$tThe only one who gets me$g(3:46) --$tRound in circles$g(4:17) --$tI wish you were here /$r(featuring Miranda Lambert)$g(3:48) --$tLeaving Nashville$g(3:29). To turn to =505 0\$aYour love -- The driver / featuring Dierks Bentley and Eric Pasley -- Dancing around it -- Southern accents / featuring Stevie Nicks -- Lonely girl -- The only one who gets me -- Round in circles -- I wish you were here / featuring Miranda Lambert -- Leaving Nashville. =505 0\$a[Contents] 1) By changing 2nd indicator to from 0 to "\" 2) Removing all subfields ($t, $r, $g) 3) Removing the time in paren...

Convert formatted contents notes to unformated - example 2

Question: I wonder whether anyone has found a way to convert formatted content notes to unformatted .  An example is: =505  00$tOn comparison /$rJonathan Z. Smith --$tPolis-religion and its alternatives in the Roman provinces /$rGreg Woolf --$tFrom republic to principate : priesthood, religion and ideology /$rRichard Gordon --$tFeature of Roman religion /$rArthur Darby Nock --$tLudi saeculares and the Carmen saeculare /$rDenis Feeney --$tCults, myths, and politics at the beginning of the empire /$rJohn Scheid, trans. Philip Purchase --$tTheological efforts of the Roman upper classes in the first century BC /$rArnaldo Momigliano --$tHierarchy and structure in Roman polytheism : Roman methods of conceiving action /$rJohn Scheid, trans. Philip Purchase --$tReligious toleration in republican Rome /$rJohn North --$tReligion for the empire /$rClifford Ando --$tLoca sancta /$rSabine MacCormack --$tComplex of times : no more sheep on Romulus' birthday /$rMary Beard --...
Some definitions for Regular Expression - Literal, Metacharacter, target string, escape, sequence and search expression Literal - A literal is any character we use in search or matching expression, for example to find ind in w ind ows the ind is a literal string - each character plays a part in the search, it is literally the string we want to find. Metachatacter - A metacharacter is one or more special characters that have a unique meaning and not NOT used as literals in the search expression, for example, the character ^ (circumflex or caret) is a metacharacter. [ ] - ^ 1. A simple find and replace to change the indicator and first subfield code. Find: =505 00$t Replace: =505 0\$a Result: =505  0\$aYour love$g(4:17) --$tThe driver /$r(featuring Dierks Bentley and Eric Pasley)$g(4:34) --$tDancing around it$g(4:38) --$tSouthern accents /$r(featuring Stevie Nicks)$g(4:15) --$tLonely girl$g(2:59) --$tThe only one who gets me$g(3:46) --$tRound in circles$g...