跳到主要內容

Source reference:http://cunycataloging.pbworks.com/w/page/25410674/Exporting%20data%20FROM%20MarcEdit

Exporting data FROM MarcEdit

Page history last edited by mberger@... 3 years ago
Needing to get data for a batch of ebooks to set them up for possible targets in SFX, I had to learn how to export from MarcEdit. There is a little tutorial from Terry on YouTube which is good for the basics but didn't help me untangle the multiple 020s in my records that needed to be separated into discrete fields.

Update your MarcEdit to version 5.2. This version has more options for export.

Exporting the MARC data
1. Start with a MARC file, not a .mrk file.
2. On the main MARCEdit menu, choose TOOLS
3. EXPORT TAB DELIMITED RECORDS
4. Set the file paths
     a. first folder=your MARC file
     b. second folder=destination file and folder. Select TAB DELIMITED .txt format!
5. Set field delimiter as TAB
6. click on "next"
7. Define your fields
     a. check off "normalize field data"--this gets rid of indicators and other stuff you don't need in your spreadsheet
     b. check off MARC since you are working with a MARC file
     c. select your fields using the drop down menu. Click on "add field" to add them. You may want to use only $a of 245. It is easier to edit your spreadsheet if you only have the main part of the title. If you do not indicate which subfield, $a is presumed. Make sure the subfields you want to export appear in the main box below the text "Define Fields/Subfields to Export."
8. MarcEdit will generate a message that the data has been exported.

Wrangling the data into Excel
1. Start Excel or your spreadsheet program of choice
2. Select "open" and open your data file. It should have a .txt file extension.
3. The text import wizard will prompt you to indicate if the file is delimited or fixed width. Choose "delimited" and click on "next"  if you do not need to deal with multiples of the same field, e.g. 020 or 500. This is the buggy part of working with the data. The utility in MarcEdit smooshes multiples of the same MARC field together.
4. Select "tab" as your delimiter and then click "next"
5. If you are working with numerical data like ISBNs, format that column as TEXT so that the data doesn't get read as a formula in Excel. Click "finish"
6. Open file. Edit. Use "replace" to clean up data if you need to remove quote marks and other punctuation that came over in the export. Save as Excel spreadsheet.
7. If you need to separate multiple 020s, 500s, etc.: 
      a. instead of opening the file in Excel as "delimited," choose "fixed width."
     b. Create link breaks as needed to separate your fields that repeat so they go into discrete columns.
     c. Format text as needed (see step 5 above). Select "finish"
     d. Clean up data as described in step 6 above.
tip: it works best to export non-repeating fields into one file and then export the repeating fields into another field and then merge the data after it has been fully worked on in Excel.

留言

這個網誌中的熱門文章

Regex - Converting formatted content notes (505) to unformatted

This is the question asked in the MarcEdit Listserv, and answered and solved by MarcEdit experts and librarians. Question: Converting formatted content notes (505) to unformatted. =505 00$tYour love$g(4:17) --$tThe driver /$r(featuring Dierks Bentley and Eric Pasley)$g(4:34) --$tDancing around it$g(4:38) --$tSouthern accents /$r(featuring Stevie Nicks)$g(4:15) --$tLonely girl$g(2:59) --$tThe only one who gets me$g(3:46) --$tRound in circles$g(4:17) --$tI wish you were here /$r(featuring Miranda Lambert)$g(3:48) --$tLeaving Nashville$g(3:29). To turn to =505 0\$aYour love -- The driver / featuring Dierks Bentley and Eric Pasley -- Dancing around it -- Southern accents / featuring Stevie Nicks -- Lonely girl -- The only one who gets me -- Round in circles -- I wish you were here / featuring Miranda Lambert -- Leaving Nashville. =505 0\$a[Contents] 1) By changing 2nd indicator to from 0 to "\" 2) Removing all subfields ($t, $r, $g) 3) Removing the time in paren
Exporting data FROM MarcEdit   http://cunycataloging.pbworks.com/w/page/25410674/Exporting%20data%20FROM%20MarcEdit              

Edit Subfield Data - 008 example

Scenario: =008  171126 t 20172016at\135\e\\\\\\\\\\vleng\d Change 008 position 6: 't'    --> to 'p' Turn to =008  171126 p 20172016at\135\e\\\\\\\\\\vleng\d Solve by... (1) Edit Subfield Data Field 008 Position: 6:1  (Position: length) (MarcEdit starts counting at zero) Find: t Replace with: p (2) Using regular expression  Find : (=008.{2}.{6})(t)(.*)  Replace : $1p$3  Note: MARC starts its count at zero and regular expression at 1