2011-11-17 - SLIDE 1I242 - Fall 2011 Connecting Archival Collections: The Social Networks and Archival Context Project Ray R. Larson University of California,

Download 2011-11-17 - SLIDE 1I242 - Fall 2011 Connecting Archival Collections: The Social Networks and Archival Context Project Ray R. Larson University of California,

Post on 20-Dec-2015

212 views

Category:

Documents

0 download

Embed Size (px)

TRANSCRIPT

<ul><li> Slide 1 </li> <li> 2011-11-17 - SLIDE 1I242 - Fall 2011 Connecting Archival Collections: The Social Networks and Archival Context Project Ray R. Larson University of California, Berkeley School of Information Thanks to Daniel V. Pitti of the Institute for Advanced Technology in the Humanities, University of Virginia, and Brian Tingle of the California Digital Library for many of the slides here </li> <li> Slide 2 </li> <li> 2011-11-17 - SLIDE 2I242 - Fall 2011 SNAC Overview Funding and Timeline Project Team Project Objectives and Rationale Data Contributing Institutions Archival Standards Employed Extraction and Matching Prototype Interface </li> <li> Slide 3 </li> <li> 2011-11-17 - SLIDE 3I242 - Fall 2011 Funding and Timeline National Endowment for the Humanities A Preservation and Access, Research and Development grant Two-year project May 2010-April 2012 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia </li> <li> Slide 4 </li> <li> 2011-11-17 - SLIDE 4I242 - Fall 2011 Project Team Daniel Pitti (PI) and Worthy Martin (Institute for Advanced Technology in the Humanities, University of Virginia) Adrian Turner and Brian Tingle (California Digital Library, University of California) Ray Larson, Krishna Janakiraman (School of Information, University of California, Berkeley) Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia </li> <li> Slide 5 </li> <li> 2011-11-17 - SLIDE 5I242 - Fall 2011 Project Objectives Archival finding aids currently intermix description of records with description of the creators of records and persons evident in the records Goal: Using EAC-CPF, an International archival authority control standard facilitate the separation of the description of people from the description of records for archival description Goal: enhance the economy and effectiveness of archival description to enhance access and understanding of users of archives, libraries, and museums Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia </li> <li> Slide 6 </li> <li> 2011-11-17 - SLIDE 6I242 - Fall 2011 Data Contributing Institutions EAD-encoded finding aids Library of Congress (1159) Online Archive of California (15,400+) Northwest Digital Archive (5,563+) Virginia Heritage (8,390+) Authority records Library of Congress: NACO/LCNAF (3.8M personal names; 900K corporate names) Getty Vocabulary Program: Union List of Artist Names (293K personal and corporate names) Virtual International Authority File (intersection with NACO/LCNAF, 5M personal names) Other biographical sources (e.g., DBPedia, IMDB) </li> <li> Slide 7 </li> <li> 2011-11-17 - SLIDE 7I242 - Fall 2011 Methods and Processing Extract EAC-CPF records from existing EAD- encoded archival descriptions Extracting both creators and referenced CPF names Match EAC-CPF records against one another and against existing authority records (ULAN, VIAF, LCNAF); merge records for the same entity Enhance EAC-CPF by normalizing entries, adding alternative entries, titles (VIAF), and historical data (ULAN) Key challenge: two or more people with the same name; two or more names for the same person Create a prototype historical resource and access system Historical data and social-professional networks Links to archive, library, and museum resources (by and about) Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia </li> <li> Slide 8 </li> <li> 2011-11-17 - SLIDE 8I242 - Fall 2011 Components of Archival Description Description of records Context of creation: creators Functions and activities documented in records Dedicated descriptive semantics and structure for each component Components interrelated with one another </li> <li> Slide 9 </li> <li> 2011-11-17 - SLIDE 9I242 - Fall 2011 Records: EAD Encoded Archival Description Society of American Archivists and Library of Congress Used internationally English, Spanish, Dutch, French, and Chinese 1998, 2002 Official site at http://www.loc.gov/ead/ </li> <li> Slide 10 </li> <li> 2011-11-17 - SLIDE 10I242 - Fall 2011 What EAD Is An emerging encoding and structural standard for archival description Data structure Communication/interchange Finding aid / archival description Based on principles of ISAD(G): General International Standard Archival Description, Second edition </li> <li> Slide 11 </li> <li> 2011-11-17 - SLIDE 11I242 - Fall 2011 What EAD Is Not Content standard Data value standard Archival management system </li> <li> Slide 12 </li> <li> 2011-11-17 - SLIDE 12I242 - Fall 2011 Principals of Record Description Respect de fonds Provenance Original order Hierarchical and symmetrical Inheritance of description </li> <li> Slide 13 </li> <li> 2011-11-17 - SLIDE 13I242 - Fall 2011 Archival Records Records are the by-products of people living and working as individuals, in organized groups, in families Records document people living and working People exist in social-professional contexts, in relation to others Records document these relations All records created by the same entity are described together (a fonds or collection) Creators documented in detail Many of the people documented in the record referenced in description Archival descriptions document interrelations among people and records (documents) Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia </li> <li> Slide 14 </li> <li> 2011-11-17 - SLIDE 14I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia Source: J. Robert Oppenheimer Papers (LoC) Oppenheimer, J. Robert, 1904-1967 Oppenheimer, J. Robert, 1904-1967 Bethe, Hans Albrecht, 1906- --Correspondence Born, Max, 1882-1970 --Correspondence Boyd, Julian P. (Julian Parks), 1903- --Correspondence Bush, Vannevar, 1890-1974 --Correspondence Casals, Pablo, 1876-1973 --Correspondence Institute for Advanced Study (Princeton, N.J.) Los Alamos Scientific Laboratory EAD Elements </li> <li> Slide 15 </li> <li> 2011-11-17 - SLIDE 15I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia Source: Leonard Bernstein Collection (LoC) 1 Aaltonen, Erkki 1981 1 Abbado, Claudio 1963-90 5 [] EAD Elements </li> <li> Slide 16 </li> <li> 2011-11-17 - SLIDE 16I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia Biographical Sketch Jos Marcos Mugarrieta, prior to his term as Mexican consul in San Francisco 1857- 1863, served in the Mexican army from 1837. He saw action in numerous battles and campaigns Jamaica, under General Canalizo in 1841; Campeche, 1842-1843; Merida, 1843; Veracruz, 1845; Mexico City, 1846; Angostura and Cerro-gordo, 1847; Guanajuato, 1848, and Sierra-Gorda under Bustamante, 1848-1849; and Matamoros, 1849-1850. [] In April 1857 Mugarrieta received an appointment from the Comonfort government for the consulship in San Francisco. He did not actually begin his new duties until September 1, 1859, due to illness and to the political situation in Mexico. [] EAD Elements </li> <li> Slide 17 </li> <li> 2011-11-17 - SLIDE 17I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia Chronology 1900 Born on Jan. 20 in Hastings, Minnesota. 1922 Received baccalaureate from Princeton University, major in philosophy. [] 1965 Died on April 4. EAD Elements </li> <li> Slide 18 </li> <li> 2011-11-17 - SLIDE 18I242 - Fall 2011 The EAD DTD The EAD DTD is very complex and permits considerable flexibility in expressing the description and topics of the archival collection. The main parts are outlined on the following slides, but include: A header, including basic descriptive info. Optional frontmatter The archival description We will describe only a few of the top-level tags </li> <li> Slide 19 </li> <li> 2011-11-17 - SLIDE 19I242 - Fall 2011 Major Sections and DTD Defs EAD EADHeader: FILEDESC </li> <li> Slide 20 </li> <li> 2011-11-17 - SLIDE 20I242 - Fall 2011 Major Sections and DTD Defs The Archival Description: The Descriptive Identification </li> <li> Slide 21 GB 0133 TAB Tabley Muniments 19th century 1.24 cu.m Warren, family, of Tabley, Cheshire Warren, John Byrne Leicester, 1835-1895, 3rd Baron de Tabley, poet"&gt; </li><li> 2011-11-17 - SLIDE 21I242 - Fall 2011 Example EAD Record (Hub) GB 0133 TAB Tabley Muniments John Rylands University Library of Manchester 150 Deansgate Manchester... (Parts removed ) University of Manchester, John Rylands University Library of Manchester GB 0133 TAB Tabley Muniments 19th century 1.24 cu.m Warren, family, of Tabley, Cheshire Warren, John Byrne Leicester, 1835-1895, 3rd Baron de Tabley, poet </li> <li> Slide 22 </li> <li> 2011-11-17 - SLIDE 22I242 - Fall 2011 Example EAD Record (Hub) Administrative/Biographical History The poet John Byrne Leicester Warren, later 3rd and last Baron de Tabley, of Tabley near Knutsford, Cheshire, was born in 1835, the son of the 2nd Baron de Tabley (1811-1887), and his wife, Catherina. His mother was Italian, the daughter of the count de Soglio, and Warren spent much of his early childhood with her in Italy and Greece. He was educated at Eton and Christ Church, Oxford. At Oxford he published a volume of poetry. Originally he published under the pseudonyms George F. Preston (1859-1862) and William Lancaster (1863-1868), but latterly under his own name. His early verse included Praeterita (1863), Eclogues and Monodramas (1864), Studies in Verse (1865), Philocletes (1866), and Orestes (1868). His early work was Tennysonian in style, but he was later to be influenced by both Browning and Swinburne. In 1873 he produced . (some data removed) </li> <li> Slide 23 </li> <li> 2011-11-17 - SLIDE 23I242 - Fall 2011 Example EAD Record (Hub) Scope and Content The collection consists mainly of the personal papers of the 3rd Baron de Tabley. The papers reflect his interests in literature, politics, botany and numismatics and include correspondence with numerous prominent later Victorian figures. Attention should also be drawn to de Tableys extensive and important collection of armorial bookplates. Correspondents include Sir Mountstuart Grant Duff, Edmund Gosse, Lord Houghton, A.C.Benson, and Robert Bridges. There are volumes of Tabley's essays and verse, as well as a considerable number of notebooks and loose manuscripts of verse and other writings. There are various bundles and boxes relating to "Coins", "Botany", "Poetry", "Literary", "Financial" and bookplates. Preliminary survey list. There is correspondence with the 3rd Baron de Tabley among the Edward Freeman Papers, held at JRULM. The Library also has custody of the important Tabley Book Collection. The family and estate papers of the Leicester-Warren Family of Tabley are held by Cheshire Record Office. Some of these papers were originally in the custody of the John Rylands University Library of Manchester. </li> <li> Slide 24 </li> <li> 2011-11-17 - SLIDE 24I242 - Fall 2011 Example EAD Record (Hub) Index terms Tabley Inferior Cheshire SJ7378 Benson Arthur Christopher 1862-1923 Bridges Robert Seymour 1844-1930 Duff Sir Mountstuart Elphinstone Grant 1829-1906 Knight Gosse Sir Edmund William 1849-1928 Knight Milnes Richard Monckton 1809-1885 1st Baron Houghton Bookplates Botany Numismatics Poetry Modern 19th century </li> <li> Slide 25 </li> <li> 2011-11-17 - SLIDE 25I242 - Fall 2011 EAC-CPF EAD is now complemented by EAC or the Encoded Archival Context It is another XML-based standard for descriptions of record creators: corporate bodies, persons and families (CPF) It was developed as part of an international effort with hopes of being able to link and share information among archives having materials related to particular corporate bodies, persons and families </li> <li> Slide 26 </li> <li> 2011-11-17 - SLIDE 26I242 - Fall 2011 Transformation of EAD to EAC The EAD archival records containing many names are transformed using a complex XSLT transform to many EAC-CPF records one for each unique name in the EAD record </li> <li> Slide 27 </li> <li> 2011-11-17 - SLIDE 27I242 - Fall 2011 EAC-CP Data Examples Examples </li> <li> Slide 28 </li> <li> 2011-11-17 - SLIDE 28I242 - Fall 2011 Library and Archive Authority Control Library (or bibliographic) authority control is almost exclusively about the control of names Archival authority control involves biographical- historical description of the CPF entity Descriptions based on controlled vocabularies, for example, occupations, place of birth and death But also biographical-historical description Prose Chronological list Archival authority control provides context for understanding records, the context of their creation, the provenance </li> <li> Slide 29 </li> <li> 2011-11-17 - SLIDE 29I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia person Oppenheimer, J. Robert, 1904-1967. AACR2 Oppenheimer, J. Robert (Julius Robert), 1904-1967 VIAF Oppenheimer, Julius Robert, 1904-1967 VIAF Oppenheimer, Robert VIAF Ou-pn-hai-mo, 1904-1967 VIAF EAC-CPF example data </li> <li> Slide 30 </li> <li> 2011-11-17 - SLIDE 30I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia 1904, Apr. 22 1967, Feb. 18 Science--Societies, etc. Male Physicists. </li> <li> Slide 31 </li> <li> 2011-11-17 - SLIDE 31I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia 1904, Apr. 22 New York, N.Y. Born, New York, N.Y. 1943-1945 Los Alamos, N. Mex. Director, Los Alamos Scientific Laboratory, Los Alamos, N. Mex. 1954 (1) Denied security clearance [] (2) Published Science and the Common Understanding [] 1967, Feb. 18 Princeton, N.J. Died, Princeton, N.J. </li> <li> Slide 32 Bush, Vannevar, 1890-1974. recordId: DLC.ms998007.r007"&gt; </li><li> 2011-11-17 - SLIDE 32I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia Bush, Vannevar, 1890-1974. recordId: DLC.ms998007.r007 </li> <li> Slide 33 J. Robert Oppenheimer Papers, 1799-1980 (bulk 1947-1967) Papers 1799- 1980 (bulk 1947-1967) MSS35188 Oppenheimer, J. Robert, 1904-1967 Manuscript Division. Library of Congress Physicist and director of the Institute for Advanced Study, Princeton, New Jersey. [...] Topics include theoretical physics, development of the atomic bomb, the relationship between government and science, nuclear energy, security, and national loyalty."&gt; </li><li> 2011-11-17 - SLIDE 33I242 - Fall 2011 Daniel V. Pitti Institute for Advanced Technology in the Humanities University of Virginia J. Robert Oppenheimer Papers, 1799-1980 (bulk 1947-1967) Papers 1799- 1980 (bulk 1947-1967) MSS35188 Oppenheimer, J. Robert, 1904-1967 Manuscript Division. Library of Congress Physicist and director of the Institute for Advanced Study, Princeton, New Jersey. [...] Topics include theo...</li></ul>