hathitrust and print storage building around a digital core

60
HathiTrust and Print Storage Building around a digital core

Upload: taylor-purcell

Post on 27-Mar-2015

221 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: HathiTrust and Print Storage Building around a digital core

HathiTrust and Print Storage

Building around a digital core

Page 2: HathiTrust and Print Storage Building around a digital core

HathiTrust Content Growth

Page 3: HathiTrust and Print Storage Building around a digital core

Content Distribution

* As of May 1, 2011

8,625,158 Total volumes2,297,041 Public Domain4,722,664 Book titles209,930 Serial titles

Page 4: HathiTrust and Print Storage Building around a digital core

Content Distribution

* As of May 1, 2011

Page 5: HathiTrust and Print Storage Building around a digital core

Dates

* As of May 1, 2011 Statistics and Visualizations

Page 6: HathiTrust and Print Storage Building around a digital core

Breakdown of HathiTrust book corpus by publication date

Bibliographic Indeterminacy and the Scale of Problems and Opportunities of "Rights" in Digital Collection Building – 2/2011

Page 7: HathiTrust and Print Storage Building around a digital core

Breakdown of HathiTrust book corpus by publication date

Page 8: HathiTrust and Print Storage Building around a digital core

Language Distribution (1)

The top 10 languages make up ~86% of all content

Statistics and Visualizations* As of May 1, 2011

Page 9: HathiTrust and Print Storage Building around a digital core

Language Distribution (2)

The next 40 languages make up ~13% of total

Statistics and Visualizations* As of May 1, 2011

Page 10: HathiTrust and Print Storage Building around a digital core

Content over time

* As of May 1, 2011

Page 11: HathiTrust and Print Storage Building around a digital core

Dates (copyright)

Page 12: HathiTrust and Print Storage Building around a digital core

A global change in the library environment

June 2010Median duplication: 31%

June 2009Median duplication: 19%

Academic print book collection already substantially duplicated in mass digitized book corpus

Page 13: HathiTrust and Print Storage Building around a digital core

Continuing growth of overlap …

• ARL overlap– 31% in June 2010– 33% in Dec (adjustment: adding little-held works)– ~ 1% per 225,000 vols– 38% in May, 2011; 45% by December, 2011

• Oberlin Group overlap– 41% in December, 2010– Higher rate of overlap per added volume?– Close to 50% in May, 2011

Page 14: HathiTrust and Print Storage Building around a digital core

And yet every library is different

• Our median rate of overlap may be the same• But our overlap profiles will differ by library

Page 15: HathiTrust and Print Storage Building around a digital core
Page 16: HathiTrust and Print Storage Building around a digital core
Page 17: HathiTrust and Print Storage Building around a digital core
Page 18: HathiTrust and Print Storage Building around a digital core
Page 19: HathiTrust and Print Storage Building around a digital core
Page 20: HathiTrust and Print Storage Building around a digital core
Page 21: HathiTrust and Print Storage Building around a digital core
Page 22: HathiTrust and Print Storage Building around a digital core
Page 23: HathiTrust and Print Storage Building around a digital core
Page 24: HathiTrust and Print Storage Building around a digital core
Page 25: HathiTrust and Print Storage Building around a digital core
Page 26: HathiTrust and Print Storage Building around a digital core
Page 27: HathiTrust and Print Storage Building around a digital core
Page 28: HathiTrust and Print Storage Building around a digital core
Page 29: HathiTrust and Print Storage Building around a digital core
Page 30: HathiTrust and Print Storage Building around a digital core
Page 31: HathiTrust and Print Storage Building around a digital core
Page 32: HathiTrust and Print Storage Building around a digital core
Page 33: HathiTrust and Print Storage Building around a digital core
Page 34: HathiTrust and Print Storage Building around a digital core
Page 35: HathiTrust and Print Storage Building around a digital core
Page 36: HathiTrust and Print Storage Building around a digital core
Page 37: HathiTrust and Print Storage Building around a digital core
Page 38: HathiTrust and Print Storage Building around a digital core
Page 39: HathiTrust and Print Storage Building around a digital core
Page 40: HathiTrust and Print Storage Building around a digital core
Page 41: HathiTrust and Print Storage Building around a digital core
Page 42: HathiTrust and Print Storage Building around a digital core
Page 43: HathiTrust and Print Storage Building around a digital core
Page 44: HathiTrust and Print Storage Building around a digital core
Page 45: HathiTrust and Print Storage Building around a digital core
Page 46: HathiTrust and Print Storage Building around a digital core
Page 47: HathiTrust and Print Storage Building around a digital core
Page 48: HathiTrust and Print Storage Building around a digital core
Page 49: HathiTrust and Print Storage Building around a digital core
Page 50: HathiTrust and Print Storage Building around a digital core
Page 51: HathiTrust and Print Storage Building around a digital core
Page 52: HathiTrust and Print Storage Building around a digital core
Page 53: HathiTrust and Print Storage Building around a digital core
Page 54: HathiTrust and Print Storage Building around a digital core
Page 55: HathiTrust and Print Storage Building around a digital core
Page 56: HathiTrust and Print Storage Building around a digital core
Page 57: HathiTrust and Print Storage Building around a digital core
Page 58: HathiTrust and Print Storage Building around a digital core

And yet every library is different

• Our median rate of overlap may be the same• But our overlap profiles will differ by library• Our use patterns differ• Our risk profiles differ• Our roles vis-à-vis our constituencies differ• Thus, the need to act independently on

common data

Page 59: HathiTrust and Print Storage Building around a digital core

Extending the holdings database

• HathiTrust print holdings database– Basis for new cost model (overlap of in-copyright)– Basis for lawful uses (e.g., print disabilities, Section 108)– A more complete picture than elsewhere

• Print monograph storage proposal– Enable partners to register commitments – Establish definitions (e.g., environment, use and condition)– Build in cost-sharing: collectively fund those that make

commitments– Communicate information to partners to facilitate

decision-making

Page 60: HathiTrust and Print Storage Building around a digital core

Next steps?

• Work to develop draft proposal, led by Tom Teper, underway by HathiTrust Collections Committee (Ivy Anderson, chair)

• Early draft for review to Executive Committee in May/June

• Final version from Executive Committee to partners in late summer

• Consideration as part of new cost model at Constitutional Convention