digital libraries and music jon dunn slis l631 music librarianship seminar april 7, 2003

78
Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Upload: homer-cobb

Post on 12-Jan-2016

217 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digital Libraries and Music

Jon Dunn

SLIS L631

Music Librarianship Seminar

April 7, 2003

Page 2: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Outline Digital Libraries

Music content Variations Variations2 Special topics:

Music information retrieval Open Archives Initiative

Page 3: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

What is a digital library? DL as collection/information system

“a collection of information that is both digitized and organized” -- Mike Lesk, National Science Foundation

“networked collections of digital text, documents, images, sounds, scientific data, and software”-- President’s Information Technology Advisory Council report

DL as organization: “an organization that provides the resources, including the

specialized staff, to select, structure, offer intellectual access to, interpret, distribute, preserve the integrity of, and ensure the persistence over time of collections of digital works so that they are readily and economically available for use by a defined community or set of communities”-- Digital Library Federation

Page 4: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Applications of music DLs Education

Electronic reserves Online instructional tools

Research Better access to special collections New capabilities for analysis, searching

Commercial Professionals

E.g. music/film/video production Consumers

Online music catalogs, digital distribution

Page 5: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

What is a music digital library? What does it contain? How is this content acquired? How is this content accessed? How can the content be used once located? What is the purpose? Who are the users? How is content protected?

Page 6: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music DL features Content

Selection, digitization, storage, delivery Metadata (cataloging) Search capabilities

for content and metadata Interfaces

User interfaces, programmatic interfaces Access control

Page 7: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Basic Representations of Music

Audio (e.g., CD, MP3): like speech

Time-stamped Events (e.g., MIDI file): like unformatted text

Music Notation: like text with complex formatting

Digital Audio

Time-stamped Events

Music Notation

Page 8: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Content Formats Audio MIDI Scores

Images Structured file format

(Video)

Page 9: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digital audio Sampling

Sample rate, sample size, number of channels

Compression Perceptual audio coding

File formats Standards

Page 10: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digital audio file formats Uncompressed –all basically the same

WAV - Microsoft/IBM AIFF - SGI/Apple AU/SND - NeXT/Sun

Compressed MPEG-1 layers 1-3, MPEG-2 AAC RealAudio, Windows Media, QuickTime

Each supports various compression options

Page 11: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digital audio file sizes Uncompressed audio

44.1 kHz, 16 bit, stereo (CD quality) 650 MB for one hour 1.4 Megabits/second

Compressed MP3: 58 MB for one hour, 128 Kilobits/second AAC: 29 MB for one hour, 64 Kilobits/second RealAudio, Windows Media Audio, QuickTime Qdesign

Music: down to 20 Kilobits/second or less

Page 12: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digital Audio

audiosampling

quantizationnoise

Barlow, Multimedia Systems, p. 77.

Page 13: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

CD Audio Sample rate:

44.1 kHz (44,100 samples/second) Sample size:

16 bits Number of channels:

2 (stereo) Bitrate

44100 samples/second * 16 bits/sample * 2 channels = 1.4112 megabits/second (plus file format/network overhead)

Page 14: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Masking Effect

Barlow, Multimedia Systems, p. 73.

Page 15: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

From Research and Creative Activity, September 1999

Page 16: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digital audio delivery Delivery options

Download Streaming

e.g. RealAudio, Windows Media, QuickTime Streaming

Encrypted download e.g. LiquidAudio, a2bmusic, Windows Media

Page 17: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Scores Score image

File format: TIFF, JPEG, GIF, PDF, … Resolution Grayscale vs. bitonal (black and white)

Score notation Many proprietary formats No common standard

Page 18: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Attributes of notated musical information

Pitch Duration Tempo Dynamic level Articulation Part (sometimes

implying timbral definition) Selfridge-Field, Beyond MIDI, p. 9.

Page 19: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Difficulties in representing CMN

Grammar of CMN is open-ended Which is more critical: graphical

appearance or semantic meaning? Much left open to interpretation

Style differences, e.g. interpretation of rhythms

Page 20: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music Notation File Formats www.music-notation.info lists over 50

different music notation formats, most for CMN

Page 21: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

MIDI Musical Instrument Digital Interface

Originally a hardware interface spec Communication of real-time events between

musical devices Standard MIDI File (SMF)

Stores time-stamped MIDI event information e.g note on/off, key pressure, aftertouch, pitch bend,

control change, program change. Each event accompanied by parameters

e.g. note on includes pitch, duration, dynamic range Spec maintained by industry group

MIDI Manufacturers’ Assocation

Page 22: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Limitations of MIDI MIDI does not represent many musical

attributes Graphical notation elements

Rests, stem direction, enharmonic distinctions, staff systems, page layout, etc.

Sound elements Timbre, full stylistic expression

Extensions exist but not widely used

Page 23: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Creating Notation Content Transcription

Music notation editor ASCII data entry

Recognition OMR: Optical Music Recognition

Page 24: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music notation editor example:Finale

Page 25: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OMR: Optical Music Recognition Commercial packages

Musitek MidiScan/SmartScore Version included with Finale

Neuratron PhotoScore Version included with Sibelius

Page 26: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OMR: A long way from OCR

Scanned into Finale: Only 5 easy edits needed.

Here's the original:

Taken from http://www.codamusic.com/finale/scanning.asp

Page 27: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OMR: Optical Music Recognition Research projects

CANTOR University of Waikato, New Zealand

Adaptive OMR Johns Hopkins University, USA Example:

http://mambo.peabody.jhu.edu/omr/demo/ others:

http://www.cs.waikato.ac.nz/~davidb/omr/

Page 28: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Score Images Scanned images still useful

Historical editions, manuscripts Preservation, improving access

Impracticality of large-scale OMR Music presents challenges for scanning

Page 29: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations Digital library of music sound recordings

and scores Online since 1996 Accessible in Music Library and other

select locations - copyright Used daily by large student population

Page 30: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Original Concept Burroughs and Fenske, 1990 VARIATIONS name

Theme and Variations Variety of information formats for music

Networked access for the music student or scholar to sound recordings, scores, textual materials, video recordings

Page 31: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Focus on Audio High demand portion of collection Fragile formats Lack of previous work; uniqueness

Page 32: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Focus on Audio Reserves Half of sound recording use from

reserves Problems with existing practices

Cassette tape dubs, analog distribution systems

Concentrated use of a few items at any given time

Page 33: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations System Digitization Storage Access

Page 34: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Design and Development Developed by Music Library with assistance from

UITS and Library Information Technology Integrate rather than develop from scratch Partnership with IBM Funding: School of Music, Libraries, UITS, IBM Online in April 1996

Page 35: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digitization Formats

Analog: LP, cassette tape, reel-to-reel tape Digital: CD, DAT

Capture at CD quality 44.1 kHz, 16 bit, stereo, 700MB for one hour

Compress to MPEG-1 layer 2 (“MP2”) 200 MB for one hour

Create “track file”

Page 36: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Digitization Hardware and Software Windows PCs Sound capture card Microtest Disc-to-Disk CD capture

software Sonic Foundry Sound Forge XP audio

editor

Page 37: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music Library Digitizing Lab

Page 38: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Storage Tivoli Storage Manager software IBM RS/6000 AIX server IBM Tape Library Dataserver

Contains three tape drives 10 terabyte (10,000 gigabyte) capacity

Page 39: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

IBM 3494 Tape Library

Page 40: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

IBM 3590E tapes:

20GB each

Page 41: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Access Discovery

How does the user find the desired recording?

Playback How is audio delivered to the user? How does the user navigate within a given

recording?

Page 42: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Collection Currently: 6900 titles, 8000 hours of

audio 5.6 TB uncompressed 1.6 TB compressed

Opera, songs, instrumental music, jazz, rock, world music

Page 43: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Discovery Varies based on purpose of access

Reserves Course reserve lists Faculty-created course home pages (incl.

Oncourse) General use

Links from IUCAT library catalog(856 fields in MARC bib records)

Page 44: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Playback Streaming server

IBM RS/6000, 150 GB disk IBM VideoCharger server software Software to connect VideoCharger with

TSM (locally written) Client

IBM VideoCharger client software Variations Player (locally written)

Navigation via track files

Page 45: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Network Originally ATM

25/100/155 megabits per second Now switched Ethernet

10/100/1000 megabits per second Variations audio stream requires 384

kilobits/second Up to 150 streams

Page 46: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations Demonstration

Page 47: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations2 Four-year project

Started October 1, 2000 Funding from NSF and NEH through Digital

Libraries Phase 2 (DLI2) program Large interdisciplinary team of investigators Faculty: Music, Information Science, Law,

Computer Science Librarians and technologists: Libraries, University

Information Technology Services Bloomington and Indianapolis campuses

Page 48: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Project goals Establish a digital music library testbed

system supporting multiple formats: audio, video, score images, score notation

Develop multiple interfaces for specific user applications in the music library and the classroom

Conduct research in metadata, usability, copyright, and networking

Page 49: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Partners: “Satellite Sites” United States

University of Illinois at Urbana-Champaign University of Massachusetts at Amherst Northwestern University

United Kingdom Kings College - London Loughborough University University of Oxford

Japan Waseda University

Evaluation…potential for co-development

Page 50: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

The Variations2 System Integrated access to music in all formats

Digital audio recordings Score images Score notation Video

Delivery to wide range of users Faculty: teaching, course design, research Students: coursework, independent study Music librarians, other library users

Extensible Multiple user interfaces Staged development

Page 51: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations2Version 1.0 Features Infrastructure

Data/metadata repositories, authentication, logging

Search and retrieval interface Based on new data model

Presentation/navigation of audio and scanned scores

Bookmarking

Page 52: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations2 Version 1.0 Technical Environment Client and server developed in Java Windows and Mac OS X client platforms,

Unix (AIX/Linux) server Audio streaming: QuickTime for Java, Darwin

Streaming Server Database: IBM DB2, DB2 Text Information

Extender Image compresssion: DjVu from AT&T Labs

and Lizardtech XML/MARC/Z39.50 tools: Saxon, Xerces,

Jafer, James

Page 53: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations2 1.0System Architecture: Layer View

General user

Searchtool

Soundplayer

Scoreviewer

Digitaltime liner

General purposelibrary application

Multimedia MusicTheory Teaching

application

Oncourse

Theory student orinstructor

Non-majormusic studentor instructor

Soundplayback

Imageretrieval

Notationretrieval

Search

Metadata Audio VideoScore

imagesScore

notation

Applications

AccessComponents

Repositories

User InterfaceComponents

and others...

and others...

Catalogingtools

Cataloging/adminsistration

application

Cataloger ordigitization technician

Page 54: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations2 1.0Communications

Variations2Client

Variations2LibraryServer

DarwinStreaming

Server

ApacheHTTP

Server

Variations2KerberosServer

IUKerberosServer

DB2Database

HTTP

RTSP/RTP

Java RMI

KerberosJDBC

Page 55: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Demonstration

Page 56: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003
Page 57: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Usability Usability = ease of use + usefulness Established baseline

Usability test of existing Variations system Satisfaction study of Variations users Contextual inquiry

Evaluation of usability of Variations2 Prototype interviews Usability tests of preliminary versions Pilot studies

Data gathering through satisfaction survey and automated usage logging

Page 58: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Future Versions Continuing 6-month development cycle

Version 2: Spring 2003 Version 3: Fall 2003 etc.

Features to be added include: Support for music notation Support for additional image and audio formats Support for new and emerging streaming technologies Support for video

Page 59: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Future Versions Features to be added (continued):

Support for supplemental recording materials (e.g., liner notes, booklets)

Improved browsing interface User interface support for synchronized navigation and

playback Instructional authoring, classroom presentation, and

instructional delivery interfaces Structure diagramming/visualization tools (e.g., Digital

Timeliner) Web browser interface OnCourse integration Access control based on intellectual property requirements Improved cataloging/administrative interface

Page 60: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Variations2 Version 2 Demo

Page 61: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music Information Retrieval Areas of research:

Indexing and search of music content Audio, MIDI, notation

Feature detection Genre, style, form, instrumentation, …

Page 62: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music IR:Inherent Difficulties in Music No analogue to ‘words’

No easy units on which to index or do synonym lookup, etc. Problems of representation

Graphical vs. logical aspects of music Polyphony

Multiple voices, chords Cross-voice matching Music is not linear

Page 63: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Polyphony

Page 64: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Polyphony Almost all music IR work to date

focused on pitch matching in monophonic music

Page 65: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music IR: More problems Query specification

What would a musical query look like? “Query by humming”

Music perception People do not always perceive pitch correctly

What type of matching? Exact pitch or intervals Melodic contour Exact rhythm “Rhythmic contour”

What to index? Entire works Themes

Page 66: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Music IR: Yet more problems Variety of users, probably with very different needs,

including: General public looking for pop music Music students, scholars

However, no formal assessment of user needs No standard query sets, relevance judgements, or

test collections Problems of copyright in building test collections

Page 67: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

More information on Music IR ISMIR: International Symposium/Conference on

Music IR 2000: Plymouth, Massachusetts, USA 2001: Bloomington, Indiana, USA 2002: Paris, France October 26-30, 2003: Baltimore, MD, USA

http://www.ismir.net/

Page 68: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OAI: Open Archives Initiative Original problem: searching across e-

print archives Distributed searching hard

e.g. Z39.50 Varying search semantics, capabilities Network, server problems

Solution: metadata harvesting

Page 69: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Metadata harvesting Extract metadata from various sources Build services on local copies of metadata

user

. . .

search for “Mozart”

local copy ofmetadata

metadataharvested offline

metadataharvested offline

metadataharvested offline

metadataharvested offline

all searching, browsing, etc. performed on the metadata hereIndividual repositories can

still support direct userinteraction

Data providers

Service provider

Page 70: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OAI roles Data Providers

Repositories of digital content and metadata Support harvesting of metadata via the OAI

protocol Service Providers

Harvest metadata from data providers using the OAI protocol

Implement user interface to data Usually for searching, but other services also possible

Can be selective

Page 71: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OAI protocol Originally developed in 1999 (“Santa Fe Convention”) Original focus on E-prints Has grown into general metadata harvesting protocol

OAI-PMH: OAI Protocol for Metadata Harvesting Version 1.0: January 2001 Version 1.1: June 2001

Conform to XML Schema 1.0 Version 2.0: June 2002

Transition period through December 2002 Currently 53 registered OAI data providers

Page 72: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OAI protocol Carried over HTTP Requests: HTTP GET or POST Responses encoded in XML

Format defined via XML schema Metadata in simple (unqualified) Dublin

Core (and potentially other formats)

Page 73: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Dublin Core elements Coverage Description Type Relation Source Subject Title

Contributor Creator Publisher Rights Date Format Identifier Language

Page 74: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OAI verbs

Verb Function

Identify description of archive

ListMetadataFormats metadata formats supported by archive

ListSets sets defined by archive

ListIdentifiers OAI unique ids contained in archive

ListRecords listing of N records

GetRecord listing of a single record

Page 75: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

OAI resources Web site, mailing lists Repository explorer Data/service provider toolkits

www.openarchives.org

Page 76: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Becoming an OAI data provider Make digital content available on web Translate metadata into Dublin Core

Crosswalks exist for MARC Can also make other formats available, e.g. MARC XML

Choose a unique identifier system Set up OAI data provider server software

See tools list at www.openarchives.org Depending on tool, uses its own database or operates over

existing database

Page 77: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Other technical concerns for scores Areas for standardization/agreement

Within-score navigation User interface, supporting metadata

Image file format MARC-DC metadata crosswalk

Not essential to OAI model, but enables more consistent user experience

Packaging scores for exchange between libraries, e.g. for e-reserves, cooperative preservation

Can METS play a role?

Page 78: Digital Libraries and Music Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Examples of OAI service providers UIUC Cultural Heritage Repository

http://dlc.grainger.uiuc.edu/ UMich OAIster

http://www.oaister.org/ RLG Cultural Materials

http://www.rlg.org/culturalres/ UCLA/JHU/IU Sheet Music Harvester

http://digital.library.ucla.edu/sheetmusic/