facet analysis in udc - international udc...
TRANSCRIPT
FACET ANALYSIS IN UDCQuestions of
structure, functionality and formality
Aida Slavic
UDC Consortium
The Netherlands
Sylvie Davies
Robert Gordon University
Aberdeen, UK
Aida Slavic & Sylvie Davies
CONTENT
Statement of the problem(s)
Introduction to Otlet’s classification system
Fundamentals of synthesis in UDC
Commitment to facet analysis
Faceted classification rationale
Types of faceted structure introduced in UDC
Discussion of issues
Conclusion
Aida Slavic & Sylvie Davies
STATEMENT OF THE PROBLEM(S)
Lack of connection throughout UDC development
between:
• requirements of an overarching facet analytical
theory
and…
• requirements of computerised systems in terms of:
notational representation
data structure
Aida Slavic & Sylvie Davies
CONCEPT BASED APPROACH AS DESIGNED BY
OTLET IN 1900s
Drawing cca 1910
(Courtesy of
Mundaneum)
Aida Slavic & Sylvie Davies
NOTATION
It should reflect class structure but it is also an
indexing language comprising:
• ‘vocabulary labels’ as index terms
• ‘syntax’ with rules for building complex expressions
It brings together semantic and syntax
It is the main source of structural data and most
relevant to classification data modelling, automation
and management
It is important to represent all components of
synthesised notation
It should not be treated as ‘text string’ in database
design
Aida Slavic & Sylvie Davies
FACETED CLASSIFICATION
Based on facet analysis which can be either:
• intuitive
• guided by common sense
• principles of formal logic
• facet analytical theory (FAT)
Notation tends to be viewed separately from the
classification structure (BC2)
We do not have a data model for fully faceted
classification
Aida Slavic & Sylvie Davies
ANALYTICO-SYNTHETIC CLASSIFICATION
It enables to compose and decompose notation in the
indexing/retrieval process
All discrete component concepts of synthesized
notation need to be coded to ensure full control of
data elements
It is often linked to the notion of facets because its
main purpose is to enable the combination of
mutually exclusive properties that may need to be
combined in describing a subject
Aida Slavic & Sylvie Davies
FACET REPRESENTATION AND SYNTHESIS IN UDC
811.133.1’276.6:34 Language of French lawyers
Discipline 1
Discipline 2
Discipline 3
81 Linguistics and languages 811 Languages811.111 English language811.112 West Germanic languages
(other than English)
811.112.2 German language 811.112.5 Dutch 811.113 North Germanic languages 811.113.4 Danish language 811.113.5 Norwegian language 811.113.6 Swedish language 811.12 Italic languages
Persons
Language
Time
Special Auxiliary Subdivision
81-1 Schools and methods in linguistics81-11 Schools and trends in linguistics 81-112 Diachronic linguistics 81-114 Synchronic linguistics81-13 Methodology of linguistics81-132 Method of string analysis81-139 Other methods
Special Auxiliary Subdivision
81`1/`4 Subject fields and facets of linguistics
81`1 General linguistics 81`2 Theory of signs. Theory of
translation. Standardization81`3 Mathematical and applied
linguistics. Phonetics. Graphemics. Grammar. Semantics. Stylistics
81`4 Text linguistics. Discourse analysis. Typological linguistics
Special Auxiliary Subdivision
81`01/`08 Origins and periods of languages81`01 Old period. Archaic period81`02 Classical period81`04 Middle period81`06 Modern period81`08 Revived language
BROAD FACET CATEGORIES
MAIN FACET
MAIN TABLES COMMON AUXILIARIES
Form
(1/9) Place(4/9) Countries and places of the
modern world (4) Europe(410) United Kingdom(430) Germany(436) Austria(437.3) Czech Republic(437.5) Slovakia(438) Poland
Aida Slavic & Sylvie Davies
COMMITMENT TO FACETING
Decentralized development of UDC (international special
subject committees) leads to multiplicity of approaches in
class presentation and synthesis
1960s FID Central Classification Committee (CCC) endorsed
facet analysis as a model for structuring of the scheme
(without specific theory or procedure in place)
CCC continues its search for appropriate theoretical
framework and more formal model that would enable
better management of UDC growth
Aida Slavic & Sylvie Davies
RATIONALE
Shorter and more rigorously structured schedules with
logical and predictable principle of organization of facets
and their notational presentation in all areas of knowledge
Tables containing simple concepts that would support
their unique identification, less repetition and
compositionality
Expressive notational system that fully supports
automation (parsing) and seamless linking of notational
elements and natural language
Aida Slavic & Sylvie Davies
TWO APPROACHES TO FACETIZATION
1980s facets as ‘relational tables’ (81 Linguistics and 82 Literature)
• Facet analysis logical but not based on any specific theoretical
principle
• Facets with simple concepts, no enumeration of combinations
(compound concepts appear in examples of combination only)
• All needed combinations appear in the process of indexing
1990s facets with enumerated compound and complex classes
(2 Religion)
• Structure based on facet analytical theory developed for Bliss (BC2)
• Table of simple concepts followed by selection of enumerated
compound subjects listed as a main hierarchy
• Schedules offer rich selection of useful combinations ready to be
used
Aida Slavic & Sylvie Davies
EXAMPLE OF A ‘RELATIONAL TABLE’ APPROACH
MAIN CLASS
82 Literature
821 Literatures of individual languages
821.1/.9 ≅ =1/=9 [Divide as language table!]
821.111 English literature
Example of combination:
821.111(417)-1(082.2)
English literature (Ireland) – poetry –
anthology
SPECIAL AUXILIARY FACETS
82.02/.09 Theory, study and technique of literature
82.02 Literary schools, trends and movements
82.09 Literary criticism. Literary studies ...
82-1/-9 Literary forms, genres
82-1 Poetry. Poems. Verse
82-2 Drama. Plays
82-3 Fiction. Prose narrative
82-4 Essays
82-5 Oratory. Speeches ...
COMMON AUXILIARY FACETS
PLACE
(4) Places of the modern world
(41) Countries of the British Isles
(417) Republic of Ireland
...
FORM
(08) Collected and polygraphic works
(082) Collections of works by several authors
(082.2) Anthologies. Selections. Excerpts
...
Aida Slavic & Sylvie Davies
UDC MASTER REFERENCE FILE DATABASE
(CREATED IN 1992)
UDC MRF assumed a ‘relational table’ presentation of facets
The main field in UDC MRF database record is ‘UDC number’
• this field should contain only simple UDC notation
• each UDC notation comes from a certain table (hence ‘table code
field’)
• the UDC notation in combination with table code would be
sufficient to produce automatic filing and ordering of UDC classes
• the UDC notation is a unique identifier of a class
Pre-combined (compound and complex) UDC numbers would
appear only in the field ‘example(s) of combination’ where
elements of the combination can be managed to a certain
extent
Aida Slavic & Sylvie Davies
EXAMPLE OF FACETING ALLOWING ENUMERATION
(1990s)
SPECIAL AUXILIARY FACETS
2-1 Theory and philosophy of religion. Nature of
religion
2-2 Evidences of religion e.g.
2-23 Sacred books. Scriptures. Religious texts
2-25 Secondary literature. Pseudo-canonical works
2-27 Critical works
2-28 Other religious texts
2-3 Persons in religion
2-4 Religious activities. Religious practice
2-5 Worship broadly. Cult. Rites and ceremonies
2-6 Processes in religion
2-7 Religious organization and administration
2-8 Religions characterised by various properties
2-9 History of the faith, religion, denomination
MAIN TABLE
2 Religion. Theology
27 Christianity
271 Eastern Christianity
271.2 Orthodox Church
271.2-1 The Orthodox ‘Tradition’
271.2-284.7-247 The Gospel Book
Example(s) of combinations:
271.2-282.7-247-536.36 Prostration
before the Gospel Book
271.2-284 Doctrinal statements. Symbolical Books
271.2-472-022.43 The Longer Catechism
271.2-523.46 Side rooms, chambers
...
COMMON AUXILIARIES
-02 Common auxiliaries of properties
-021 Properties of existence
-022 Properties of magnitude, degree, quantity, number, temporal values......
UDC MRF was designed for this kind of presentation:
27 Christianity
271 Eastern Christianity
271.2 Orthodox Church
Example(s) of combinations:
271.2-1 The Orthodox ‘Tradition’
271.2-284.7-247 The Gospel Book
271.2-284.7-247-536.36 Prostration before the Gospel Book
271.2-284 Doctrinal statements. Symbolical Books
271.2-472-022.43 The Longer Catechism
271.2-523.46 Side rooms, chambers
Aida Slavic & Sylvie Davies
FURTHER ISSUES (1)
Contracted captions2-23 Sacred books. Scriptures [special auxiliary]
27 Christianity
27-23 Bible
rather than:
27-23 Christianity -- Sacred books -- Bible
Compound class used as basis for hierarchical subdivision
233 Hinduism
233-13 The Holy. Brahma Absolute being
233-14 God(s) and goddess(es)
233-158D Devi
233-158G Ganesh
233-158K Kali
Aida Slavic & Sylvie Davies
FURTHER ISSUES (2)
Differential facets (i.e. introducing further class specifications
under particular subjects) not distinguished from the rest of
hierarchy
From: 2-265.3 Epics and sagas [special auxiliary]
To: 233 Hinduism
233-265 [omitted combination]
233-265.3 Itihasa. Epics and sagas
233-265.32 Ramayana
233-265.33 Mahabharata
233-265.34 Bhagavadgita
233-265.35 Puranas
Aida Slavic & Sylvie Davies
OVERALL...
The ‘relational’ model of facet analysis provides schedules
easier to manage online - however, introduced without:
• an established theoretical framework and guidelines for facet
analysis that should be applied for the UDC as a whole
• proper consideration of lack of searching access points in the
scheme which provides only a few examples of pre-combined
concepts
The Bliss-based approach offers a much needed theoretical
framework but it was introduced:
• ignoring the existing UDC MRF data model
• without guidelines about the principles and extent of
enumerated combinations to be presented in faceted
schedules of this type
Aida Slavic & Sylvie Davies
FURTHER ISSUES
lack of guidelines on
• facet analysis procedure and management and modelling of
differential facets in the UDC MRF
• standardization of notational representation
• semantic analysis/factoring of compound subjects (e.g. when a
compound should be represented as a simple class e.g. “Eastern
religions” or “Ancient religions” and when as a combination)
Also…
how the revised classes relate/combine with the rest of the UDC
vocabulary: when to introduce a concept and when to use an
existing concept from the UDC schedules?
indicate how, when and why to use examples of combination
indicate rules for verbal representation of pre-combined classes
Aida Slavic & Sylvie Davies
CONCLUSION
Facet analysis ought to be based on a theoretical framework valid for the system as a whole• to impose predictability in the schedule composition and
organization
• to impose rigorous principles for hierarchical subdivisions
Notational system should be formalized in relation to faceted structure• to formalize syntax rules to support synthesis
• to improve semantic linking to support syndetic structure
We should provide:• adequate data management system
• documentation on revision policy, guidelines and procedures