+ All Categories
Home > Education > UDC_in_Action

UDC_in_Action

Date post: 18-Jun-2015
Category:
Upload: andrea-scharnhorst
View: 514 times
Download: 2 times
Share this document with a friend
Description:
This presentation from the UDC Conference 2013 discusses methods from statistical analysis and complexity theory applied to structure and evolution of a library classification system - the Universal Decimal Classification. (with Richard Smiraglia and Almila Akdag Salah)
Popular Tags:
16
UDC in Action
Transcript
Page 1: UDC_in_Action

UDC in Action

Page 2: UDC_in_Action

UDC in ActionRichard Smiraglia – University of Wisconsin MilwaukeeAndrea Scharnhorst, Almila Akdag Salah - eHumanities Cheng Gao – Dahlian University (now Austin, Texas)

Acknowledgement: We would like to thank Ed O’Neill of the OCLC Office of Research who provided us with the OCLC dataset. We would also like to thank Johan Rademakers and Bart Peeters from KU Leuven who provided the Leuven dataset. Aida Slavic gave comments on the paper, and was an indispensable sparring partner for discussion. Part of this work has been funded by the Network of Excellence for Internet Science, FP7 – 288021.

Page 3: UDC_in_Action

Classification of human knowledge production as complex phenomena

Page 4: UDC_in_Action

Dataset OCLC – raw data

1. 9,055,623 records extracted from 214,596,487 bibliographic records using the “080” field in WorldCat

2. first column = internal ID number, second column = UDC numbers 3. Cleaning:

1. Lines not starting with an ‘a’ tag. 2. Lines with no numbers after “a”, or without “a”3. 8,944,669 records 4. Another 570,629 dismissed as non-UDC numbers

4. Eventually we have 8,374,040

Page 5: UDC_in_Action

Dataset KU Leuven

The original file has 95,544 lines. The first column contains a string with the structure $$8 UDC number $$a UDC heading $$9 language of the heading. The second says how often this UDC number is used in bib records in the library.

Page 6: UDC_in_Action

Use of UDC in KU Leuven

Page 7: UDC_in_Action

Data processing or the beauty of a UDC ‘string’

394.4 :[92(100+437) :329(437).15(091)+327.32(100)]

Page 8: UDC_in_Action

Data processing or the beauty of a UDC ‘string’

Page 9: UDC_in_Action

UDC as a complex systemNot a hierarchy but a fully connected graph – still to be exploredEvolution of the UDC over time

Growth of UDC classes (AS, AAS, KS, CG, RS, 2011, Class&Ontolog)

Entry and Exit of UDC numbers, changes in all tables including auxiliaries (AAS, CG, KS, AS, RS, 2012, ISKO)

Structure of UDC UDC in collectionsHow long is a UDC string?How are UDC classes connected by operations through auxiliary signs?

Page 10: UDC_in_Action

Structure I – Profile of collections

Page 11: UDC_in_Action

Structure II – Length of a UDC string

Page 12: UDC_in_Action

Structure III – UDC six connecting symbols (or ‘relators” or ‘operators’)

Page 13: UDC_in_Action

Structure III – Networks views of UDC

Matrix : The combined number 022:11.203+11.204 contributes one tick to the cell {row class 0, column class 1} in the matrix_colon, and in the matrix_plus between row class 1 and column class 1. Combinations between auxiliaries are not taken into account!

Page 14: UDC_in_Action

Structure III – Network views of UDC

No “+” in use in Leuven

Page 15: UDC_in_Action

Conclusion I – UDC as a complex system

Page 16: UDC_in_Action

Conclusion IIDone To do

Demonstration of analytic perspectives from scientific visualization and complexity research

Systematic exploration of one (or several ‘complete’ datasets. Complete = UDC, plus full bibliographic record

Possible applications Analysis of UDC numbers in collections = feedback to UDC editors about the use of classes. The temporal provenance of UDC numbers: Across the editions of the UDC, not only are UDC numbers added and deleted, they also are shifted (and re-labeled) and recombined, as well as receiving changed descriptions.

Mapping out basic statistics on UDC classes as used in libraries for the information professionals

Users might profit from mapping too, gaining an overview about the nature and focus of a specific collection.