The Library behind the scenes
How does it work?
JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS
Outline
• 1- Introduction – definitions and context
• 2- Information systems in particle physics
• 3- Standards
• 4- Tools
• 5- Conclusions and outlook
1- Introduction
Do you speak “Librarian”?
What’s a library for you?
Some definitions
• Library (Oxford Reference Online)
  – A building or room containing collections of books, periodicals, and sometimes films and recorded music for use or borrowing by the public or the members of an institution
• Digital Library (Wikipedia)
  – A library in which collections are stored in digital formats (as opposed to print, microform, or other media) and accessible by computers. The digital content may be stored locally, or accessed remotely via computer networks. A digital library is a type of information retrieval system.
Context
• Towards a digital world: print vs online
  – Traditional print/physical collections:
    • Books, journals, theses, reports, standards…
    • Physical item, description, location
  – E-resources:
    • E-books, e-journals, multimedia (videos, photos…), e-documents…
    • File, description, link, content ++
Evolution of information retrieval
2- Information systems at CERN and in particle physics
CDS and Inspire
Particle physics and CERN
• Particle physics
  – Aims to understand how the Universe works
  – Small but tightly organized worldwide community
  – Experimental vs theoretical
• CERN: research institute in particle physics
  – LHC (Large Hadron Collider)
  – 2500 staff + 10,000 users coming from all over the world
• Need for fast communication to distribute research results
  – Publication in journals is too slow a process
  – Preprint communication
  – From mail to arXiv.org
Open access
• Traditional publication model:
  – Subscription, purchase, controlled access
• Open Access:
  – Open Access (OA) literature is digital, online, free of charge to the reader, and free of most copyright and licensing restrictions.
  – Green OA, e.g. institutional repositories (CDS, JINR…)
  – Gold OA: OA to published articles
CERN Scientific Information Service
• Mission
  – Provide information resources in ALL fields of relevance to CERN
  – Ensure scientific information produced at CERN is safeguarded and made publicly available
  – Distribute CERN publications
• Audience
  – Particle physicists (from CERN and from outside), engineers, technicians, computer scientists, administrative staff
CERN Document server: institutional repository and Library catalogue
http://cdsweb.cern.ch
Powered by Invenio
CERN Library collections: (e)books, (e)journals, (e)standards
CERN institutional repository: preprints, articles, theses (full text)…
Multimedia collections
CERN publications
Inspire: HEP literature database
http://inspirehep.net/
Powered by Invenio
Worldwide repository: CERN, Fermilab, SLAC and DESY
All HEP literature (since 1960)
Citation extraction, author and affiliation analysis…
JINR Document server
Where do you first search for information?
What system do you prefer?
arXiv:0804.2701v2, Gentil-Beccot et al.
2007 survey: 9% of HEP scholars use Google as their preferred information system
Google generation
arXiv:0804.2701v2, Gentil-Beccot et al.
What do you do when you are looking for information?
Why?
3- Standards
Why we need them
Library catalogue
MARC 21
• Metadata: data about data, record description
• MARC: MAchine-Readable Cataloguing, an international standard for representing and communicating bibliographic records, developed in the 1960s
• MARC 21: MARC redesigned for the 21st century
  – Based on the ANSI standard Z39.2, which allows users of different software products to communicate with each other and to exchange data
• MARCXML: XML schema based on MARC 21
MARC 21
[Figure: example MARC 21 record with the Author, Title and Identifier fields labelled]
MARCXML
[Figure: example MARCXML record with the Title, Author and Identifier fields labelled]
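To make the three labelled fields concrete, here is a minimal sketch of reading them from a MARCXML record with the Python standard library. The record is hand-written for illustration and is not taken from CDS; the helper name `subfield` is hypothetical. The tags used (020 for ISBN, 100 for the main author, 245 for the title) are standard MARC 21 tags, but a real catalogue may store identifiers in other fields.

```python
import xml.etree.ElementTree as ET

# Hypothetical record, written by hand for illustration only.
MARCXML = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <controlfield tag="001">12345</controlfield>
  <datafield tag="020" ind1=" " ind2=" ">
    <subfield code="a">9780000000002</subfield>
  </datafield>
  <datafield tag="100" ind1="1" ind2=" ">
    <subfield code="a">Doe, Jane</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
    <subfield code="a">An example title</subfield>
  </datafield>
</record>"""

# Namespace used by the MARCXML ("MARC 21 slim") schema.
NS = {"marc": "http://www.loc.gov/MARC21/slim"}


def subfield(record, tag, code):
    """Return the first subfield value for a MARC tag/code pair, or None."""
    path = f"marc:datafield[@tag='{tag}']/marc:subfield[@code='{code}']"
    node = record.find(path, NS)
    return node.text if node is not None else None


record = ET.fromstring(MARCXML)
summary = {
    "identifier": subfield(record, "020", "a"),  # ISBN
    "author": subfield(record, "100", "a"),      # main entry, personal name
    "title": subfield(record, "245", "a"),       # title statement
}
print(summary)
```

Because every field is addressed by tag and subfield code rather than by position, the same lookup works on any record that follows the standard.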
Other standards
• Metadata:
  – MARC, Dublin Core, BibTeX…
• Identifiers:
  – DOI, ISBN, barcodes…
• Data exchange protocols:
  – Z39.50, OAI-PMH
• Full text and encoding:
  – XML, PDF, PDF/A…
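As an illustration of a data exchange protocol, the sketch below builds an OAI-PMH harvesting request. OAI-PMH requests are plain HTTP queries with a `verb` and a few standard arguments (`metadataPrefix`, `set`, `from`); `oai_dc` is the Dublin Core format every OAI-PMH repository must support. The base URL and the function name `list_records_url` are hypothetical, and no network call is made.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL (no request is sent)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec      # restrict harvesting to one collection
    if from_date:
        params["from"] = from_date    # only records changed since this date
    return f"{base_url}?{urlencode(params)}"


# Hypothetical repository endpoint, used only to show the shape of the URL.
url = list_records_url("https://repository.example.org/oai", from_date="2012-01-01")
print(url)
```

A harvester would fetch this URL, parse the XML response, and repeat the request with the returned `resumptionToken` until the list is complete.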
Why is this important?
• Retrieve information
  – Identification and searchability
• Preservation
  – Ensure the information will be readable by another machine / in 20 years' time (?)
• Interoperability / information integration
  – Transfer (convert) data easily to another catalogue
  – Extract information and re-use it
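The "extract and re-use" point can be sketched with a small conversion: once a record is structured, exporting it to another metadata format is a mechanical step. The example below turns a record, represented here as a plain dictionary, into a BibTeX entry. The function name `to_bibtex`, the citation key and the record content are assumptions made for illustration.

```python
def to_bibtex(key, record, entry_type="book"):
    """Render a flat metadata record as a BibTeX entry."""
    fields = ",\n".join(f"  {name} = {{{value}}}" for name, value in record.items())
    return f"@{entry_type}{{{key},\n{fields}\n}}"


# Hypothetical record, e.g. as extracted from a catalogue entry.
record = {"author": "Doe, Jane", "title": "An example title", "year": "2012"}
entry = to_bibtex("doe2012", record)
print(entry)
```

The conversion is only this simple because author, title and year are already stored as separate, labelled fields.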
4- Tools
A Library system
[Diagram: a library system built around the LIBRARY CATALOGUE]
• Objects: bibliographic records, physical items, electronic files
• Actors: librarian, borrowers, authors, publishers, other sources
• Actions: create/edit, ingest, submit, search/find, access, loan
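One way to read the diagram is as a small data model. The sketch below is an assumption about how the objects could relate, not a description of how Invenio stores them: a bibliographic record describes a work, physical items and electronic files point to that record, and a loan links a borrower to a physical item. All class and field names, and the 28-day loan period, are hypothetical.

```python
from dataclasses import dataclass, field
from datetime import date, timedelta


@dataclass
class BibliographicRecord:
    record_id: int
    title: str
    author: str
    file_urls: list = field(default_factory=list)  # attached electronic files


@dataclass
class PhysicalItem:
    barcode: str
    record_id: int        # the bibliographic record this copy belongs to
    location: str
    on_loan: bool = False


@dataclass
class Loan:
    barcode: str
    borrower: str
    due: date


def lend(item, borrower, today, days=28):
    """Register a loan for a physical item; refuse if it is already out."""
    if item.on_loan:
        raise ValueError(f"item {item.barcode} is already on loan")
    item.on_loan = True
    return Loan(item.barcode, borrower, today + timedelta(days=days))


book = BibliographicRecord(1, "An example title", "Doe, Jane")
copy = PhysicalItem("B0001", book.record_id, "Reading room")
loan = lend(copy, "borrower-42", date(2012, 1, 1))
print(loan)
```

Keeping the description (the record) separate from the copies and files is what lets one catalogue entry serve both the print and the online version of a document.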
CERN Document server
Circulation and statistics
Record editing: BibEdit
Record editing: Multi-record editor
Records ingestion
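[Diagram: records ingestion]
• Input: XML or MARCXML records
• Steps: conversion (to MARCXML), then matching against the library catalogue
• Result: new records or updated records in the library catalogue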
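The matching step can be sketched as follows, assuming incoming records have already been converted to a common structure and that matching is done on a single identifier (here a DOI). Real matching is usually more elaborate and may combine several identifiers or fuzzy title comparison. The function name `match_records` and all sample data are hypothetical.

```python
def match_records(incoming, catalogue, key="doi"):
    """Split incoming records into new records and updates of existing ones.

    catalogue maps a record id to a record; records are plain dictionaries.
    """
    # Index existing records by identifier, skipping records without one.
    index = {rec[key]: rec_id for rec_id, rec in catalogue.items() if rec.get(key)}
    new, updates = [], []
    for rec in incoming:
        rec_id = index.get(rec.get(key))
        if rec_id is None:
            new.append(rec)                # no match: create a new record
        else:
            updates.append((rec_id, rec))  # match: update the existing record
    return new, updates


# Hypothetical catalogue content and incoming batch.
catalogue = {
    1: {"doi": "10.1234/example.1", "title": "First paper"},
    2: {"doi": None, "title": "Old report"},
}
incoming = [
    {"doi": "10.1234/example.1", "title": "First paper (revised)"},
    {"doi": "10.1234/example.2", "title": "Second paper"},
]
new, updates = match_records(incoming, catalogue)
print(len(new), "new,", len(updates), "to update")
```

This only works reliably when sources provide standard identifiers, which is one practical reason the identifier standards matter for ingestion.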
Conclusions and outlook
• Importance of structured information:
  – Standards
  – Automatic procedures as much as possible
• Why?
  – Users find what they need (and even more)
  – In the digital era, new challenges and opportunities:
    • Build new services on top of the catalogue
    • Integrate information resources
    • Communicate!
• More in the next session!
Thank you!