1
Akademisk forskning onlineAcademic Research in Sweden online
A pilot study of an OAI-compliant portal
Eva Müller, [email protected] Publishing CentreUppsala University Library, Sweden
2
I will focus on
• Interoperability• Based on practical experiences• Relationship between a library
catalogue & a separate web service
3
Background
• Five universities in Sweden are participating in a common portal for electronically published documents – the DiVA portal (http://publications.uu.se/portal/)
• Portal based on DiVA publishing system built at Uppsala University
• Want to allow other Swedish universities to participate in the portal (even without DiVA)
4
Grant for a pilot study
• Academic Research in Sweden Online• Funded by the Royal Library’s
Department for National Coordination and Development (BIBSAM)
5
DiVA Publishing System
• Focus on workflow– reuse and enhance the data originally entered
by authors for metadata and a digital master for an electronic & printed version
– store & checksum the files– assign a persistent identifier– send a copy to the National Library Archive
6
Implementation
• Java – XML technologies• Oracle database used for searching• Architecture: component-based design
– Modularity and reusability of the components– Possibility to seamlessly replaced modules with
improved implementations of the component
7
Issues
• Do we need a separate portal for ETDs?• What kind of added value would such a
portal provide?• What is the minimum level of interoperability
for meaningful resource discovery of academic publications in Sweden?
8
Levels of interoperability
• Technical (protocol level, file formats, technical solutions used)
• Content (metadata formats, vocabulary used)
• Organizational (agreements on exchanging of data, rules for access and reuse of data)
9
Strategies
• Federation strategy– Agreements on the technical, content and
organizational levels • Harvesting
– Agreements on the technical level – for example OAI-PMH
10
Methodology
A comparison between a portal built using the federation strategy and another built by harvesting of available metadata
11
DiVA-portalAn example of a portal based on the federation strategy
• Agreements on the…– technical level (using of DiVA Publishing System)– content level (DiVA Document Format,
vocabularies used)– organizational level (exchanging of metadata,
rules for access, support of the technical solution, joint development of new services)
12
Functionality
• Federated searching and browsing for theses, dissertations and other publications at a number of Swedish universities
• Metadata publishing service
13
Academic Research Online PortalAn example of a portal based on the harvestingstrategy(http://publications.uu.se/portal)
• Goals– Allowes even more Swedish universities to
take part in the common portal– Focus on ETDs & new services– Simpler integration of participating
collections– Low barrier technical solutions
14
Implementation
• OAI-PMH used primary as metadata transmission protocol– Interoperability & extensibility (possibility
to use even community specific metadata)• Harvested metadata stored in a native
XML database (eXist)• Simple search interface devloped
15
OAI compliance
• Only 6 repositories could be harvested directly (5 DiVA repositories & 1 repository using E-prints software)
• OAI-PMH for metadata transmission is not practical for all repositories at the moment
• Alternative methods were used to collect metadata
16
Results
• A very simple interface to the harvested metadata (http://publications.uu.se/afo/)– Lack of content interoperability
(vocabularies, granularity of description)• A number of recommendations about
interoperability level• Starting point for additional discussions
17
Conclusions
• Interoperability strategies has a direct impact on the quality of service, results delivered and on the costs
• Levels of interoperability = granularity of resource discovery and services enable
18
Next steps
Project proposals submitted to BIBSAMFocus on interoperability:• To define and agree on different levels on
content operability– Low barrier level (ex. simple DC using within OAI
technical framework)– Richer level (URN:NBN, MARC XML, DiVA
Document Format)– Vocabulary level (controlled vocabularies)
19
Next steps
• To build up a technical infrastructure supporting interoperability
• To achieve some basic organizational interoperability agreements (exchanging of data, rules for access…)
20
This case study shows
that common agreement on interpretation of metadata standards and vocabularies is necessary for the meaningful resource discovery
achieving a national system that facilitates this access is a question of interoperability not only on the technical, but also on the content and organizational level
21
More information
• http://publications.uu.se/portal/• http://publications.uu.se/afo/• http://publications.uu.se/etd2003/• http://publications.uu.se/epcentre/