
Post on 31-Mar-2015


This work is licensed under a Creative Commons License

Attribution Non-commercial ShareAlike 2.0 Germany

Towards International Repositories Infrastructures

Workshop 16/17 March, 2009

Norbert Lossau,

Director Göttingen State and University Library

& Scientific Coordinator DRIVER


Topics

Objectives of the Workshop

Visions - use cases – infrastructure and components

International Repositories Infrastructure(s) – Where do we stand today?

Challenges

Global Data Network: a model for the International Repositories Infrastructure?

How do we proceed: our next two days (and beyond)


Objectives of the Workshop

1. Identify and establish relationships with key thought leaders, major projects/activities and services, and leading practitioners from around the world

2. Suggest commonalities between infrastructures, points of possible collaboration and pathways that might take the collaboration forward

3. To come to a shared vision of an international repositories infrastructure or, at least, the infrastructure components that might best be developed internationally

4. To identify the essential components of an international repositories infrastructure


Objectives of the Workshop

5. To review the approaches to sustainability, scalability and interoperability being taken by these components, bearing in mind the wider research infrastructure

6. To consider ways in which the progress might be coordinated and reviewed over time

7. Focus the agenda to achieve tangible outcomes


Topics

Objectives of the Workshop

Visions - use cases – infrastructure and components

International Repositories Infrastructure(s) – Where do we stand today?

Challenges

Global Data Network: a model for the International Repositories Infrastructure?

How do we proceed: our next two days (and beyond)


High-level Vision …

Free and unrestricted access

to sciences and human knowledge representation

worldwide,

incl. cultural heritage

Berlin Declaration, October 2003


International Repositories or Knowledge Infrastructure Vision …

Discovery & Access

Management

Usage & manipulation

Collaboration & Sharing

Dissemination & Publishing

To support the… complete research cycle, working with scientific information


High-level use cases & possible infrastructure components

Preservation actions

file format registries

validation tools

representation information registries

Ingest

SWORD

shared metadata services

name / factual authority services

automatic metadata creation services


High-level use cases & possible infrastructure components

Access

widespread OAI-ORE implementation

common text-mining API?

Online Reputation and reporting

effective, real-time, automatic forward and backward citation mechanisms

factual authority (common tagging of objects with funder / grant number metadata)


Infrastructure components: non-technical factors

Discovery & Access

Established search & browsing behaviours and pathways

Online Reputation and reporting

Established evaluation mechanisms (impact factor)

Preservation actions

Additional (manual) effort on the author side required?

Ingest

Additional (manual) effort on the author side required?


Use cases - „behind the scenes“ infrastructures – essential components

A ‘component’ might be:

A service (e.g. SHERPA/RoMEO, Connotea, BASE, funder repository, institutional repository)

A service environment (e.g. Amazon S3, Microsoft Azure, Facebook)

A technical success factor (e.g. consistent use of DC to point from a metadata record to the ‘full text’, use of OAI-ORE, the DRIVER Guidelines), or

a non-technical success factor (e.g. filling repositories through OA-agreements with publishers).

These components will form the focus for the workshop and the action plans that will be its principal output.


Topics

Objectives of the Workshop

Visions - use cases – infrastructure and components

International Repositories Infrastructure(s) – Where do we stand today?

Challenges

Global Data Network: a model for the International Repositories Infrastructure?

How do we proceed: our next two days (and beyond)


International Repositories Infrastructure – Where do we stand today? „Briefings“

Author identification

Copyright and licensing

Global harvesters (other than search engines)

Harvesters – subject or discipline based

Ingest – selected issues

Institution identifiers

Peer review

Persistent identifiers

Preservation


International Repositories Infrastructure – Where do we stand today?

Prestige and profiling services

Registries

Repository software

Repository support organisations

Storage

Usage reporting and metrics

User services

Validation and certification of repositories

Versioning


Development status of components - for discussion

Advanced?

Information on the repository landscape

Global harvesters

Preservation of research papers

Repository software

Storage

Validation & certification of repositories

A brief insight into some components =>…


International Repositories Infrastructure – Where do we stand? Repositories

• Informal survey carried out by SURF earlier in 2005

• DRIVER Inventory Study 2007

1. Produced 7 studies in 3 publications

• Inventory study into the present type and level of OAI compliant Digital Repository activities in the EU

• A DRIVER's Guide to European Repositories

• The Investigative Study of Standards for Digital Repositories and Related Services

2. Disseminated through the DRIVER website (in Open Access) + as 3 books (Amsterdam University Press)


International Repositories Infrastructure – Where do we stand? Repositories

• “Research Repositories in Europe: the 2008 DRIVER Inventory study”, Maurits van der Graaf (on behalf of SURF, DRIVER)

=>…


Research Repositories in Europe

Topics

Growth, total number and situation

Contents, coverage and depositing

Technical issues and standards

Services on top of repositories

Steady increase of number of Digital Repositories

Total of 280, yearly increase by 25-30

Large part of universities in half of European countries


Conclusions: content, coverage and depositing

More flexibility in access forms

Trend to more OA

Version of deposited full text articles

Trend towards depositing postprint stage

Work processes for depositing

No harmonisation

Growing (partly) mandatory depositing

32% in 2008, while 25% in 2006

(Still) Coverage of a third

33% of Researchers delivering in repositories

35% of Research output of an institution deposited


Technical issues

Various technical issues                2008   2006
persistent identifier                   84%    75%
long-term availability secured          52%    73%
statistical data on access and usage    72%    70%
some form of subject indexing           86%    93%
author identifier                       31%    33%

Chart legend (repository software): ARNO, locally developed, CDSware, Digitool, DIVA, DSpace, Fedora, GNU EPrints, iTOR, MyCoRe, OPUS, VITAL, other


Technical issues

Is your repository technically prepared for Enhanced Publications?

Metadata standards on the rise: DIDL, MODS and OAI-ORE

Chart (shares shown): 46.1%, 32.6%, 21.3%

Answer options: Yes; No, but we have plans to prepare our repository; No, no plans


International Repositories Infrastructure – Where do we stand? Repositories

• OpenDOAR – a comprehensive register of digital repositories worldwide

- More than 1300 repositories listed

=>…


OpenDOAR


Repository Type => 1072 institutional and 177 disciplinary

Content Type => 815 hold journal articles, 318 multimedia, audiovisual, … 69 datasets, 27 software etc. …

OpenDOAR


=> Languages: 1133 English, 155 German, 96 Spanish, 86 French, 73 Japanese, … 3 Afrikaans, … 2 Pashto, Pushto, … 1 Bulgarian, 1 Romanian

OpenDOAR


=> Disciplines: 763 Multidisciplinary, 86 Science General, … 99 Health and medicine, … 98 History and Archaeology, … 75 Social Sciences General

OpenDOAR


Where do we stand? Repository platforms & their (international) communities

EPrints*, DSpace*, Fedora Commons*, OPUS (GE), DiVA (SE), CDS Invenio (CERN),


Where do we stand? Country organisations and/or repository infrastructures

Australia, Belgium, Brazil, Canada, France, Germany, Hungary, Ireland, Italy, Japan, The Netherlands, Nordic countries, Portugal, Spain, UK, ???


Where do we stand? Global Harvesters

OAIster, BASE, Scientific Commons


Where do we stand? Cross-Country organisations & repository infrastructures

DRIVER

eIFL


Where do we stand? Repository Infrastructure Architectures

DRIVER


Existing Solutions: Repository Aggregation Systems (RAS)

RAS aggregate content from OAI-PMH Repositories, form an Information Space and provide community-specific functionalities via Web User Interfaces

Well known examples

BASE (DE)

DAREnet (NL)

OAIster (USA)

Others…
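The harvesting step that such an aggregation system relies on can be sketched as follows. This is a minimal illustration, not the code of BASE, DAREnet or OAIster: the repository URL, the sample response and the helper names `list_records_url` and `parse_records` are assumptions made for the example. Only the OAI-PMH verb, the `oai_dc` metadata prefix and the XML namespaces come from the OAI-PMH specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build an OAI-PMH ListRecords request URL for one repository."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return f"{base_url}?{query}"


def parse_records(xml_text):
    """Extract record id, title and dc:identifier values from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        records.append({
            "id": rec.findtext("oai:header/oai:identifier", default="", namespaces=NS),
            "title": rec.findtext(".//dc:title", default="", namespaces=NS),
            "links": [e.text for e in rec.iterfind(".//dc:identifier", NS)],
        })
    return records


# Hypothetical response from a hypothetical repository (no network access needed).
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:repo.example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample paper</dc:title>
          <dc:identifier>http://repo.example.org/1/fulltext.pdf</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

print(list_records_url("http://repo.example.org/oai"))
print(parse_records(SAMPLE))
```

An aggregator would repeat this per institution site and merge the parsed records into one index; that merged index is what the slide calls the Information Space.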


Diagram (RAS): several Institution Sites, each exposed via OAI-PMH, feed an Aggregator; the aggregated Information Space is indexed and offered through Search and a UI.


Service Open Infrastructures (SOI), DRIVER

Inspired by component-oriented systems

Components provide specific functionality in isolation

Components can be provided by different Service Providers and be shared between applications

Applications are formed by combining independent components under the control of System Managers

Service Open Infrastructure

Components are distributed services running on the network at different sites

Open to instances and types of services: instances or new functionality can be added/removed at any time
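The idea that service instances can join or leave at any time, while applications are composed from whatever is currently registered, can be illustrated with a toy in-process registry. This is only a sketch of the principle: `ServiceRegistry`, `search_application` and the two index handlers are hypothetical names for this example and do not describe the actual DRIVER software.

```python
from collections import defaultdict


class ServiceRegistry:
    """Toy registry: service instances register by type and can leave at any time."""

    def __init__(self):
        # service type -> {instance id: handler function}
        self._instances = defaultdict(dict)

    def register(self, service_type, instance_id, handler):
        self._instances[service_type][instance_id] = handler

    def unregister(self, service_type, instance_id):
        self._instances[service_type].pop(instance_id, None)

    def lookup(self, service_type):
        """Return the handlers currently available for a service type."""
        return list(self._instances[service_type].values())


def search_application(registry, query):
    """An application that combines whichever index services are registered right now."""
    hits = []
    for index in registry.lookup("index"):
        hits.extend(index(query))
    return sorted(hits)


registry = ServiceRegistry()
# Two hypothetical index services run by different providers.
registry.register("index", "site-a",
                  lambda q: [t for t in ["open access", "repositories"] if q in t])
registry.register("index", "site-b",
                  lambda q: [t for t in ["open data"] if q in t])

print(search_application(registry, "open"))
```

Calling `registry.unregister("index", "site-b")` and searching again returns only the hits from the remaining instance, without any change to the application code.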


Infrastructure architecture

Diagram: three layers (Functionality Layer, Data Layer, Enabling Layer) on top of the Repositories.

Services shown: User Interface Service, Recomm. Service, Community Service, User Service, Search Service, OAI-PMH Service, Index Service, Browse Service, Store Service, Aggregator Service, Information Service, Manager Service, Authz & Authn Service, ResultSet Service, Validator Service, Text Engine Service, Collection Service


Reuse: functionality sharing

Diagram: Institution Sites (Content Resources, exposed via OAI-PMH) and Functionality Services (Aggregator, Store, Index, Search, UI, User Profiling, others) are connected through the Enabling Layer Middleware into a dynamic, distributed run-time infrastructure.


Where do we stand? Repository Infrastructure Interoperability

DINI Certificate

DRAMBORA

TRAC project

DRIVER Guidelines, Maurice Vanderfeesten, SURF + DRIVER partners (some of the following slides have been presented by Maurice on the 29 August 2007, TICER Digital Libraries a-la-Carte)


Interoperability pragmatics

- Guidelines

- Validate

- Workflow


Guidelines

- Chapter 1: Use of OAI-PMH

- Chapter 2: Use of Metadata OAI_DC

- Chapter 3: Use of Best Practices for OAI_DC

- Chapter 4: Use of Compound Object Wrapping

- Chapter 5: Use of Vocabularies and Semantics

- Chapter 6: Use of Quality labels

- Chapter 7: Use of Persistent Identifiers

- Chapter 8: Use of Usage Statistics Exchange

- Chapter 9: Use of Intellectual Property Rights (IPR)


DRIVER Guidelines in various languages


Guidelines

From the inventory study:

72.5% know the DRIVER Guidelines; 54.5% try to follow them

Does your repository follow the DRIVER guidelines? n %

We do not know about the DRIVER guidelines 49 27.5

We know about the DRIVER guidelines, but do not follow them 32 18.0

We know about the DRIVER guidelines and (make every effort) to follow them 97 54.5
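The percentages in this table follow directly from the response counts, and the headline figure of 72.5% is the sum of the two "we know" rows. A short check, using shortened row labels chosen for this example:

```python
# Response counts (n) from the inventory study table above.
counts = {
    "do not know the guidelines": 49,
    "know, but do not follow": 32,
    "know and try to follow": 97,
}

total = sum(counts.values())

# Share of each answer, rounded to one decimal as in the table.
shares = {answer: round(100 * n / total, 1) for answer, n in counts.items()}

# Respondents who know the guidelines, whether or not they follow them.
knows = round(
    100 * (counts["know, but do not follow"] + counts["know and try to follow"]) / total,
    1,
)

print(total, shares, knows)
```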


Validator

- Detects interoperability failures

- Goes deep into the metadata content

- Provides explanation about guideline principles per interoperability feature.

- Offers recommendations on how to correctly modify your repository to interoperable standards

- Creates a report for future reference

=>Developed at the University of Athens
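What "going deep into the metadata content" and "creating a report" could look like is sketched below. This is not the validator developed at the University of Athens, and the rule set is hypothetical: the list of required Dublin Core fields and the URL check are assumptions made for illustration, not rules quoted from the DRIVER Guidelines.

```python
import xml.etree.ElementTree as ET

DC = "http://purl.org/dc/elements/1.1/"

# Hypothetical rule set for this sketch.
REQUIRED = ["title", "creator", "date", "identifier"]


def validate_record(xml_text):
    """Return a list of human-readable problems found in one oai_dc record."""
    root = ET.fromstring(xml_text)
    problems = []
    for field in REQUIRED:
        values = [e.text.strip() for e in root.iter(f"{{{DC}}}{field}")
                  if e.text and e.text.strip()]
        if not values:
            problems.append(f"missing dc:{field}")
    ids = [e.text.strip() for e in root.iter(f"{{{DC}}}identifier") if e.text]
    if ids and not any(i.startswith(("http://", "https://")) for i in ids):
        problems.append("no dc:identifier is a resolvable URL")
    return problems


def report(records):
    """Validate many records and keep the findings for future reference."""
    return {rec_id: validate_record(xml) for rec_id, xml in records.items()}


# Two hypothetical records: one complete, one with gaps.
GOOD = f"""<dc xmlns:dc="{DC}">
  <dc:title>Sample paper</dc:title><dc:creator>Doe, J.</dc:creator>
  <dc:date>2008-05-01</dc:date>
  <dc:identifier>http://repo.example.org/1</dc:identifier>
</dc>"""
BAD = f"""<dc xmlns:dc="{DC}">
  <dc:title>Untitled draft</dc:title><dc:date>2008</dc:date>
  <dc:identifier>urn:nbn:de:0000</dc:identifier>
</dc>"""

print(report({"rec-1": GOOD, "rec-2": BAD}))
```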


Where do we stand? Technology Components – 2008 Study

Slides from the „Technology Watch“, Karen Van Godtsenhoven, University of Gent (+ DRIVER partners)


Structure of Technology Watch Report

Chapters

DRIVER-GRID interaction

Interoperability

Long Term Preservation (LTP)

+

DRIVER-CRIS interaction (added later)

Result: two main parts

New communities and technologies (GRID, CRIS, LTP)

Interoperability of EPs (5 types)

Structure of each chapter

Theory - Case studies - Outcomes for DRIVER


Interoperability Enhanced Publications

Interoperability in DRIVER context: Exchange and dissemination of EP’s as complex, compound objects, based on textual publication

Focus on five types of representing and publishing enhanced publications (relationship of files within objects)

Envelope models or packaging formats

Overlays, maps, feeds

Embedding formats

New/Old publishing formats

Web services

NOT focus on ingest or descriptive metadata (Russell, Vanderfeesten, Hochstenbach, Van Godtsenhoven)


Envelopes

Access to metadata, structural data, identifiers, and binary streams of publications all in one package (= envelope)

MPEG 21-DIDL in DARE context

METS

IMS – CP

ODF packages

OOXML/ Package convention

Open e-book package

Comparison: table with all features in doc


Overlays, maps and feeds

SWAP, ORE and POWDER all qualify as good formats / models for the dissemination of EP’s

SWAP uptake by community is very low (high complexity)

OAI-ORE very popular in community and used in DRIVER demonstrator for EPs

POWDER: recent W3C standard, viable alternative to ORE (when the aggregations are of a very dynamic nature or cannot be simply enumerated)


New Publishing Formats

ODF (ISO/IEC 26300:2006) versus OOXML (ISO/IEC 29500-1:2008)

File format ISO standards for saving & exchanging office documents (alternative to proprietary formats e.g. doc, ppt)

Open up access to structured content which can be reused by other services e.g. DRIVER

Guarantee long term accessibility

Controversy surrounding development of OOXML: DRIVER should adopt an approach that is capable of using both ODF and OOXML

Plus: many disciplinary XML types, structured and crawlable data


Where do we stand? Infrastructure Technology Components in Practice

Automating and monitoring harvesting, data processing and indexing processes


DRIVER Repository Map


DRIVER Admin (Internal) Control Panel I

Monitor repository landscape I


DRIVER Admin (Internal) Control Panel II

Monitor repository landscape II


DRIVER Admin (Internal) Control Panel III

Monitor & Process Repository Data


DRIVER Admin (Internal) Control Panel V

Check repository index profile updates


Where do we stand? Linking publications to datasets (Enhanced Publications)


DRIVER – Enhanced Publications

Technology

The demonstrator aggregates scientific web resources via OAI-ORE v0.9 and RDF. XSLT is used to transform these into XHTML. CSS and Javascript do the rest of the presentation. A Java applet is used to dynamically display the relations between resources. Although these relations can be fed to the applet as parameters, they are not yet automatically interpreted from the RDF-serialisation
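Reading the aggregated resources out of an RDF serialisation, the step the slide says the applet does not yet perform automatically, could look roughly like this. The sketch makes several assumptions: it uses Python in place of the demonstrator's XSLT, it uses the `ore:aggregates` term from the published OAI-ORE vocabulary rather than the v0.9 draft mentioned on the slide, and the resource URLs and function names are invented for the example.

```python
from html import escape
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
ORE = "http://www.openarchives.org/ore/terms/"

# Hypothetical RDF/XML description of one enhanced publication.
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:ore="http://www.openarchives.org/ore/terms/">
  <rdf:Description rdf:about="http://repo.example.org/ep/1#aggregation">
    <ore:aggregates rdf:resource="http://repo.example.org/ep/1/article.pdf"/>
    <ore:aggregates rdf:resource="http://repo.example.org/ep/1/dataset.csv"/>
  </rdf:Description>
</rdf:RDF>"""


def aggregated_resources(rdf_xml):
    """Map each aggregation URI to the list of resource URIs it aggregates."""
    root = ET.fromstring(rdf_xml)
    result = {}
    for desc in root.iter(f"{{{RDF}}}Description"):
        about = desc.get(f"{{{RDF}}}about")
        parts = [a.get(f"{{{RDF}}}resource")
                 for a in desc.findall(f"{{{ORE}}}aggregates")]
        if parts:
            result[about] = parts
    return result


def to_xhtml_list(parts):
    """Render the aggregated resources as an XHTML list (stand-in for the XSLT step)."""
    items = "".join(
        f'<li><a href="{escape(u, quote=True)}">{escape(u)}</a></li>' for u in parts
    )
    return f"<ul>{items}</ul>"


aggregations = aggregated_resources(SAMPLE)
for uri, parts in aggregations.items():
    print(uri)
    print(to_xhtml_list(parts))
```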


Driver II Definition Enhanced Publication

An Enhanced Publication (EP) is:

a textual publication enhanced with:

research data (evidence of the research) and/or

extra materials (to illustrate or to clarify) and/or

post-publication data (commentaries, ranking)

So: ever developing


Topics

Objectives of the Workshop

Visions - use cases – infrastructure and components

International Repositories Infrastructure(s) – Where do we stand today?

Challenges

Global Data Network: a model for the International Repositories Infrastructure?

How do we proceed: our next two days (and beyond)


Challenges towards an International Repositories Infrastructure

Complex matrix, addresses

(min.) five main dimensions

Countries (political, financial, organisational, legal issues etc.)

Academic disciplines

Content access & usage

Multiple content resource types

Data Models & Technology


Essential: keep all stakeholders and their perspectives in mind

Researchers/disciplines

Research managers

Library Managers

Repository Managers (technical & content)

Computer Scientists

Publishers & further content providers

Service & Infrastructure providers

Funders


The Diversity & Wealth of academic disciplines

EC, Framework 6 Programme: 46 pages, c. 40 entries each


Discipline Schema (Keywords): European Commission

7 main areas


Content type

Text, Manuscript

Drawing

Painting

Photo

Film

Radio, TV Broadcasts

Papyri

Cuneiform tablets

Artefacts

Buildings

Maps

Language audio recordings


Discipline Schema: Deutsche Forschungsgemeinschaft, Germany

4 main domains

HSS

Life Sciences

Natural Sciences

Engineering

14 subdomains


Disciplines and their International Repositories Infrastructures

Examples

arXiv – Physics, Mathematics, Informatics

PubMed - Life Sciences

CLARIN, OLAC – Linguistics, language archives (datasets, international)

CESSDA – Social Sciences (datasets, international)

DARIAH – Humanities (datasets, international)

RePEc; NEEO – Economics (pre-/postprint publications, international)

METAFOR – Meteorology, Climate research (publications + datasets, international)

MACE - Architecture

IVOA - Astronomy


Issues to be addressed on the way towards an international repositories infrastructure, e.g.

Each discipline – one infrastructure?

Each information type – one infrastructure?

Same data models, technology, same services – different implementation

Same data models and syntax – different semantics

Project-specific goals & funding – external liaison & collaboration

Focus on a specific community, a country, a region – cross community, cross-country initiatives

Further issues

Content! Filling repositories

Publications: publishers' business models

Research data: culture of sharing data

Global Data Network: a model for the International Repositories Infrastructure?

Global Data Networks vs. Global Repository Infrastructure?

Data networks are "neutral carriers" of information – digital repositories contain the actual information

Content resources – multiple semantics and formats

Data networks are generic – knowledge infrastructures are discipline-specific

Cultural issues for disciplines: "You share the network – but not your research data"

Financing: hardware vs. service, business cases

Further architectural models for infrastructures?

JISC Information Environment architecture (diagram)

Provision layer: JISC-funded content providers, institutional content providers, external content providers

Fusion layer: brokers, aggregators, catalogues, indexes

Presentation layer: institutional portals, subject portals, learning management systems, media-specific portals

End-user desktop/browser

OpenURL link servers

Shared infrastructure: authentication/authorisation (Athens), institutional profiling services, terminology services, service registries, identifier services, metadata schema registries

© Andy Powell (UKOLN, University of Bath), 2005. This work is licensed under a Creative Commons License Attribution-ShareAlike 2.0

Diagram from the D-Grid and links4science workshop, 29 March 2007, Göttingen (translated from German)

Layers: Researchers; Discipline-specific tools and services; Cross-disciplinary services and tools; Basic services; Content

Disciplines: Engineering (1x), Other sciences (0x), Humanities and Social Sciences (2x), Natural and Life Sciences (4x)

Components named in the diagram: document servers; digitised collections; TV/radio; research data; …; mail; archives; catalogues/databases; multimedia servers; repositories; "Open Office" programme suite (scholarly workbench); publication and communication services (e.g. wikis); data converters, raw data analysis and reference; information extraction, semantic linking; discipline-specific navigation and visualisation; data aggregation and linking; CAD, CAE&S, CAM; rapid prototyping; N.N.; 3D reconstruction of artefacts; manuscript transcription; analysis of speech recordings; data visualisation; image editing and annotation; shared workspace, collaboration services; long-term preservation and availability; definition of standards (metadata, formats, etc.); search, navigation, visualisation, AAR; usage statistics, citations; data transfer and workflow integration; semantic social interaction

Diagram (translated from German)

Layers: scientific communities and institutions; discipline-specific tools and services; cross-disciplinary tools and infrastructure; virtualised hardware resources; content

Components named in the diagram: service catalogue / service registry; persistent identifier resolver; long-term preservation services; repository systems; information extraction; visualisation; ontology registry and services; metadata registry and services; Grid/VO search; ...

Learn from other sectors, e.g. logistics industry?

openID-center

An open platform for the integration of identification systems

Fraunhofer Institute of Material Flow and Logistics, www.openID-center.de

How do we proceed? Four action plans

Organisational structures (Norbert)

Sharing citation data (Les)

'Repository handshake' (Peter)

Identification Infrastructure (Andrew)

=> Intended to stimulate discussion (drafts were circulated beforehand)

Purpose of the action plans

The action plans are not necessarily about building infrastructure, but about whatever action needs to be taken so that the components form an infrastructure capable of supporting the use cases.

Organisational structures: Suggestions…

Define a clear Statement of Intent/Code of Conduct of the Confederation in relation to Open Access and repository developments

Launch the nucleus of an International Repository Confederation, unifying diverse stakeholders from country networks, disciplinary networks, technology, research managers, funders etc.

Commission an international Inventory Study on disciplinary repository infrastructures

Start a systematic consultation process with discipline representatives, selected national research funders etc. from representative regions all over the world

Draft a roadmap for an International Repository Infrastructure

Sharing citation data: Suggestions…

Revive Citebase, and expand its scope to cover a fuller range of open access material, including from OA journals and institutional repositories

Define and implement a common API for citation services such as Citebase, to enable machine query of the data

Implement the updated “CLADDIER trackback protocol” in major repository software as part of the core release

Learn lessons from the above that affect repository and journal practice, e.g. on metadata consistency, and act on those lessons

'Repository handshake': Suggestions…

1. Establish working group incl. major interested parties

2. Define/refine priority use cases

3. Describe negotiations needed for each use case

4. Identify minimum set of tools and mechanisms

5. Identify test partners

6. Implementation

Identification Infrastructure: Suggestions…

Identify relevant (inter)national activities (see briefing materials)

Define in the abstract who the trusted sources of authority are for each of the named entities (e.g. a funder is trusted to assert the title of a project)

Identify relevant (inter)national naming and resolution practice (DOIs, Handles, URNs, etc.)

Based on the above, and on relevant trends and plans, define a practical roadmap with milestones

Implement roadmap!
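
As an illustration of the naming and resolution schemes listed above (DOIs, Handles, URNs), the following minimal Python sketch guesses the scheme of an identifier string and, where a public proxy resolver exists, builds a resolution URL. The function names and the regular expressions are assumptions made for this example and are deliberately simplified; they are not part of any action plan, and real validation rules are more involved. URNs are left unresolved here because resolution depends on the URN namespace.

```python
import re
from typing import Optional

# Simplified, illustrative patterns (assumptions for this sketch only).
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")
URN_PATTERN = re.compile(r"^urn:[a-z0-9][a-z0-9-]{0,31}:\S+$", re.IGNORECASE)
HANDLE_PATTERN = re.compile(r"^[0-9A-Za-z.]+/\S+$")


def classify_identifier(value: str) -> str:
    """Return 'doi', 'urn', 'handle' or 'unknown' for an identifier string."""
    value = value.strip()
    # DOIs are checked first because they also match the generic
    # prefix/suffix shape used for Handles.
    if DOI_PATTERN.match(value):
        return "doi"
    if URN_PATTERN.match(value):
        return "urn"
    if HANDLE_PATTERN.match(value):
        return "handle"
    return "unknown"


def resolver_url(value: str) -> Optional[str]:
    """Build a resolution URL for DOIs and Handles; return None otherwise."""
    value = value.strip()
    scheme = classify_identifier(value)
    if scheme == "doi":
        return "https://doi.org/" + value
    if scheme == "handle":
        return "https://hdl.handle.net/" + value
    # URN resolution is namespace-specific, so no URL is guessed here.
    return None


if __name__ == "__main__":
    # "1234/example-item" and the URN below are made-up example values.
    for candidate in ["10.1000/182", "urn:nbn:de:gbv:7-example", "1234/example-item"]:
        print(candidate, classify_identifier(candidate), resolver_url(candidate))
```

A shared identification infrastructure would need far more than syntax checks (trusted authorities, persistence policies, resolution services), but even this small sketch shows why agreeing on a common practice matters: the same string can be syntactically valid under more than one scheme.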

In addition: Organisational structures

Promote complementary, non-technical actions alongside the technological strands

Sharing citation data <= Engage researchers, learned societies, research managers, research funders to discuss new models for evaluation and reputation schemas

Repository handshake <= Bring together existing and future initiatives (such as the PEER project) to discuss policy and legal frameworks, business models and organisational issues

Identification infrastructures <= Explore how identifiers will be used in practice in research processes, in different disciplines, on a broad scale

Outlook: a global network of repository infrastructure hubs?!

lossau@sub.uni-goettingen.de

www.driver-community.eu