CLARIN - a European Research Infrastructure. Peter Wittenburg, Max-Planck Institut für Psycholinguistik, Nijmegen
Transcript
Page 1

CLARIN - a European Research Infrastructure

Peter Wittenburg

Max-Planck Institut für Psycholinguistik, Nijmegen

Page 2

eResearch - Infrastructures

Bozen, 16.9.2010

www.clarin.eu

J. Taylor: "eScience is about global collaboration in key areas of science and the next generation of infrastructures that will enable it"

Requires new persistent platforms
- to enable researchers to combine resources and tools to solve the big challenges of today (global migration, crisis of cultures and minds)
- to increase the efficiency of researchers in the many small tasks
- 40 % of the time of "knowledge workers" is spent finding useful material (Forrester Research)

Page 3

CLARIN Goal

What: Offer a distributed Research Infrastructure of integrated and interoperable Language Resources and Tools that serves researchers and students in the SSH

How:
- allow the combination of existing and web-accessible digital centers hosting resources in a common federation
- offer language tools and services as distributed services with a common web interface

Page 4

Key Application/Mission

A researcher authenticates at his own organization, creates a virtual collection of resources from different repositories and executes a virtual pipeline of processes on them.
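
The mission statement above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `Resource`, `VirtualCollection` and `run_pipeline`, the `example.org` repository URLs and the sample texts are all hypothetical and are not CLARIN APIs or data. The point is that a virtual collection only references resources hosted elsewhere, and a virtual pipeline is an ordered list of processing steps applied to each member.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass(frozen=True)
class Resource:
    """A reference to a resource held in some repository (hypothetical model)."""
    repository: str   # base URL of the hosting centre (illustrative only)
    identifier: str   # identifier of the resource within that repository
    text: str         # stand-in for the resource content


@dataclass
class VirtualCollection:
    """A user-defined collection that only references resources; nothing is copied."""
    owner: str
    members: List[Resource] = field(default_factory=list)

    def add(self, resource: Resource) -> None:
        self.members.append(resource)


# A processing step takes text and returns transformed text.
Step = Callable[[str], str]


def run_pipeline(collection: VirtualCollection, steps: List[Step]) -> List[str]:
    """Apply each processing step, in order, to every member of the collection."""
    results = []
    for resource in collection.members:
        data = resource.text
        for step in steps:
            data = step(data)
        results.append(data)
    return results


# The owner string stands in for an identity asserted by the home organisation.
collection = VirtualCollection(owner="researcher@home-organisation.example")
collection.add(Resource("https://repo-a.example.org", "corpus-1", "  Hello World  "))
collection.add(Resource("https://repo-b.example.org", "corpus-2", "Dialect DATA"))

processed = run_pipeline(collection, [str.strip, str.lower])
print(processed)
```

In a real federation the steps would be remote web services rather than local string functions; the sketch only shows the shape of the data flow.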

Page 5

CLARIN is pan-European

CLARIN:
• 3-year preparatory phase
• ~ 200 members
• ~ 25 centre candidates

Page 6

CLARIN Work Dimensions

... at least IT oriented aspects

- how to come to a persistent and stable infrastructure? (community centres)
- how to come to a federation and how to get access? (service provider federation)
- how to make all of their LRT visible? (CMDI future & short term solution)
- how to come to interoperable services? (service oriented architecture)
- how to get it all together for user services? (pan-European demo cases)

CLARIN has other very important aspects:
• Relation with SSH disciplines - mainly driven by national funds
• Education/Training, Help/Support/Advice, Dissemination
• Harmonization of licencing and Code of Conducts
• Specification of the ERIC legal framework to ensure persistency

Page 7

Community Centres

25 Centre Candidates

all are busy with restructuring plans

2 already give long-term preservation service


CLARIN Centres

Centres Criteria

Long-term Preservation

REPLIX Replication

Page 8

Service Provider Federation

setup federation technology

build initial federation

setup EPIC service

central user attribute server

• Service Provider Federation
• Agreement 1
• n centers members
• Link up with national IdFs
• Agreement 2
• DFN (De)
• HAKA (Fi)
• SURFnet (Nl)
• 1 million potential users
• currently more countries and centers coming

http://www.pidconsortium.eu

Trust Domain

Initial Federation

PID Service

Page 9

Metadata Domain

CMDI Infra

ISOcat development

setup OAI PMH machinery

Category Definition

LRT Inventory

Virtual Language World

[Diagram labels: ISOcat concept registry, concept registration, CLARIN component registry, component registration, component editor, my profile, ARBIL MD Editor, metadata editor, metadata descriptions, user area]

Component Metadata

Metadata now

Virtual Collection

ISOcat Registry

VLO Observatory
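
The "setup OAI PMH machinery" item refers to harvesting metadata over OAI-PMH. As a hedged sketch of what a harvester's request construction might look like, the helper below builds `ListRecords` request URLs. The function name `list_records_url` and the `repository.example.org` endpoint are assumptions made for illustration; the `ListRecords` verb, the `metadataPrefix`, `set` and `resumptionToken` arguments, and the `oai_dc` format come from the OAI-PMH protocol itself. The slides do not say which metadata prefix CLARIN centres expose, so `oai_dc` is used only because every OAI-PMH repository must support it.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(base_url: str,
                     metadata_prefix: str = "oai_dc",
                     set_spec: Optional[str] = None,
                     resumption_token: Optional[str] = None) -> str:
    """Build an OAI-PMH ListRecords request URL (hypothetical helper)."""
    if resumption_token is not None:
        # resumptionToken is an exclusive argument: only the verb accompanies it.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec is not None:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


# The endpoint below is illustrative, not a real CLARIN centre.
first_page = list_records_url("https://repository.example.org/oai")
next_page = list_records_url("https://repository.example.org/oai",
                             resumption_token="abc123")
print(first_page)
print(next_page)
```

A harvester would fetch the first URL, read the records, and keep requesting follow-up pages for as long as the response carries a non-empty resumption token.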

Page 10

Service Oriented Architecture

Stuttgart Tübingen Leipzig

Service Framework Specification

Web Service and Processing Chains

Standards and Best Practices

Web 2.0 Application for Repository

Standard-conformant

Tool Chaining and Execution

Text Corpus Encoding

Stuttgart Tübingen Berlin Leipzig Finland Romania

Service Oriented

Infrastructure

Web Services Interoperability

Standards & Best

Practices

Page 11

Demo Cases (just started)

EU Identity Index Case

Multimedia/multimodal Case

Folkstory Case

C4/WebLicht Corpus Case


Page 12

not alone ...

EUDAT

Meta-Net

Page 13

need to take care of data ...

[Diagram: layered data infrastructure]

Data generators, Users

User functionalities, data capture & transfer, Virtual Research Environments (CLARIN, DARIAH etc.)

Community Support Services: data discovery & navigation, workflow generation, annotation, interpretability

Common Data Services: safe & persistent storage, identifiers, authenticity, workflow execution, mining

Side labels: Data Curation, Trust

Data e-Infrastructure

Architecture created by EC High Level Expert Group; will be a guideline for coming decades

Page 14

why European?

- live in a multilingual Europe with a joint historical tradition and need to exploit this strength
- sharing costs in all respects is more efficient
- finally it's about global competition also in SSH
- many research questions are cross-national
- required standards cannot be national

Page 15

Why now?

- there is the ESFRI process and all countries are synchronized, which is a unique chance to build infrastructures
- in total 44 initiatives on the ESFRI roadmap and there is the potential of gain by an eco system of RI
- we need to organize our resource domain due to huge increase of data (MPI: 200 TB)
- we need to take care to not lose our cultural and scientific memory
- there is a huge uptake of RI and there will be many funding streams!!!

Page 16

who and when?

current EU CLARIN consortium in prep phase (08-10): 32 partners from 24 countries

CLARIN construction phase from 2011; main funds by national programs - but additional funding streams by EC connected to RI

legal issue: foundation of a European Research Infrastructure Consortium (ERIC) as basis for future with automatic qualification to participate in programs

Page 17

Organisation of the CLARIN ERIC

CLARIN Utrecht

Page 18

who seems to be on board?

Belgium, Bulgaria, Germany, Denmark, Estonia, Latvia, Finland, Croatia, Netherlands, Norway, Austria, Portugal, Spain, Czech Republic, Hungary, South Tyrol, ?

Some are discussing: FR, SW, GR?, etc.

Page 19

Advantage of membership

- privileged access to CLARIN federation
- networked with CLARIN centres (direct technology transfer)
- a word when discussing priorities, agreements, best practices
- access to EC funding streams
- access to education and training programs to make our young generation competitive

Page 20

Further Information

CLARIN web site: http://www.clarin.eu

CLARIN office: [email protected]

CLARIN Newsletter: http://www.clarin.eu/newsletter

CLARIN members: http://www.clarin.eu/members

Page 21

Thanks for your attention.

Page 22

CLARIN Usage Scenario

Scenario: A Serbian and a German PhD student want to study language variation in the Balkan area

Resource: via VLO they find all relevant language variation data for that area

Tools/Services: Modern clustering methods available via the web allow them to quickly build dialect continua on top of a geographic map; visualization services allow them to pipeline this to get a nice output
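
The slide does not say which clustering method is meant. As one possible illustration, the sketch below applies single-linkage agglomerative clustering to a small table of pairwise linguistic distances between survey sites. The site names `A` to `D`, the distance values and the function names are invented for the example and are not CLARIN data or services.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Tuple

# Hypothetical pairwise linguistic distances between four survey sites
# (0 = identical varieties). Values are invented for illustration.
DISTANCES: Dict[FrozenSet[str], float] = {
    frozenset({"A", "B"}): 0.10,
    frozenset({"A", "C"}): 0.60,
    frozenset({"A", "D"}): 0.70,
    frozenset({"B", "C"}): 0.55,
    frozenset({"B", "D"}): 0.65,
    frozenset({"C", "D"}): 0.20,
}


def cluster_distance(c1: Tuple[str, ...], c2: Tuple[str, ...]) -> float:
    """Single linkage: distance between the two closest members."""
    return min(DISTANCES[frozenset({a, b})] for a in c1 for b in c2)


def agglomerate(sites: List[str], n_clusters: int) -> List[Tuple[str, ...]]:
    """Repeatedly merge the two closest clusters until n_clusters remain."""
    clusters = [(s,) for s in sites]
    while len(clusters) > n_clusters:
        c1, c2 = min(combinations(clusters, 2),
                     key=lambda pair: cluster_distance(*pair))
        clusters = [c for c in clusters if c not in (c1, c2)]
        clusters.append(tuple(sorted(c1 + c2)))
    return sorted(clusters)


groups = agglomerate(["A", "B", "C", "D"], n_clusters=2)
print(groups)
```

Each resulting group could then be handed to a visualization service and drawn as one coloured region on the map, which is the pipelining step the scenario describes.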

Page 23

Visualization of Dialect Data: Clustering

Page 24

CLARIN Usage Scenario

Scenario: Linguists, sociologists and ethnologists want to study the cultural and linguistic differences of parliament debates in SE, DE and GR about the swine flu and compare how such global problems are dealt with

Resource: building a virtual collection of all debates (audio, video, transcription)

Tools/Services: allowing researchers to analyse and annotate gestures, intonation, word choices, timing etc., which partly requires powerful computers

Vision: in 2011/12 such computational services will be made available in CLARIN 2011

