Mapping endangered records of endangered cultures, or We have harvesters but not enough fruit. Nick Thieberger, School of Languages and Linguistics, University of Melbourne. Charting Vanishing Voices: A Collaborative Workshop to Map Endangered Oral Cultures: WOLP 2012 Workshop.
Transcript
Page 1: Mapping endangered records of endangered cultures or We have harvesters but not enough fruit Nick Thieberger School of Languages and Linguistics University.

Mapping endangered records of endangered cultures

or

We have harvesters but not enough fruit

Nick Thieberger

School of Languages and Linguistics

University of Melbourne

Charting Vanishing Voices:

A Collaborative Workshop to

Map Endangered Oral Cultures:

WOLP 2012 Workshop

Page 2

Metrics (June 2012)

274 collections, of which 181 are publicly available

8,268 items, of which 7,637 are publicly available

59,987 files

Size: 6.04 TB

Time: 3,390 hours

716 languages represented in the collection, from 65 countries

Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC)

Page 3

Collaborative archiving project begun in 2002

Team made up of linguists and musicologists

Three universities in a consortium (Sydney, Melbourne, ANU)

Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC)

Page 4

Endangered records

Too little is recorded in most of the world’s languages

Much of what is recorded is not being looked after properly

We can’t even find what has been recorded

How can we change that?

Page 5

Too little is recorded in most of the world’s languages

How much fieldwork is going on?

• Newman (1992 and 2004) reports 34 US departments running field methods courses

• LLL conference 2009 – 180 abstracts

• 2nd International Conference on Language Documentation and Conservation 2011 – 230 abstracts

Page 6

How much fieldwork is going on?

• Assume at least 100 current fieldwork-based linguistic projects

• Since 1960, assuming 50 per year, there should be reasonable records of 2500 languages

• Recordings, texts, dictionaries

– paper and digital (from the late 1980s onwards)

Too little is recorded in most of the world’s languages
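
The back-of-envelope estimate on this slide can be checked in a few lines. This is only a sketch: it assumes every project documents a different language and takes 2010 as a round end year, neither of which is stated on the slide.

```python
# Rough check of the estimate: about 50 fieldwork projects a year since 1960.
START_YEAR = 1960
END_YEAR = 2010          # assumption: round end year, not given on the slide
PROJECTS_PER_YEAR = 50   # figure assumed on the slide


def expected_languages(start: int, end: int, per_year: int) -> int:
    """Upper bound on documented languages if each project covers a new language."""
    return (end - start) * per_year


print(expected_languages(START_YEAR, END_YEAR, PROJECTS_PER_YEAR))  # 2500
```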

Page 7

• Not even all funded projects are producing well-formed records

– Well formed means described, archived and accessible

e.g., ELDP funded 264 projects [1], but ELAR has somewhere around 110 deposits [2]

[1] http://www.hrelp.org/grants/projects/index.php?year=all

[2] http://www.paradisec.org.au/blog/2012/04/elar-update-update

Too little is recorded in most of the world’s languages

Page 11

• More recording by non-linguists is necessary

• New methods (e.g., Basic Oral Language Documentation - BOLD) that could include more recording by speakers

• Social media as a source of recordings/texts/etc

• How to ensure this kind of recording has longevity?

Too little is recorded in most of the world’s languages

Page 12

There should be reasonable records of 2500 languages

• Where are they?

• How do we find them?

Page 14

What is recorded is not being looked after properly

Digital recordings are more fragile than analog ones, but most are not being archived

Page 15

We can’t even find what has been recorded

Harvesting tools:

WorldCat http://www.oclc.org/worldcat

LLMap (Linguist List, USA) http://www.llmap.org

Multitree http://multitree.org

UNESCO Atlas http://www.unesco.org/culture/languages-atlas

ELCat / Endangered Language Catalog http://www.endangeredlanguages.com

Page 16

Aggregated information

http://oralliterature.org/database, since mid-2010

Page 17

We can’t even find what has been recorded

Language codes as a basis for searching – ISO-639-3, three-letter codes

Typically not used by most repositories (small regional libraries, State libraries, Film and Sound archives)
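
Why codes help can be sketched as follows: catalogues that store free-text language names file the same language under different names, while a three-letter code gives one search key. This is a minimal illustration only. The lookup table, the `CatalogRecord` type and the sample records are hypothetical; a real table would come from the ISO 639-3 registry.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical, hand-made table of name variants mapped to ISO 639-3 codes.
NAME_TO_CODE = {
    "tok pisin": "tpi",
    "neo-melanesian": "tpi",   # older name for Tok Pisin
    "bislama": "bis",
    "bichelamar": "bis",       # older name for Bislama
}


@dataclass
class CatalogRecord:
    title: str
    language_name: str  # free text, as entered by the holding institution


def to_iso639_3(name: str) -> Optional[str]:
    """Normalise a free-text language name to a code, or None if unknown."""
    return NAME_TO_CODE.get(name.strip().lower())


def search_by_code(records: List[CatalogRecord], code: str) -> List[CatalogRecord]:
    """Return every record whose language name resolves to the given code."""
    return [r for r in records if to_iso639_3(r.language_name) == code]


# Invented sample records, for illustration only.
records = [
    CatalogRecord("Village stories, tape 1", "Tok Pisin"),
    CatalogRecord("Mission hymnal", "Neo-Melanesian"),
    CatalogRecord("Radio broadcast", "Bislama"),
]

print([r.title for r in search_by_code(records, "tpi")])
```

A search on the name "Tok Pisin" alone would miss the second record; a search on the code finds both.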

Page 18

British Library

We can’t even find what has been recorded

Page 19

National Library of Australia

We can’t even find what has been recorded

Page 20

Vienna Phonogrammarchiv

We can’t even find what has been recorded

Page 22

Online searching for language material

e.g., ‘Lewo’ as a language name?

Google – ‘Lewo’ – 3,080,000 hits

Google – ‘Lewo grammar’ – 2,200 hits

Open Language Archives Community (OLAC) – ‘Lewo’ 13 hits

Page 23

OLAC search result

Page 26

What else is out there?

• Items held in personal collections can’t be located

• speakers who recorded their families

• missionaries

• patrol officers

• These could be listed in catalogs, even if online access is restricted

Page 27

Existing resources = low-hanging fruit

e.g., http://anglicanhistory.org/oceania/

Page 32

Existing resources = low-hanging fruit

Problems of longevity of website-based data sources

Use the Internet Archive for a persistent identifier
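
One way to obtain a stable reference for a web page is to look up its closest snapshot through the Wayback Machine availability endpoint. The sketch below is an illustration, not a method described in the talk. The helper names are invented, and the response layout (`archived_snapshots` → `closest` → `url`) is assumed to follow the Internet Archive's published format.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import urlopen

AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(url: str, timestamp: str = "") -> str:
    """Build the query URL; timestamp is optional, e.g. YYYYMMDD."""
    params = {"url": url}
    if timestamp:
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response: dict) -> Optional[str]:
    """Extract the snapshot URL from a decoded response, or None if absent."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"]
    return None


def lookup(url: str, timestamp: str = "") -> Optional[str]:
    """Query the endpoint over the network and return the snapshot URL, if any."""
    with urlopen(availability_query(url, timestamp), timeout=30) as resp:
        return closest_snapshot(json.load(resp))
```

Calling `lookup("anglicanhistory.org/oceania/")` would return a `web.archive.org` address if a snapshot exists; that address can be cited in a catalog entry in place of the live URL.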

Page 34

Endangered recordings

• Linguists need a shared infrastructure in which to locate their recordings

– to make them discoverable

– to provide standard descriptions which can be located by standard search mechanisms

– to enter metadata before it is forgotten

Page 35

ExSite9

Metadata creation without (too many) tears

File browser – assigning attributes to files created in fieldwork

Application writes an XML file capturing relationships expressed by ‘drag and drop’ in the browser

XML file submitted to an archive’s catalog

From the laptop to the archive
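
The slide describes an application that writes an XML file from attributes assigned to fieldwork files. The sketch below illustrates that general idea only. The element names (`collection`, `file`), the attribute keys and the sample file are invented for illustration and are not ExSite9's actual schema.

```python
import xml.etree.ElementTree as ET
from typing import Dict


def build_metadata(collection_id: str, files: Dict[str, Dict[str, str]]) -> str:
    """Serialise per-file attributes into a small XML document (hypothetical layout)."""
    root = ET.Element("collection", id=collection_id)
    for name, attrs in files.items():
        item = ET.SubElement(root, "file", name=name)
        # One child element per attribute, in a stable (sorted) order.
        for key, value in sorted(attrs.items()):
            ET.SubElement(item, key).text = value
    return ET.tostring(root, encoding="unicode")


# Invented example: one recording tagged with a language code and a speaker label.
xml_text = build_metadata(
    "demo",
    {"story01.wav": {"language": "bis", "speaker": "Speaker A"}},
)
print(xml_text)
```

Describing files at the point of creation, on the laptop, is what lets the metadata be entered "before it is forgotten".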

Page 40

In development in mid-2012

Cross-platform tool

Expected release later in 2012

ExSite9

Page 41

EOPAS – Delivery of text and media

Encourage deposit of text and media

- Provide presentation formats for recorded texts

- Based on a linguist’s normal workflows

Record > Transcribe (ELAN) > Interlinearise (Toolbox) > XML output > EOPAS

http://linguistics.unimelb.edu.au/research/projects/eopas/

Page 42

Metadata

Playable media

http://www.eopas.org/transcripts/55

Page 43

Selected text

Keyword in Context / Concordance in all texts of that language

http://www.eopas.org/transcripts/55
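
A keyword-in-context (concordance) view of the kind named on this slide can be sketched as follows. This is a generic illustration, not EOPAS's implementation. The function name `kwic`, the whitespace tokenisation and the English placeholder texts are all assumptions.

```python
from typing import Dict, List, Tuple


def kwic(texts: Dict[str, str], keyword: str, width: int = 2) -> List[Tuple[str, str, str, str]]:
    """Return (text id, left context, keyword, right context) for every hit."""
    hits = []
    for text_id, text in texts.items():
        tokens = text.split()
        for i, token in enumerate(tokens):
            if token.lower() == keyword.lower():
                left = " ".join(tokens[max(0, i - width):i])
                right = " ".join(tokens[i + 1:i + 1 + width])
                hits.append((text_id, left, token, right))
    return hits


# Placeholder texts; a real corpus would hold all transcripts in one language.
texts = {
    "text1": "the old man went to the garden",
    "text2": "she saw the garden from the hill",
}

for text_id, left, word, right in kwic(texts, "garden"):
    print(f"{text_id}: {left} [{word}] {right}")
```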

Page 44

Ability to turn off morphemic view

http://www.eopas.org/transcripts/55

Page 45

Reference to morpheme-level

http://www.eopas.org/transcripts/55

Page 46

Reference to timed chunk

http://www.eopas.org/transcripts/55

Page 47

Stories

Recorded by researchers

Strong source community interest in hearing recordings and reading texts

Stored in digital archives

Digitised from analog sources

Page 48

Central harvesting by language code (ISO-639-3)
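
Harvesting by language code can be sketched as grouping records from several archives under their ISO 639-3 codes. The example assumes OLAC-style metadata in which `dc:language` carries an `olac:code` attribute. The `<records>` wrapper, the `archive` attribute and the sample records are invented for illustration and are not a real harvester's format.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict
from typing import Dict, List, Tuple

DC = "http://purl.org/dc/elements/1.1/"
OLAC = "http://www.language-archives.org/OLAC/1.1/"

# Invented sample feed; the last record has a language name but no code.
SAMPLE = f"""<records xmlns:dc="{DC}" xmlns:olac="{OLAC}">
  <record archive="Archive A">
    <dc:title>Story recording</dc:title>
    <dc:language olac:code="bis"/>
  </record>
  <record archive="Archive B">
    <dc:title>Radio broadcast</dc:title>
    <dc:language olac:code="tpi"/>
  </record>
  <record archive="Archive B">
    <dc:title>Word list</dc:title>
    <dc:language olac:code="bis"/>
  </record>
  <record archive="Archive C">
    <dc:title>Uncoded tape</dc:title>
    <dc:language>Bislama</dc:language>
  </record>
</records>"""


def index_by_language(xml_text: str) -> Dict[str, List[Tuple[str, str]]]:
    """Map each language code to the (archive, title) pairs that carry it."""
    index = defaultdict(list)
    root = ET.fromstring(xml_text)
    for record in root.iter("record"):
        title = record.findtext(f"{{{DC}}}title", default="(untitled)")
        for lang in record.iter(f"{{{DC}}}language"):
            code = lang.get(f"{{{OLAC}}}code")
            if code:  # records without a code cannot be harvested this way
                index[code].append((record.get("archive"), title))
    return dict(index)


print(index_by_language(SAMPLE))
```

The uncoded record from the hypothetical "Archive C" is silently left out of the index: material described without a language code stays invisible to this kind of harvest.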

Page 49

Stories in many of the world’s 7,000 languages

Page 51

Persuade linguists to create research data properly and to deposit their materials in archives

- create incentives in academia to create collections

Locate existing digital material and incorporate it into principled online catalogs

Locate analog collections, digitise them and incorporate them into principled online catalogs

Build example texts/media for as many languages as possible

Harvesting tools need something to harvest!

Page 52

http://paradisec.org.au


Page 53

http://www.nflrc.hawaii.edu/ldc/

