Major objectives for data management
16 May-2011
• Ability to be cited, thereby generating incoming links via CrossRef
• Integration with other publication types to create an integrated information service
• Pushing metadata to information/knowledge management channels (e.g. MARC)
• Pushing metadata to discovery channels (e.g. RePEc, EconLit, et al.)
OECD Publishing’s approach
• A two-step approach: from standards implementation to online publishing and citing
• Consolidating the foundations: a continuing process of linking data with publications
• Next challenges to increase discovery and use …
A two-step approach
From standards implementation
• In 2008-2009: aggregation of datasets & data tables in a central bibliographic database including OECD books and papers
• Development of standards for bibliographic management and citing of datasets and data tables
Green, T (2009), “We Need Publishing Standards for Datasets and Data Tables”, OECD Publishing White Paper, OECD Publishing. doi: 10.1787/603233448430, http://dx.doi.org/10.1787/603233448430
Data Concepts
[Diagram: a collection of collections of datasets (with an ISSN and a DOI) contains collections of datasets (each with a DOI), which in turn contain datasets (each with a DOI). Node labels in the diagram: DB, DPP, DS.]
Two concepts are required for managing datasets:
• dataset (either part of a collection or stand-alone)
• serial collection (of datasets, or of collections of datasets)
Stand-alone dataset: subject to subscription, with its own ISSN and DOI.
Agreed definitions
Collection:
• Consists of more than one dataset, or of collections of datasets
• Has an ISSN
• Has a DOI
• Is subject to subscription
Collection of datasets:
• Belongs to a top collection
• Has a DOI
• Does not have an ISSN
Dataset:
• A content type (a group of related data, such as an OECD.Stat cube), published either as part of a collection or stand-alone (in which case it can be subject to subscription and has an ISSN)
• Has a DOI
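The three definitions can be sketched as a small data model. This is only an illustration of the rules listed on the slide, not OECD's actual schema: the class names, the field names and the placeholder ISSN value are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Union


@dataclass
class Dataset:
    title: str
    doi: str                     # every dataset has a DOI
    issn: Optional[str] = None   # only a stand-alone dataset carries an ISSN

    @property
    def is_stand_alone(self) -> bool:
        return self.issn is not None


@dataclass
class DatasetCollection:
    title: str
    doi: str                     # has a DOI but no ISSN field at all
    datasets: List[Dataset] = field(default_factory=list)


@dataclass
class TopCollection:
    title: str
    doi: str
    issn: str                    # the top collection is the serial, so ISSN is required
    members: List[Union[Dataset, DatasetCollection]] = field(default_factory=list)


# Hypothetical values: the DOI strings reuse the syntax shown in the slides,
# and "0000-0000" is a placeholder, not a real ISSN.
world_prices = Dataset("World prices", "10.1787/data-00217-en")
outlook = DatasetCollection("OECD-FAO Agricultural Outlook", "10.1787/example-sub",
                            [world_prices])
agriculture = TopCollection("OECD Agriculture Statistics", "10.1787/agr-data-en",
                            "0000-0000", [outlook])
```

Making `issn` mandatory on the top collection and absent on the sub-collection lets the type definitions themselves carry the "has an ISSN" / "does not have an ISSN" rules.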
Agreed DOI syntax
Collection (of collections of datasets)
• DOI suffix = <CollectionAcronym>-data-<LanguageISO2Code>
• e.g. agr-data-fr
Dataset (including stand-alone dataset managed as serial)
• DOI suffix = data-<DatasetOrderNumber on 5 digits>-<LanguageISO2Code>
• e.g. data-00023-en
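The two suffix patterns are simple enough to generate and check mechanically. The sketch below is a minimal illustration, not OECD's tooling: the function names are hypothetical, and lower-casing the acronym and language code is an assumption based on the two examples given.

```python
import re


def collection_doi_suffix(acronym: str, lang: str) -> str:
    """<CollectionAcronym>-data-<LanguageISO2Code>, e.g. agr-data-fr."""
    return f"{acronym.lower()}-data-{lang.lower()}"


def dataset_doi_suffix(order_number: int, lang: str) -> str:
    """data-<DatasetOrderNumber on 5 digits>-<LanguageISO2Code>, e.g. data-00023-en."""
    if not 0 <= order_number <= 99999:
        raise ValueError("order number must fit in 5 digits")
    return f"data-{order_number:05d}-{lang.lower()}"


# Assumed pattern for reading a dataset suffix back into its parts.
_DATASET_SUFFIX = re.compile(r"data-(\d{5})-([a-z]{2})")


def parse_dataset_doi_suffix(suffix: str):
    """Return (order_number, language) or raise ValueError if the suffix is malformed."""
    match = _DATASET_SUFFIX.fullmatch(suffix)
    if match is None:
        raise ValueError(f"not a dataset DOI suffix: {suffix!r}")
    return int(match.group(1)), match.group(2)


print(collection_doi_suffix("agr", "fr"))   # agr-data-fr
print(dataset_doi_suffix(23, "en"))         # data-00023-en
```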
What do we cite?
Only datasets are cited.
Citation of a DYNAMIC DATASET belonging to a collection of datasets
<copyright owner> (<year of last update date>), "<default parent subcollection main title>: <dataset main title>: <dataset subtitle>", <default parent top collection main title> (database). doi: <doiprefix>/<doisuffix> (Accessed on dd month yyyy)
e.g. (dataset subtitle and joint copyright OECD and FAO)
OECD/FAO (2008), "OECD-FAO Agricultural Outlook: World prices", OECD Agriculture Statistics (database). doi: 10.1787/data-00217-en (Accessed on 21 December 2008)
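The citation template can be filled in programmatically. The following is a rough sketch under stated assumptions: the function and parameter names are invented for illustration, the quoted title is passed as a list of parts so that no guess is made about which part is the subcollection title, main title or subtitle, and a space is placed after "(database)." and before "(Accessed".

```python
from datetime import date
from typing import Sequence

# Month names are spelled out here so the output does not depend on the system locale.
MONTHS = ("January", "February", "March", "April", "May", "June", "July",
          "August", "September", "October", "November", "December")


def cite_dynamic_dataset(owner: str, last_update_year: int,
                         title_parts: Sequence[str], top_collection: str,
                         doi: str, accessed: date) -> str:
    """Format a dynamic-dataset citation following the template above."""
    quoted_title = ": ".join(title_parts)
    accessed_text = f"{accessed.day:02d} {MONTHS[accessed.month - 1]} {accessed.year}"
    return (f'{owner} ({last_update_year}), "{quoted_title}", '
            f"{top_collection} (database). doi: {doi} "
            f"(Accessed on {accessed_text})")


citation = cite_dynamic_dataset(
    owner="OECD/FAO",
    last_update_year=2008,
    title_parts=["OECD-FAO Agricultural Outlook", "World prices"],
    top_collection="OECD Agriculture Statistics",
    doi="10.1787/data-00217-en",
    accessed=date(2008, 12, 21),
)
print(citation)
```

Because the dataset is dynamic, the access date is a required argument rather than an optional one: it is the only part of the citation that records which state of the data was consulted.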
Citation of a KEY TABLE EDITION (yearly edition of a key table, belonging to a key table collection)
OECD is the author and publisher. The table belongs to a key table collection of type “Theme”.
<author physical/institutional> (<year of publication date>), “<key table title>”, <key table collection title>, No. <key table order number>. doi: <doiprefix>/<key table edition doisuffix>
e.g. table belonging to OECD Key Tables on Taxation:
OECD (2009), “Income tax plus employee social security contributions”, OECD Key Tables on Taxation, No. 1. doi: 10.1787/16097319-2009-table1 (Accessed on 02 February 2009)
The same standardization applies to data tables.
Consolidating the foundations
A continuing process of linking data with publications
• Cross-referencing, but also…
• Internal linking within the OECD publications catalogue
[Figure: Overview of links management in KAPPA between books, papers and statistical content. Item labels shown: Serial, Statistical Collection (Main Eco. Indicators), Statistical Periodical or Annual, Book, Chapter/article, Publication component, Table/Graph, Key table (Quarterly unit labour cost), Dataset, Publication, External Resource (IMF Data Mapper), Business Tendency Surveys: A Handbook. Link labels shown: Related database, Related periodical (with cardinalities 1 and N), Has Source/Method, Is Source/Method of, Datasource, External link: Related Website.]
Legend:
• One-way link
• Bidirectional link: must be entered in a given direction in KAPPA (the full arrow represents the link that is entered, and the dotted arrow represents the reciprocal link, which is created automatically)
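The legend's rule for bidirectional links (enter one direction, get the reciprocal automatically) can be sketched as follows. This is not KAPPA's implementation: the `LinkStore` class is hypothetical, the pairing of "Related database" with "Related periodical" is an assumption read off the link labels, and "Datasource" is treated as one-way purely for the sake of the example.

```python
from collections import defaultdict

# Assumed reciprocal pairs, inferred from the link labels in the figure.
RECIPROCAL = {
    "Has Source/Method": "Is Source/Method of",
    "Is Source/Method of": "Has Source/Method",
    "Related database": "Related periodical",
    "Related periodical": "Related database",
}


class LinkStore:
    """Holds typed links between catalogue items (hypothetical structure)."""

    def __init__(self):
        self._links = defaultdict(set)  # item -> {(link_type, target)}

    def add_link(self, source: str, link_type: str, target: str) -> None:
        self._links[source].add((link_type, target))
        reciprocal = RECIPROCAL.get(link_type)
        if reciprocal is not None:
            # Bidirectional link: the reverse direction is created automatically.
            self._links[target].add((reciprocal, source))

    def links_of(self, item: str):
        return sorted(self._links.get(item, ()))


store = LinkStore()
store.add_link("periodical-A", "Related database", "database-A")  # bidirectional
store.add_link("key-table-1", "Datasource", "dataset-1")          # one-way here
print(store.links_of("database-A"))  # [('Related periodical', 'periodical-A')]
```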
Provision of MARC records for datasets
• The MARC records are provided in MARCXML for:
– dataset (within a collection, or stand-alone)
– statistical collection
– key table
– key table collection
• MARC records are generated in English only, and describe the online version of a publication/serial.
MARC fields by content type (columns: Book, Article, Key table & collection, Statistical collection, Dataset). A row with five x marks applies to all five content types; for rows with fewer marks, the column positions are not preserved here.
Tag  Field  Marks
020  ISBN  x x
022  ISSN  x x x
024  Other ident. (DOI)  x x x x x
040  Cataloguer  x x x x x
100  Author Main (ind)  x x
110  Author Main (org)  x x
245  Title  x x x x x
246  Title, varying form  x x x x x
250  Edition  x x
260  Publisher  x x x x x
300  Physical Desc.  x x
310  Publication Freq.  x x x
362  Start/End  x x x
490  Series  x x
505  Contents  x x
520  Abstract  x x x x x
650  Theme  x x x x x
651  Country  x x x x x
700  Author 2nd. (ind)  x x
710  Author 2nd. (org)  x x x x x
760  Main series  x x
762  SubSeries  x x
773  Host Item  x x x
774  Constituent Item  x
775  Other Lang.  x x x x x
780  Continues  x
785  Continued by  x
830  Series  x x
856  URL  x x x x x
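To make the field list concrete, here is a minimal sketch of a MARCXML record for a dataset, built with Python's standard library. It is not OECD's actual record layout: it covers only four of the fields above (024, 245, 260, 856), omits the leader and control fields, and the helper names, indicator choices and sample values are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"  # MARCXML namespace
ET.register_namespace("marc", MARC_NS)


def _datafield(record, tag, ind1, ind2, subfields):
    """Append one <datafield> with its <subfield> children to the record."""
    field = ET.SubElement(record, f"{{{MARC_NS}}}datafield",
                          {"tag": tag, "ind1": ind1, "ind2": ind2})
    for code, value in subfields:
        sub = ET.SubElement(field, f"{{{MARC_NS}}}subfield", {"code": code})
        sub.text = value
    return field


def build_dataset_record(doi, title, publisher, url):
    record = ET.Element(f"{{{MARC_NS}}}record")
    # 024 with first indicator 7: identifier whose source is named in subfield 2.
    _datafield(record, "024", "7", " ", [("a", doi), ("2", "doi")])
    _datafield(record, "245", "0", "0", [("a", title)])
    _datafield(record, "260", " ", " ", [("b", publisher)])
    _datafield(record, "856", "4", "0", [("u", url)])
    return record


record = build_dataset_record(
    doi="10.1787/data-00217-en",
    title="World prices",
    publisher="OECD Publishing",
    url="http://dx.doi.org/10.1787/data-00217-en",
)
print(ET.tostring(record, encoding="unicode"))
```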
Next challenges to increase discovery & use
• Expand the definition of dynamic datasets, and adapt citing/bibliographic standards accordingly:
– updating datasets: datasets which are continuously updated in a dynamic way
– regular dataset editions: datasets which are not updating resources but are published in separate editions rather than as an integrating resource which is continuously updated
• Manage online archived datasets
• Disseminate dataset records on RePEc, the world’s largest collection of papers in economics