Date post: | 13-Dec-2015 |
Category: |
Documents |
Upload: | stewart-oneal |
View: | 216 times |
Download: | 0 times |
D-square
Digital Databases and Tools for Dutch Regional Dictionaries
Folkert de Vriend- Methods XII, Moncton, Canada, 2005 -
1. Agricultural terminology
2. Other technical or craft terminologies
3. Common vocabulary
Macrostructure WBD & WLD
Linguistic information in entries:• Phonetic (vol. I&II) or lexical (vol. III) variant• Heteronym (functions as headword)• Lexical meaning (word, or description of a
concept)
Non-linguistic information in entries:• Place name
Microstructure WBD & WLD
1. Conversion to a new format
2. End user access to data
3. Enrichment of data
4. Data management
Overview phases D-square
Deel IIIMS-Word
Editors
/
Management
Users
Analog Digital Digital Analog
(parts of)Vol. I+IIMS-Word
Filing cards
WebsiteWBD/WLD
with tools forsearching andcartography
EnricheddataXML
Raw dataFileM Pro
(parts of)Vol. I+IIMacWrite
Questionnaires Nijmegen and
Leuven
Questionnaires (chiefly) Meertens
Raw data
Vol. I + II Vol. III
Edited data
Specializedprint editions (dialect atlas
or local dictionary)
Online DB WBD
(Polderland)
Edited dataXML
Vol. IIIFileM Pro
SGV on CD(Polderland)
Vol. III
Reasoning behind new encoding
• XML, not relational database
• Tailored to WBD and WLD
• Flexible enough to be used for other dialect dictionaries
• Standards: ISO TC 37/SC 4 DCR and LMF. (DCR = flat concept registries)
Example XML-encoding<LEXICON><ENTRY><CONCEPT ontol_ID="492"> <WORD lang=“Dutch”>Meikever</WORD> <ILLUSTRATION uri="meikever.jpg"/> <DESCRIPTION></DESCRIPTION></CONCEPT><VARIANTS>
<VARIANT type="heteronym">Bakkertje <VARIANT type="lexical">Bakkerke
<VARIANT type="raw" import="diplomatic" source1="N83"> bakkərkə<LOCATION>K178</LOCATION></VARIANT>
</VARIANT> </VARIANTS></ENTRY> …</LEXICON>
XML-encoding for WALD<LEXICON><ENTRY><CONCEPT ontol_ID="492"> <WORD lang=“Achterhoeks”>Hals</WORD>
<DESCRIPTION></DESCRIPTION></CONCEPT><VARIANTS> <VARIANT type="heteronym">Hals
<LOCATION>Gor</LOCATION><LOCATION>Harf</LOCATION><LOCATION>Alf</LOCATION><LOCATION>Eef</LOCATION><LOCATION>...</LOCATION>
</VARIANT> <VARIANT type="heteronym">Kael
<LOCATION>Gor</LOCATION><LOCATION>Eef</LOCATION><LOCATION>Wich</LOCATION><LOCATION>Gen</LOCATION><LOCATION>...</LOCATION>
</VARIANT></VARIANTS></ENTRY></LEXICON>
XML-encoding for WZD<LEXICON><ENTRY><CONCEPT ontol_ID="292"> <WORD lang=“Dutch”>de haard</WORD>
<DESCRIPTION></DESCRIPTION></CONCEPT><VARIANTS>
<VARIANT type="raw">'aerd (d'n)<LOCATION>Z.eil</LOCATION><LOCATION>Z.V.W.</LOCATION><LOCATION>L.v.Ax</LOCATION><LOCATION>Hlt</LOCATION><LOCATION>...</LOCATION>
</VARIANT><VARIANT type="raw">êêrd
<LOCATION>Z.V.W.</LOCATION></VARIANT>
</VARIANTS></ENTRY></LEXICON>
Existing ontology can be used
Ontology Concept field in Dictionaries
Elbow
Finger Nail Hand Palm
“Word in standard language”(WLD/WBD/WVD)
“Word in language of the dialect”(Achterh.+Lie)
“Picture” (Achterh.+Lie)