an integrated approach to online dictionary and ontology building for austronesian languages in...

36
An integrated approach to online dictionary a nd ontology building for Austronesian Languag es in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien Yang, Providence University, Taiwan Hui-Huan Ann Chang, Providence University, Taiwan Maa-Neu Dong, National Museum of Natural Sciences, Taiw an

Post on 21-Dec-2015

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

An integrated approach to online dictionary and ontolog

y building for Austronesian Languages in Taiwan

D. Victoria Rau, Wheaton College, U.S.A.Meng-Chien Yang, Providence University, Taiwan

Hui-Huan Ann Chang, Providence University, TaiwanMaa-Neu Dong, National Museum of Natural Sciences, Taiwan

Page 2: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

Outline

1. Introduction

2. A Trinitarian Model

3. Online Dictionaries

4. Yami Fish Ontology

5. Conclusion

Page 3: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

1. Introduction

• Yami corpora

Yami language archive

http://yamiproject.cs.pu.edu.tw/yami

Yami e-Learning

http://yamiproject.cs.pu.edu.tw/elearn

• Indigenous language revitalization

a “trinitarian” modela “trinitarian” model

Page 4: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

2. The Trinitarian Model

Trinitarian ModelTrinitarian Model

Language activistsLanguage activists Linguists Linguists Computer scientists Computer scientists

Page 5: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

3. Online dictionaries

• Three versions of Yami online dictionaries

1. Digital Archiving Yami Language Documentation

(funded by SOAS ) http://yamiproject.cs.pu.edu.tw/yami/database.htm

2. Yami Language Archiving (funded by the SOAS) http://yamiproject.cs.pu.edu.tw/elearn/search.php

3. Yami Learning Dictionary (funded by the CIP) (1) Lexique Pro software version (2) The participatory Wiki dictionary

http://yamibow.cs.pu.edu.tw

Page 6: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

Digital Archiving Yami Language Documentation

• Keyword search from the texts gathered for digital archiving Yami language documentation

Page 7: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

Yami Language Archiving

• A concise online Yami-Chinese-English dictionary

Page 8: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

Yami Learning Dictionary

Link

HomeLink

Page 9: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

3.1 The Lexique Pro software version

• 1786 lexical entries• 780 roots • 1006 derivatives

Page 10: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• An English index

Page 11: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• An index organized by Chinese pinyin spelling

Page 12: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• An index organized by semantic categories

Page 13: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

3.2 The Participatory Wiki dictionary

• The structure of Web 2.0 style version dictionary

Page 14: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

The search page of the web 2.0 style dictionary-1

Page 15: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

The search page of the web 2.0 style dictionary-2

Page 16: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

4. Yami Fish Ontology

• 109 Yami fish with Chinese, English, and Latin name

• Toolbox Lexique Pro Protégé

• “Ontology 101 development process” by Noy and McGuinness (2001)

Page 17: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

4.1 Yami Fish Names

• Motivation– Finding the perspective and semantics of Y

ami fish names– Reinterpreting the fish classification of Yam

i– Constructing the indigenous knowledge of f

ish

Page 18: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

4.2 Methodology

• 7 steps of constructing the ontology from “Ontology 101 development process” by Noy and McGuinness (2001)

• 7 steps:1. Determine the domain and scope of the ontology

2. Consider reusing existing ontology

3. Enumerate important terms in the ontology

4. Define classes and the class hierarchy

5. & 6. Define the properties of classes and define the facets of the slots

7. Create instances

Page 19: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

(1) Determine the domain and scope of the ontology

1. Which fish are edible and inedible for Yami people?

2. Which gender can eat what kind of fish?

3. What kind of fish can be eaten by Yami elderly males?

4. What kind of fish can Yami pregnant women eat?

Page 20: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

(2) Consider reusing existing ontologies

The Fish Database of Taiwan (http://fishdb.sinica.edu.tw/ )

(3) Enumerate important terms in the ontology (a) classification of Yami fish: anito “inedible fish”;

types of edible fish: raet “fish for men”, oyod “fish for women”,

kakanen no rarakeh “fish for old men”;

(b) named Yami fish, such as ilek “rudderfish”, cilat “jackfish”;

(c) Yami people: men, women, and old men;

(d) women of three stages: not pregnant, pregnant, and breast feeding

Page 21: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

(4) Define classes and the class hierarchy

Yami fish

Yami people

oyod a among

raet a among

kakanen no rarakeh

among no anito

Men +(can eat)

+ - -

Women + - (cannot eat)

- -

Old men + + + -

• The classification of Yami fish

Page 22: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

(5) & (6) Define the properties of classes and slots

and define the facets of the slots

The object properties The datatype properties

Page 23: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

(7) Create instances (individuals)

• An Example of a class editor

Page 24: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• OntoViz display for paloy fish

Page 25: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• An initial ontology of Yami fish

Page 26: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

4.3 Yami fish ontology

• Hierarchy of Yami fish onotology

Page 27: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• OWLViz Displaying the Inferred Hierarchy

Page 28: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• OWLViz Displaying the Asserted Hierarchy

Page 29: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• OWLViz Display for anito_Class and kakanen_no_rarakeh_Class

Page 30: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

OWLViz Display for rahet_Class and oyod_Class

Page 31: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

Ontology browser window generated by Protégé

Page 32: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

The OWL document generated for the Yami fish ontology

• http://yamibow.cs.pu.edu.tw/fish_en/index.html

Page 33: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

4.4 Limitations and future research

(1) selection of a text on a semantic domain

(2) reconstruction of the IK by building a network relationship of the semantic domain independently by both the linguist and the language activist to achieve high reliability

(3) transformation of the final diagram of the network relationship into the Protégé.

Page 34: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

• Analysis by Protégé of a text about taro planting

Page 35: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

5. Conclusion

• The trinitarian model of developing three Yami online dictionaries

• A wiki dictionary

• An ontology of Yami fish names, with the goal of building a collective knowledge system for the Yami language

• Ongoing project 1. An online encyclopedia in Yami 2. The semantic infrastructure of the Yami language

Page 36: An integrated approach to online dictionary and ontology building for Austronesian Languages in Taiwan D. Victoria Rau, Wheaton College, U.S.A. Meng-Chien

Ayoy!

Thank you!