+ All Categories
Home > Documents > Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Date post: 01-Apr-2015
Category:
Upload: lexus-eddie
View: 222 times
Download: 2 times
Share this document with a friend
12
Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari
Transcript
Page 1: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Quranic Arabic CorpusData Mining & Text Analytics

By Ismail Teladia & Abdullah Alazwari

Page 2: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Introduction What is the Quran?

Holy book for Muslims Revealed from 610 AD 6,236 verses, 114 chapters

Corpus Definition. Written or spoken language

What is the Quranic Arabic Corpus? 77,430 words of Quranic Arabic Researcher: Kais Dukes

Page 3: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Features of QAC: Morphological Annotation

Syntactic Treebank

Semantic Ontology

Page 4: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Morphological Annotation Word By Word

Grammar Syntax Morphology

Part-of-speech tagging Natural Language

Computing Technology

Page 5: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Details of Word’s Grammar Clicking the word gives more detail:

Type of WordTranslationGenderCaseRoot

In addition it shows the verse in which word appears and sound recitation of the verse.

Page 6: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Syntactic Treebank Verse by verse dependency graphs

Meaning of verse (broken down) Sentence structure (dependencies) Case

Mathematical graph theory

Page 7: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Ontology of Concepts Knowledge representation Relationship between concepts Historic places and people Named entity tagging E.g. Sun, Moon, Star, Earth classified

under “Astronomical Body” Uses predicate logic

Page 8: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Visual Representation of Ontology 300 linked concepts with 350 relations

Page 9: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Conclusion Uses of the QAC:

Analysing Arabic text of each verse Linking Arabic words through

dependencies Finding relationships between concepts

Website used daily by 2,500 people from 165 countries

Page 10: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Map Showing Usage of QAC

Page 11: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Bibliography http://corpus.quran.com

Page 12: Quranic Arabic Corpus Data Mining & Text Analytics By Ismail Teladia & Abdullah Alazwari.

Thank you for listening!


Recommended