+ All Categories
Home > Documents > High-level Data Access Based on Query Rewritings

High-level Data Access Based on Query Rewritings

Date post: 01-Feb-2016
Category:
Upload: masato
View: 23 times
Download: 0 times
Share this document with a friend
Description:
High-level Data Access Based on Query Rewritings. Ekaterina Stepalina. Higher School of Economics. High-Level Data Access. Concentration on application domain tasks Abstraction from data sources Efficient work. Research - PowerPoint PPT Presentation
21
High-level Data Access Based on Query Rewritings Ekaterina Stepalina Higher School of Economics
Transcript
Page 1: High-level Data Access Based on Query Rewritings

High-level Data Access Based on Query Rewritings

Ekaterina StepalinaHigher School of Economics

Page 2: High-level Data Access Based on Query Rewritings

High-Level Data Access

• Concentration on application domain tasks• Abstraction from data sources • Efficient work

Research• This problem is actively considered on modern scientific conferences on knowledge

representation and ontologies – OWLED (2009), (ICDE IIMAS, 2008) , the Semantic Web magazine (2011 – the Mastro System)

• W3C developed OWL 2, OWL 2 QL (2008) and etc.

Page 3: High-level Data Access Based on Query Rewritings

Ontology-Based Data Access (OBDA)

• Large amounts of data (distributed, inconsistent)• Main task – query answering (domain-oriented

and efficient)

Page 4: High-level Data Access Based on Query Rewritings

What is Ontology?• Ontology is a knowledge domain described on some knowledge

representation language.• Entity-Relationship and UML Class diagrams can be seen as ontology

languages.

Page 5: High-level Data Access Based on Query Rewritings

Logic-Based Knowledge Representation

• Enables semantic processing of data• Enables inference of implicit knowledge• Well studied and actively developed– Description logics (Baader,1999), esp. DL-Lite

• Standardized– OWL 2 Profiles

Page 6: High-level Data Access Based on Query Rewritings

DL-Lite Best Suites for OBDA

• High expressive and computationally efficient • Allows delegating query answering to DBMSs

and using all advantages of modern relational technologies

• Supported by the W3C standard - OWL 2 QL

Page 7: High-level Data Access Based on Query Rewritings

Query Answering Problem

• Given a query and an n-tuple of objects from A. Decide, whether , or the n-tuple is the answer for with respect to K.

For knowledge represented in DL-Lite, we can formulate queries in domain concepts, translate them into ordinary SQL queries and perform over separate databases.

Page 8: High-level Data Access Based on Query Rewritings

OBDA System Architecture

• Ontology Editor• OBDA-Enabled Reasoner• Mapping Processor• Data Source Manager• Consistency Checker

Page 9: High-level Data Access Based on Query Rewritings

Query Rewritings• OBDA-Enabled Reasoner rewrites the initial

ontology query into a set of UCQ (union conjunctive query).

• Mapping Processor builds an SQL from UCQ and given mappings.

• The initial query syntax may differ (SparQL, datalog query, etc.)

Page 10: High-level Data Access Based on Query Rewritings

TBox and ABox in DL

• TBox is a finite set of concept and role inclusion axioms:

• ABox is a finite set of assertions:

• Where - object’s name, A – concept name, P – role name, q – integer.

Page 11: High-level Data Access Based on Query Rewritings

Interpretation

• Interpretation (the particular instance of KB) is a pair if non-empty domain and an interpretation function :

, , and .• UNA (unique name assumption):

Page 12: High-level Data Access Based on Query Rewritings

OWL 2 QL• UNA is ignored; (in)equality must be defined

explicitly• Language expressive power reduced up to (other designation - ).

• Basic conceptual modeling relations are available: (A)sym, (Ir)Ref, Tran

• Main constraints of :– Functional relations cannot be defined– Particular roles cannot be assigned only to specific concepts, all roles are

applied to all concepts– Disjunction coverage of knowledge domain cannot be defined

Page 13: High-level Data Access Based on Query Rewritings

Query Rewriting Sample• RDB tables: Person(name, age), Lives (person, city), Manages (boss,

employee).• Query: Get the names and ages of all people living in the same city with

their boss.• UCQ:

• Simplified UCQ:

• SQL query: • SELECT P.name, P.age• FROM Person P, Manages M, Lives L1, Lives L2• WHERE P.name=L1.person AND P.name=M.employee AND

M.boss=L2.person AND L1.city=L2.city

Page 14: High-level Data Access Based on Query Rewritings

Query Rewriting Algorithms

• CGLLR (Calvanese et al., 2007)- Applies all suitable TBox axioms to - Replaces axioms containing existential

qualifications with another 3 axioms, which increases the number of UCQ

• RQR (Pérez-Urbina, Horrocks, Motik, 2009)- Generates clauses from TBox assertions and then

resolve clauses with query - Potentially supports more expressive DLs

( )q x

Page 15: High-level Data Access Based on Query Rewritings

Query Rewriting Benchmark• 9 ontologies with axioms,

containing -existential qualification:– Vicodi (V)– Stock exchange (S)– University (U,UX)– Adolena (A,AX)– Synthetic (P1, P5,P5X)

Page 16: High-level Data Access Based on Query Rewritings

Comparison Results

• RQR is more preferable to implement in OBDA-enabled reasoners, than CGLLR:– Generates less UCQ, especially for ontologies with

large number of existential qualifications– May be further optimized and advanced to more

expressive DLs, than

Page 17: High-level Data Access Based on Query Rewritings

Produced Rewritings

Page 18: High-level Data Access Based on Query Rewritings

Running Time, ms

Page 19: High-level Data Access Based on Query Rewritings

Current Work

• Preparing an ontology for a real application – interactive television platform (IPTV) for testing algorithms on real data

• Optimizing RQR – reducing the number of generated clauses

• Main idea – not advance RQR, but support more expressiveness and all OWL 2 QL constructors in powerful mappings

Page 20: High-level Data Access Based on Query Rewritings

References• The Description Logic Handbook: Theory, Implementation and

Applications. Cambridge University Press, 2002. ISBN 0521781760. Edited by F. Baader, D. Calvanese, D. McGuinness, D. Nardi, P. F. Patel-Schneider.

• F. Baader. Logic-Based Knowledge Representation. In M.J. Wooldridge and M. Veloso, editors, Artificial Intelligence Today, Recent Trends and Developments, number 1600 in Lecture Notes in Computer Science, pages 13–41. Springer Verlag, 1999.

• Artale, A.; Calvanese, D.; Kontchakov, R. and Zakharyaschev, M. (2009) The DL-Lite family and relations. Journal of Artificial Intelligence Research 36 (1), pp. 1-69. ISSN 1076-9757.

• H.P´erez-Urbina, I.Horrocks, and B.Motik. Efficient Query Answering for OWL 2. In Proceedings of the 8th International Semantic Web Conference (ISWC2009), Chantilly, Virginia, USA, 2009.

• H.P´erez-Urbina, B.Motik, and I.Horrocks. Tractable Query Answering and Rewriting under Description Logic Constraints. JournalofAppliedLogic, 2009.

Page 21: High-level Data Access Based on Query Rewritings

High-level Data Access Based on Query Rewritings

Questions?


Recommended