+ All Categories
Home > Documents > Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by:...

Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by:...

Date post: 15-Jan-2016
Category:
View: 213 times
Download: 0 times
Share this document with a friend
Popular Tags:
36
Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF
Transcript
Page 1: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Query Formulation as an Information Retrieval Problem

Presented by: Waleed Albattah

Supervised by: Dr. Mourad YKHLEF

Page 2: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Outlines

1. Introduction

2. An Architecture for Information System Exploration

I. Stratfied Hypermedia Architecture

II. A Formalisation of Stratfied Hypermedia Architecture

III. Exploring an Information Structure

3. The Hyperindex Layer

I. Generalised ORM Schemas

II. Molecules in the Index View

4. The Hyperbase Layer

5. Query By Construction

Page 3: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction

Most orgs use some automated information system large body of vital corporate information is stored in

these ISs Function of ISs is the support of disclosure of this

information Without adequate information disclosure avenues

an information system becomes worthless Most ISs do not provide any support for the users in

their quest for information

Page 4: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction cont.

Conceptual schemata of real-life applications tend to be quite large and complicated

As a result, users may easily become `lost in conceptual space' and they may retrieving irrelevant (or even wrong) objects and may miss out on relevant objects

Retrieving irrelevant objects leads to a low precision

missing relevant objects has a negative impact on the recall

Page 5: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction cont.

The disclosure of information stored in an IS has some clear parallels to the disclosure problems encountered in document retrieval systems

The information retrieval paradigm for document retrieval systems is, in our opinion, directly applicable to traditional information systems

Page 6: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction cont.

The Information Retrieval paradigm

Page 7: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction cont.

The characterisation of objects in an IS is directly provided by the reference schemes of the object types

disclosure is driven by a process of matching In document retrieval applications this matching

process tends to be rather complex Furthermore, the characterisation of documents is

known to be a hard problem

Page 8: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction cont.

In information systems the matching process is less complex

quest from users to fulfill their information need can be aptly described by:

I don't know what I'm looking for, but I'll know when I find it.

Page 9: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction cont.

Query formulation process (both for a document retrieval system, and an information system) consist of four phases:

1. The explorative phase What information is there how is it related what does it mean?

Page 10: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Introduction cont.

2. The constructive phase Using the results of phase 1 the actual query is formulated

3. The feedback phase result from phase 2 may not be completely satisfactory In this case, phases 1 and 2 need to be re-done and the

result refined

4. The presentation phase In most cases, result needs to be incorporated into a report This means that results must be grouped or aggregated in

some form.

Page 11: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

2 .An Architecture for Information System Exploration

2.1 Stratified Hypermedia Architecture in its simplest form, is a two level hypermedia

architecture descriptive layer (the hyperindex) indexing the lower layer (the hyperbase)

hyperbase contains actual information, whereas hyperindex only provides characterization of the stored information

Page 12: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.
Page 13: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Example navigation session

Architecture cont.

Page 14: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

2.2 A Formalisation of Stratied Hypermedia Architecture

a network of layers that are related by characterisation relations.

A layer is introduced as a structure (F;N;G;V) where: F is the set of (information) fragments, called the

fragment base Jesus, resurrection, . . . , proclamation of resurrection, . . . , proclamation of resurrection of Jesus by disciples

Page 15: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

N is the set of presentations (or nodes), referred to as the node base

G is a structure (E; P), where E is a set of syntactic categories, and P a set of production rules.

V is a set of views, called the mask.

Page 16: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

2 kinds of navigations between molecules structural navigation associative navigation

2 parse trees of the hyperbase layer (B1;B2) 2 parse trees of the hyperindex layer (I1; I2)

Page 17: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

Page 18: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

2.3 Exploring an Information Structure

The structure of the presidential database

Page 19: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

quest for a president who is married with a politician

The starting node of the hyperindex

Page 20: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Focus on marriage

Page 21: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

Page 22: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

Page 23: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

Page 24: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Architecture cont.

Page 25: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

3 .The Hyperindex Layer

3.1 Generalised ORM (Object-Role Modeling) Schemas A conceptual schema is build around a set of

object types O fact types (relationship types) F Each fact type f consists of a set of roles from P fact types in F should form a partition of the roles in P

All roles have a base provided by the function Base :

Page 26: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Hyperindex cont.

Page 27: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Hyperindex cont.

The hyperindex for a conceptual schema will contain only a single view.

A view is formally introduced as a structure:

The starting point of this view is S1 E1 The moleculesM1 are formed by the set of

linear path expressions augmented

3.2 Molecules in the Index View

Addition slide

Page 28: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Hyperindex cont.

The actual structure W1 is a subset of

M1* M1. Let x be an object type, and P be a fragment of a

path expression, then this set is identified by the following kinds of structural links:

1. a link from the empty path expression to any molecule x

2. a link from a molecule P x to a molecule

3.2 Molecules in the Index View

Addition slide

Page 29: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

3.2 Molecules in the Index View

Hyperindex cont.

Page 30: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Hyperindex cont.

we have the following kinds of associative links

1. a link from a molecule of the form P x to a molecule P y if x ~ y capturing the identification hierarchy

2. a link from molecule P to molecule Rev(P) if P ≠ Rev(P) catering for the reversal of path expressions.

3.2 Molecules in the Index View

Addition slide

Page 31: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Hyperindex cont.

The reversal of a path expressions by Rev is recursively defined as:

An example of such a reversal is:

3.2 Molecules in the Index View

Addition slide

Page 32: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Hyperindex cont.3.2 Molecules in the Index

View

Page 33: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

4. The Hyperbase Layer

The translation of instantiations into a hyperbase is also carried out bottom-up

the fragment base is defined first Followed by the node base, the schema and

the views respectively

the hyperbase layer is quite similar to the hyperindex layer

Page 34: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

5 .Query By Construction

Page 35: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

QBC cont.

Page 36: Query Formulation as an Information Retrieval Problem Presented by: Waleed Albattah Supervised by: Dr. Mourad YKHLEF.

Thanks

Questions?


Recommended