+ All Categories
Home > Documents > Nov 2002© Per Flensburg Ontology An introduction and overview.

Nov 2002© Per Flensburg Ontology An introduction and overview.

Date post: 21-Dec-2015
Category:
View: 214 times
Download: 1 times
Share this document with a friend
85
nov 2002 © Per Flensburg Ontology An introduction and overview
Transcript

nov 2002 © Per Flensburg

Ontology

An introduction and overview

nov 2002 © Per Flensburg

The concept of ontology

• Ontology have something to do with meaning• It is used within data bases and artificial

intelligence.• Often called “semantics” since it deals with

meaning of lingusitic expressions• These concepts are here used synonymously.

nov 2002 © Per Flensburg

Sources

• Main source for these slides: Dieter Fensel: Ontologies: Silver Bullet for Knowledge Management and Electronic Commerce

• lnteresting articles about semantics and ontology: http://w3.msi.vxu.se/~per/IVC743/Semantics.html

• There are also reports from some courses in Växjö dealing with various things about ontology

nov 2002 © Per Flensburg

Definition (från AI)

• In the simplest case, an ontology describes a hierarchy of concepts related by subsumption relationships; in more sophisticated cases, suitable axioms are added in order to express other relationships between concepts and to constrain their intended interpretation.

nov 2002 © Per Flensburg

More definitions

• Fensel: shared and common understanding of a domain that can be communicated between people and heterogeneous and widely spread application systems.

• Fensel again: ontologies describe the static domain knowledge of a knowledge-based system.

nov 2002 © Per Flensburg

More about ontologies

• A language for defining ontologies is syntactically and semantically richer than common approaches for databases.

• The information that is described by an ontology consists of semi-structured natural language texts and not tabular information.

• An ontology must be a shared and consensual terminology because it is used for information sharing and exchange.

• An ontology provides a domain theory and not the structure of a data container.

nov 2002 © Per Flensburg

Example -schedule

nov 2002 © Per Flensburg

Schedule - syntax

• Simple table structure• Data base schema

nov 2002 © Per Flensburg

Schedule - instance

• A row in the schedule• Is in fact a representation of a fact.

nov 2002 © Per Flensburg

Schedule - Data Description

The number of the week, according to standard ISO 321-543-432-645.a

The day expressed as weekday, number and month

• Is a meta-meta description in relation to the fact

nov 2002 © Per Flensburg

Ontology for IVC743

• Name: Schedule at VXU• Purpose: Temporal relation between the following entities:

Room, Person and Activity.• Temporal expression: Week, day and time in sep-oct• Room-domain: All lecture rooms at Växjö university with a

capacity of at least 35 persons• Person-domain: PF, RL, Inge Andersson, Olle Dahlborg,

Carina Hallqvist, Bertil Ekdahl, Anna Wingkvist, Gunnar Mosnik, Jonas Richardson

• Activity-domain: {a description of what is going to be dealt with in each lecture}

nov 2002 © Per Flensburg

Purpose

• The ontology can be used in many cases, not only for our schedule

• It says something about the content, not the form• If you see an instance with a value not belonging

to the ontology, you know something is incorrect• If you are familiar with the ontology, no further

explanations is needed in order to understand the meaning.

nov 2002 © Per Flensburg

Course ontology

• This generic ontology has the following syntax:• Name:<string>• Purpose: <Temporal/causal/result…> relation

between <list of entities>• Temporal expression: <ddd dd mmm hh - hh>• Room-domain: <list of lecture rooms>• Person-domain: <list of teachers>• Activity-domain: <list of activities>

nov 2002 © Per Flensburg

Initiatives

• Resource Description Framework (RDF)• Semantic web• XML Schemes, standard for describing the

structure and part of the semantics of data.• XSL, describing mappings between different

presentation sheets.

nov 2002 © Per Flensburg

Intranets and semantics

• In a fast changing world knowledge becomes increasingly important

• Maintaining and accessing knowledge (organisational memory) is thus important.

• The knowledge is often weakly structured, stored in intranets and in different formats (picture, sound etc.)

• Knowledge management, which turn information into useful knowledge is thus heavily needed.

Knowledge = Content

nov 2002 © Per Flensburg

Document management

• Key-word based retrieval provides lots of irrelevant information out of context.

• Extracting information requires human attention, both for extracting and integrating

• Maintaining weakly structured sources is time-consuming

nov 2002 © Per Flensburg

Semantic possibilities

• Search for content, not key-words• Query answering instead of information retrieval• Correct exchange of structured or semi-

structured information via for instance XSL.• Define view on documents or sets of

documents, information fusion

nov 2002 © Per Flensburg

• Agents that find the best shopping opportunity.

B2C-siteB2C-site

Example: Shop-bots

B2C-siteB2C-site

Shop-bot

Wrapper

nov 2002 © Per Flensburg

Problems with current shop-bots

• A wrapper is needed for each place and type of bot.

• No flexibility in retrieving the information• Information at the B2C-site must be provided in a

structured form.• Usually this information is provided in natural

language also which inevitably will cause inconsistency problems

nov 2002 © Per Flensburg

Solution

• Using various XML-techniques provides better possibilities for translation between the bot and the site.

• However, they must share the same ontology.• An ontology describes the various products and

can be used to navigate and search automatically for the required information.

nov 2002 © Per Flensburg

Electronic commerce B2B

• Standard techniques, such as EDIFACT cumbersome and error-prone and not integrated with other documents.

• The XML-family of techniques can be used for describing syntax and semantics of data, but not for the business processes and for the products.

• Standard ontologies in combination with XLS-based translation services is thus needed.

nov 2002 © Per Flensburg

About ontologies

A brief introduction

nov 2002 © Per Flensburg

Definition

• An ontology is a formal, explicit specification of a shared conceptualisation.

• A conceptualisation refers to an abstract model of some phenomenon in the world which identifies the relevant concepts of that phenomenon.

• Explicit means that the type of concepts used and the constraints on their use are explicitly defined.

• Formal refers to the fact that the ontology should be machine readable.

nov 2002 © Per Flensburg

Role of an ontology

• Facilitate the construction of a domain model by providing a vocabulary of terms and relations.

• Still the problem of translating between different ontologies persist.

• Also when you go outside the target for the ontology you are lost. The ontology might conserve a certain way of thinking.

nov 2002 © Per Flensburg

Types of ontologies

• Domain ontologies capture the knowledge valid for a particular type of domain

• Metadata ontologies like Dublin Core provide a vocabulary for describing the content of on-line information sources (Libraries).

• Generic or common sense ontologies aim at capturing general knowledge about the world, providing basic notions and concepts for things like time, space, state, event etc.

nov 2002 © Per Flensburg

More types

• Representational ontologies provide representational entities without stating what should be represented. A well-known representational ontology is the Frame Ontology which defines concepts such as frames, slots, and slot constraints allowing the expression of knowledge in an object-oriented or frame-based way.

• Method and task ontologies provide a reasoning point of view on domain knowledge such as hypothesis, cause-effect statements etc.

nov 2002 © Per Flensburg

Constructing ontologies

• Prerequisite: ontologies are small modules with a high internal coherence and a limited amount of interaction between the modules.

• Constructing a new ontology is a matter of assembling existing ones.• Inclusion • Restriction• Polymorphic refinement

nov 2002 © Per Flensburg

Formal languages

• Various kind of formal languages are used for representing ontologies, among others• Description logics• Frame Logic• First-order predicate logic extended with meta-

capabilities to reason about relations.

nov 2002 © Per Flensburg

The Sensus system

• The basic idea is to use so-called seed elements which represent the most important domain concepts for identifying the relevant parts of a toplevel ontology.

• The selected parts are then used as starting points for extending the ontology with further domain specific concepts.

nov 2002 © Per Flensburg

Word-Net

• An on-line lexical reference system. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different semantic relations link the synonym sets.

• WordNet contains around 100.000 word meanings organized in a taxonomy.

• http://www.cogsci.princeton.edu/~wn/

nov 2002 © Per Flensburg

Semantical relationships

• Synonymy: Similarity in meaning of words.• Antonymy: Dichotomy in meaning of words• Hyponymy: Is-a relationship between concepts.

This is-a hierarchy ensures the inheritance of properties from superconcepts to subconcepts.

• Meronymy: Part-of relationship between concepts.

• Morphological relations which are used to reduce word forms.

nov 2002 © Per Flensburg

Features of Word-Net

• Free of charge• Multilingual European version also exists (

http://www.let.uva.nl/~ewn )

• Its large size (i.e., number of concepts)

• Its domain-independence

• Its low level of formalization

• The definitions are vague and limits the possibility for automatic reasoning support

nov 2002 © Per Flensburg

Example

nov 2002 © Per Flensburg

Example (con’t)

nov 2002 © Per Flensburg

CYC http://www.cyc.com/

• Comes from AI• Humans decide based on their common sense

knowledge what to learn and what not to learn from their observations.

• CYC started as an approach to formalize this knowledge and provide it with a formal and executable semantics.

• Hundreds of thousands of concepts have been formalized with millions of logical axioms, rules, and other assertions.

nov 2002 © Per Flensburg

Features of CYC

• The upper-level ontology of CYC with 3000 concepts has been made publicly available.

• Most of the more specific concepts are kept secret• CYC groups concepts into microtheories to structure the

overall ontology. They are a means to express context dependency i.e., what is right in one context may be wrong in another

• CycL, a variant of predicate logic, is used as language for expressing these theories.

nov 2002 © Per Flensburg

nov 2002 © Per Flensburg

TOVE (TOronto Virtual Enterprise)

• Task and domain-specific ontology. • The ontology supports enterprise integration,

providing a shareable representation of knowledge in a generic, reusable data model

• TOVE provides a reusable representation (i.e., ontology) of industrial concepts.

• http://www.eil.utoronto.ca/tove/toveont.html

nov 2002 © Per Flensburg

From home-page

nov 2002 © Per Flensburg

Next picture

nov 2002 © Per Flensburg

Characteristics

• It provides a shared terminology for the enterprise that each agent can jointly understand and use

• It defines the meaning of each term in precise and unambiguous manner as possible

• It implements the semantics in a set of axioms that will enable TOVE to automatically deduce the answer to many “common sense” questions about the enterprise

• It defines a symbology for depicting a term or the concept constructed thereof in a graphical context

nov 2002 © Per Flensburg

(KA)2 – a case study on

• Knowledge Annotation Initiative of Knowledge Acquisition Community

• http://www.aifb.uni-karlsruhe.de/WBS/broker/KA2.html • The process of developing an ontology for a

heterogeneous and world-wide (research) community• The use of the ontology for providing semantic access to

on-line information sources of this community.

nov 2002 © Per Flensburg

Example in (KA)2

• Class: research-topic• Attributes:• Name: <string>• Description: <text>• Approaches: <set-of keyword>• Research-groups: <set-of research-

group>• Researchers: <set-of researcher>• Related-topics: <set-of research-

topic>• Subtopics: <set-of research-topic>• Events: <set-of events>• Journals: <set-of journal>

•Projects: <set-of project>•Application-areas: <text>•Products: <set-of product>•Bibliographies: <set-of HTML-link>•Mailing-lists: <set-of mailing-list>•Webpages: <set-of HTML-link>•International-funding-agencies: <funding-agency>•National-funding-agencies: <funding-agency>•Author-of-ontology: <set-of researcher>•Date-of-last-modification: <date>

nov 2002 © Per Flensburg

Procedure

• A lot of instances of the schema was developed and published on the home page

• Examples:• specification languages• knowledge acquisition methodologies• agent-oriented approaches• knowledge acquisition from natural language• knowledge management

nov 2002 © Per Flensburg

Knowledge management

An application

nov 2002 © Per Flensburg

Knowledge management deals with

• Acquiring • Maintaining• Accessing knowledge of an organization.• Here we will apply it to internet and concentrate

on the last issue: Search for knowledge.

nov 2002 © Per Flensburg

Search engines

• They have typically three parts: A webcrawler for downloading, an indexer for finding key terms and a query interface that retrieves answers to the proposed questions.

• They are all based on keywords. The indexing process of the web-pages is thus crucial for the retrieval.

nov 2002 © Per Flensburg

Search domain

• Consists of about 300 millions fix documents, but this is only about 20% of what is available in total. The rest (80%) is dynamically generated (example: Aftonbladet)

• Altavista provides it all, Google sort according to documents pointing at the actual document and Yahoo uses human invention.

nov 2002 © Per Flensburg

Dimensions in searching

• Precision: how many retrieved documents are really relevant?

• Recall: have I found all relevant information?• Time: for the humans to find the desired

information among the retrieved.• Scattering:The information might be scattered

over several pages with only implicit relations between them

nov 2002 © Per Flensburg

Ontobroker

• Define an ontology• Use it to annotate/structure/wrap your web

documents• Somebody else can make use of Ontobroker’s

advanced query and inference services to consult your knowledge.

• To achieve this goal, Ontobroker provides three interleaved languages and two tools.

nov 2002 © Per Flensburg

Languages

• It provides a representation language formulating ontologies.

• A subset of it is used to formulate queries, i.e. to define the query language.

• An annotation language is offered to enable knowledge providers to enrich web documents with ontological information.

nov 2002 © Per Flensburg

Representation languageR

epre

sent

atio

n la

ngua

ge

nov 2002 © Per Flensburg

Some definitions

• Class definition: c[]• defines a class with name c.

• Attribute definition: c[a=>> {c1,...,cn}]• implies that the attribute a can applied to the elements of c

and an attribute value must be member of all classes c1,...,cn.

• Is-a relationship: c1:: c2

• defines c1 as a subclass of c2 which implies that:• all elements of c1 are also elements of c2

• all attributes and their value restrictions defined for c2 are also defined for c1, and

• multiple attribute inheritance exists

nov 2002 © Per Flensburg

More definitions

• Is-element-of relationship: e : c• defines e as an element of the class c.

• Rules like• FORALL x,y x[a ->> y] <- y[a ->> x].

• If a is an attribute for x it is also an attribute for y

• FORALL x,y x:c1[a1 ->> y] <-> y:c2[a2 ->> x].• The common set of attributes for x and y

nov 2002 © Per Flensburg

Annotation language

• Ontobroker provides an annotation language called HTMLA

• The following HTML page states that the text string „Richard Benjamins“ is the name of a researcher where the URL of his homepage is used as his object id.

• <html><body><a onto="page:Researcher"> <h2>Welcome to my homepge</h2>

• My name is <a onto="[name=body]">Richard Benjamins</a>. </body></html>

• Cf XML!

nov 2002 © Per Flensburg

Query language

• The query language is defined as a subset of the representation language. The elementary expression is:

• X ∈ c Λ attribute(x) = v• written in Frame logic:• x[attribute -> v] : c

nov 2002 © Per Flensburg

Inference engine

nov 2002 © Per Flensburg

Summary

• Ontobroker only recognises pages that are annotated according to its rules.

• Principally you still have a database and a database schema in the bottom

• What can’t be expressed in first order predicat logic can’t be expressed.

nov 2002 © Per Flensburg

New idea (On2broker)

The web

Info agentData base

Fact retrieval

Ontologies

XML, RDF etc.Query engine

nov 2002 © Per Flensburg

Cf, Skogsresurs

Concept base Search agent URL

ClassificationURL-databaseProfile

Interesting linksUser

www.skogsresurs.com

nov 2002 © Per Flensburg

Comments

• Facts are defined by the logic of the ontologies • They are retrieved according to logical rules• They are stored into formal databases• Thus only facts possible to express in first order

predicat logic is possible to retrieve.• Also On2broker has severe efficiency problems

if the number of facts extend 100 000

nov 2002 © Per Flensburg

IBROW: Dynamic reasoning

• Brokering dynamic reasoning services in WWW• http://www.swi.psy.uva.nl/projects/IBROW3/home.html • It will access libraries in the Internet, search for

appropriate inference services, verify their requirements, request additional information from the customer if needed, adapt the inference services to the particular domain knowledge, plug them together, and execute them via CORBA.

• Therefore, the user no longer buys, downloads and installs software. Instead he uses it as a computational service provided via the network

nov 2002 © Per Flensburg

Use of IBROW

• In a business-to-business (B2B) context, IBROW technology can be used to construct half products, which then need further processing by industries before delivering end products to consumers. For example, a car manufacturer could be interested in a service that helps him to develop and/or adapt a new car design.

• In another scenario, the IBROW broker provides a service to configure the bare bones of a knowledge system, which then needs to be refined for end consumers based on their particular needs.

nov 2002 © Per Flensburg

Other use

• Yet another model would use IBROW technology to provide an underlying infrastructure to support knowledge engineers in selecting, testing, adapting, refining, and combining generic components into concrete systems.

nov 2002 © Per Flensburg

Electronic Commerce

An application are for ontologies

nov 2002 © Per Flensburg

Types of products

• Intelligent information search agents (i.e., shopping agents) that help customers to find products.

• Intelligent information providers (i.e., on-line stores) that help vendors to present their goods in appropriate manner.

• Intelligent information brokers (i.e., on-line market places) that mediate between buyers and vendors.

nov 2002 © Per Flensburg

Shopbots

Client (Browser)

Shopbot

Wrapper 1

On-line store 1

Wrapper 2

On-line store 2

Wrapper 3

On-line store 3

nov 2002 © Per Flensburg

Examples

• Bookblvd (http://www.bookblvd.com/ ), • Bottom Dollar (http://www.bottomdollar.com/ ),• Buyer’s Index (http://www.buyersindex.com/ ), • CompareNet (http://www.compare.net/ ),• Dealpilot (http://www.dealpilot.com/ ), • Jango (http://www.jango.com/ ), • Junglee (http://www.junglee.com/)• MyShop (http://www.myshop.de ), • Shopfind (http://www.shopfind.com/ )• Shopper (http://www.shopper.com ).

nov 2002 © Per Flensburg

Why shopbots fail

• Web users do not want to pay because they are used to free service.

• Product providers do not want to fund the agent because of its ability to always find the cheapest source.

• Product providers would fund the agent if it manipulated the search results. This would eliminate objectivity however which is a requirement for high acceptance.

• In the end, most of them were bought by Internet portals

I think Fensel principally has right, but in practice shop-bots have become a tremendous succes!

I think Fensel principally has right, but in practice shop-bots have become a tremendous succes!

nov 2002 © Per Flensburg

Other types of bots

• Lifestyle finder recommends documents matching your interests based on your answers to a set of questions.

• Alexa: watches all its users and anticipates their preferences. An “intelligent” proxy which caches the pages a user will visit.

• Firefly asks for ratings of specific musical artists, correlates each user with others who share their tastes, and recommends songs or albums which their cohorts have rated highly.

nov 2002 © Per Flensburg

B2B

• 1:1, negotiation, often EDIFACT-based

• 1:N, A large company dictates, often EDIFACT

• N:M, a fragmented marketplace

nov 2002 © Per Flensburg

The fragmented marketplace

• Replace a trade agent• Bring seller and buyer together on global base• Require means for translation and

representation• Require means for content description

nov 2002 © Per Flensburg

Translation

nov 2002 © Per Flensburg

Representation and translation

• RDF provides a standard for describing semantics

• XML provides a standard for describing the structure of a document

• XML schema provides a standard for describing the semantics

• XSL provides a standard for describing mappings between terminologies

• But none provides a standard vocabulary

nov 2002 © Per Flensburg

Ontologies in B2B

• Corresponds to standardised product catalogues. Attempts:• Common business library (http://www.commerce.net

) • Commerce XML (http://www.oasis-open.org )• Dublin Core (http://www.indecs.org )• Rosettanet (http://www.rosettanet.org/ )• Etc.

nov 2002 © Per Flensburg

Branch portals

• Chemdex (www.chemdex.com)• Life science products trading

• PaperExchange (www.paperexchange.com)• Paper industry spot market

• VerticalNet (www.verticalnet.com)• Generic portal

nov 2002 © Per Flensburg

VerticalNet

Seller ontology 1

Seller ontology 2

Seller ontology 3

Buyer ontology 1

Buyer ontology 2

Buyer ontology 3

Tran

sfor

mat

ion Visualisation

Standardontology

Ontologymapping

nov 2002 © Per Flensburg

To sum it all up

Putting it all together

nov 2002 © Per Flensburg

The techniques

• XML: Describing the structure of a document, cf. defining layout in a database

• DTD: • Element declaration that define composed tags and value

ranges for elementary tags• Attribute declaration that define attributes of tags• Entity declaration

• XSL: Describes how to render the document and the processing of the parts there, cf. calculating fields in a database. Can be used for translation

nov 2002 © Per Flensburg

The role of XSL

XML

DTD1

XML

DTD2

XSL

nov 2002 © Per Flensburg

A document

XML

DTD

XSL

Name: Value on name

Part: Value on part

Quantity: Value on quant

Price: Value on price

Part: Value on part

Quantity: Value on quant

Price: Value on price

Total price: Value on total

Adress: Value on adr

Ontololgy

nov 2002 © Per Flensburg

XML and semantics

• XML describe documents in tree format• Thus the overlaying context is transferred to the

underlying branches• This is the only semantics that exists.

nov 2002 © Per Flensburg

RDF

• A subject is an entity that can be referred to by a address in the WWW (i.e., by an URI).

• A predicate defines a binary relation between resources and/or atomic values provided by primitive data type definitions in XML.

• An object specifies for a subject a value for a predicate. That is, objects provide the actual characterizations of the Web documents.

nov 2002 © Per Flensburg

Example of RDF

• Author(http://www.msi.vxu.se/~per/) = X• Name(X) = Per Flensburg• Email(X) = [email protected]• Claim(Anna)=(affiliation(Author(

(http://www.msi.vxu.se/~per/))=Växjö university)

nov 2002 © Per Flensburg

Fine


Recommended