+ All Categories
Home > Documents > Sematic and Social Web

Sematic and Social Web

Date post: 10-Apr-2018
Category:
Upload: minh-pham
View: 218 times
Download: 0 times
Share this document with a friend

of 54

Transcript
  • 8/8/2019 Sematic and Social Web

    1/54

    User Model User-Adap Inter (2009) 19:433486

    DOI 10.1007/s11257-009-9067-3

    ORIGINAL PAPER

    Adaptive systems in the era of the semantic and social

    web, a survey

    Ilaria Torre

    Received: 26 May 2008 / Accepted in revised form: 6 May 2009 /

    Published online: 3 November 2009 Springer Science+Business Media B.V. 2009

    Abstract In this paper we provide a classification of adaptive systems with respect

    to the kind of semantic technology they exploit to accomplish or improve specific adap-

    tation and user modeling tasks. This classification is based on a distinction between

    strong semantic techniques and weak semantic techniques. The former are techniques

    based on the Semantic Web, while the latter regard technologies that, in different ways,

    annotate resources, enriching their meaning. This second category includes, in par-

    ticular, Web 2.0 social annotations and mixed approaches between social annotationsand Semantic Web techniques. While the impact of the Semantic Web on adaptive

    systems has been discussed in several survey papers, the potential of weak semantic

    technologies has, so far, received little attention. The aim of this analysis is to fill this

    gap. Therefore, we will discuss contributions and limits of both approaches, but we

    will focus special attention on weak semantic adaptive systems.

    Keywords Semantic web Web 2.0 Tag Annotation User model

    Adaptation Semantic adaptive systems Review

    1 Introduction

    Recently, several efforts have been made to enrich Web objects with annotations,

    with the aim of improving the retrieval and exchange of resources. Currently, the

    main approaches to add annotations are related to ontologies and so-called folksono-

    mies (see the definition in Appendix A, at the end of the paper), which have been

    defined effectively as two new players in indexing and knowledge representation

    (Weller 2007). The former approach is based on the principles of the Semantic Web,

    I. Torre (B)

    Dipartimento di Informatica, Universit di Torino, Corso Svizzera 185, 10149 Torino, Italy

    e-mail: [email protected]

    123

  • 8/8/2019 Sematic and Social Web

    2/54

    434 I. Torre

    which provides a common framework to allow data to be shared and reused across

    applications, using formal descriptions of concepts and of their relationships. Follow-

    ing this approach, resources are annotated with concepts from ontologies that describe

    a knowledge domain. Conversely, the other approach is related to Web 2.0, which con-

    ceives the Web as a social and collaborative platform: a Social Web, where users havean active role in generating contents and annotating resources, typically using tags

    that, altogether, form the folksonomy of a knowledge domain. In the middle, mixed

    approaches and new standards, such as microformats, are emerging to bridge the gap

    or to find methods to exploit both Semantic Web and Web 2.0 techniques.1

    In this decade, several researchers in the community of Adaptive Systems tried

    to combine semantic techniques with personalization and user modeling techniques.

    Semantic technologies can be useful for allowing personalization techniques to be

    applied to open corpora of resources, to enhance the possibility of providing explana-

    tions and of exchanging information between adaptive systems, such as informationabout user models.

    In this paper we provide an analysis of these lines of research, in order to investigate

    the value that semantic techniques can add to adaptive and user modeling systems. In

    particular, we investigate whether it is possible to identify tasks and areas of adaptive

    systems where these techniques proved to be particularly successful and if the integra-

    tion of these approaches can be a third way that gives the benefits of both of them. To

    date, there have been several reviews about the contribution of ontologies and Seman-

    tic Web to adaptive systems (see for example Dolog and Nejdl 2007; Antoniou et al.

    2004; Baldoni et al. 2005; Abel et al. 2007a; Kravcik and Gasevic 2007). However,to the best of our knowledge, there has been no survey that deals with the use of Web

    2.0 social annotations and mixed approaches in adaptive and user modeling systems.

    The main contribution of this paper is to fill this gap by providing an overview of how

    adaptive systems can benefit from both approaches.

    The methodology we adopted can be summarized as follows:

    (1) analyzing and classifying the main semantic techniques and approaches related

    to Web 2.0 social annotations and the Semantic Web,

    (2) decomposing the adaptive systems functionality into a set oftasks and sub-tasks,

    (3) selecting a set ofsources to discover significant semantic adaptive systems and,consequently, identifying some semantic adaptive systems useful for the analysis

    (in the following we will call them SASs),

    (4) analyzing and classifying each SAS from two points of view: the semantic tech-

    nique it exploits and the adaptation and user-modeling task for which the semantic

    technique has been used.

    The result has been the definition of a matrix which contains the SAS distribution

    with respect to these two parameters.

    The structure of the paper follows the methodological steps mentioned above. Sec-

    tion 2 describes the semantic technologies considered for the review and introduces the

    1 The new concept of Web 3.0 should go in this direction. Though different definitions of Web 3.0 exist,

    this term is being used frequently to mean the next generation of intelligent and social Web (see for example

    the workshop on Web 3.0 at the 2009 ACM Hypertext Conference).

    123

  • 8/8/2019 Sematic and Social Web

    3/54

    Adaptive systems in the era of the semantic and social web 435

    dichotomy weak/strong semantic web that will be the basis for the analysis. Section 3

    presents a functional model of tasks and sub-tasks for SASs and Sect. 4 presents the

    matrix which classifies the reviewed systems on the basis of the semantic technique

    that was used (x-axis) and the task it was used for (y-axis). The remaining sections

    focus on the analysis of the specific tasks and each section is divided in two subsec-tions which describe how weak and strong semantic techniques are used in a set of

    representative systems. In detail, Sect. 5 concerns domain modeling and management,

    Sect. 6 concerns user and context modeling and management, Sect. 7 concerns adap-

    tation and personalization and Sect. 8 concerns privacy and controllability. Finally,

    Sect. 9 discusses the general results of the analysis and Sect. 10 presents some con-

    cluding remarks. Two appendices complete the paper: Appendix A describes the

    main semantic technologies considered in the analysis and Appendix B provides a

    referenced list of the reviewed systems.

    2 Semantic technologies

    According to the first step of the methodology sketched above, in this section we

    introduce the main approaches and technologies that are relevant for this analysis. A

    detailed description of languages and tools is provided in Appendix A.

    2.1 Semantic web and web 2.0

    As known, the objective of the Semantic Web (Berners-Lee 1998; Berners-Lee et al.

    2006) is to describe Web resources in a way that allows machines to understand and

    process them. The main objectives of the Semantic Web can be summarized as follows:

    making information automatically available to software agents, integrating contents

    from different sources, optimizing search and enabling trust on the Web. It implies

    the adoption of a completely new paradigm of data description and retrieval. Each

    content in Web documents has to be formally described, that is annotated, making

    reference to concepts defined in public ontologies, using formal languages. Intelli-

    gent agents are necessary to reason over ontologies and automate the tasks above.Moreover, rules can be defined to complement and extend ontologies, so as to draw

    inferences, express constraints, specify policies, react to events/changes, discover new

    knowledge, transform data, etc.2

    In adaptive and user modeling systems, and also in other domains, the technologies

    most often used are languages and tools for building and querying ontologies. Rules

    and polices for trust have been used less and represent areas for future development

    of the Semantic Web.

    In Web 2.0, the other approach we take into account in this analysis, resources are

    annotated by users without using formal languages. Typically, they are added in theform of tags. Annotations can be added to Web objects such as bookmarks, photos,

    documents and blog posts, but also to Web users. For example, users can specify their

    2 http://2005.ruleml.org/.

    123

    http://2005.ruleml.org/http://2005.ruleml.org/http://2005.ruleml.org/
  • 8/8/2019 Sematic and Social Web

    4/54

    436 I. Torre

    level of trust in other users and the type of relationship with other users and they

    can also tag themselves and other users. Social software platforms at the heart of Web

    2.0 include Weblogs, Wiki tools and Collaborative Tagging Systems. In general terms,

    they are technologies to make information sharing and collaboration on the Web easier,

    but, in our analysis, we take them into account specifically for their ability to involvepeople in annotating Web resources according to their specific point of view. While

    Weblogs and Wiki tools allow users to enter unstructured or semi-structured contents

    (specific microformats have been defined to structure blog contents), Collaborative

    Tagging Systems (Marlow et al. 2006; Golder and Huberman 2006) require users to

    add annotations in the form of keywords. Folksonomies (Vander Wal 2004), which

    derive from this process, can be defined as distributed classification systems without

    a predefined structure, and therefore specifically suitable for unstable and evolution-

    ary information spaces. Moreover, by allowing an element to be placed in different

    categories, they produce overlapping clusters, in contrast to the hierarchical structureof taxonomies.

    If the Semantic Webs main objective is to make Web resources machine under-

    standable and processable, with the ultimate goal of providing users with improved

    services, the Web 2.0 approach is directly focused on users, without the intermedia-

    tion of intelligent agents, though software agents can then be exploited to provide or

    improve some specific services. The significant phenomenon, specifically relevant to

    our analysis, is that in tagging systems, the annotations the user posts progressively

    give rise to a description of the tagged resource that reflects its several facets.3 Ana-

    lyzing the folksonomy growth patterns of Delicious,4

    Golder and Huberman (2006)observe that, after a relatively small number of bookmarks for the same resource (about

    100), a nascent consensus seems to form in the tags entered by users to classify the

    bookmark. Although other studies provide other results and show high percentages of

    errors in tagging there is a consensus that tagging systems can classify resources in

    a way that allows to represent their multiple facets but also to express their prevalent

    meaning (though this meaning is not universal but very dependent on the community

    using the tagging system).5 Algorithms have also been developed to deal with prob-

    lems such as polysemy, synonymy, tags grammatical variations, etc. (e.g., Dix et al.

    2006).

    The peculiarity seen above makes Web 2.0 social annotations and Semantic Web-

    based annotations more similar in their ability to express the meaning of resources.

    However this similarity can be seen better as a complementarity: social annotations

    are mainly useful for information retrieval in changing and instable domains. Some

    authors specify they are suited to informal domains (e.g., MacGregor and McCulloch

    2006), for example to recreational research, personal information management and

    serendipity-based exploration prior to formal exploration. Semantic Web-based

    3 Shirky (2005) states that tagging systems allow knowledge to be fully and faithfully represented, sincethe categorization of terms is not conditioned by just one point of view.

    4 Delicious.com is a tagging system that lets users to tag bookmarks and savethemonline so that bookmarks

    can be classified under different categories/tags and retrieved everywhere.

    5 Golder and Huberman (2006), for example, explain that the achieved consensus on tags could be an effect

    of imitation and shared background of taggers.

    123

    http://delicious.com/http://delicious.com/
  • 8/8/2019 Sematic and Social Web

    5/54

    Adaptive systems in the era of the semantic and social web 437

    annotations, on the contrary, are useful for data exchange, integration and reuse, and

    are particularly powerful in specific and stable domains. Three main problems limit

    their power: the cost and difficulty of development (e.g., ontology definition, descrip-

    tion of contents as RDF statements, etc.), which is especially high for pre-existing

    systems, the difficulty of ontology alignment and matching, necessary to make anno-tations processable by systems that use different ontologies and the complexity to find

    a usabilityreusability trade-off in the definition of ontology concepts that will be used

    to annotate resources6.

    2.2 Middle-ground and mixed approaches

    Several authors have proposed and discussed intermediate solutions to the problems

    described above. Despite their differences, many of them are based on some formof integration between Semantic Web and Web 2.0 techniques. According to Gruber

    (2007), the dichotomy between Semantic Web and Social Web is false, and he states

    that there is increasing value in applying Semantic Web technologies to the data of the

    Social Web. Other mixed approaches propose the incorporation of the social context

    into the representation of ontologies (Mika 2007), the transformation of social anno-

    tations into RDF triples (e.g., Abel et al. 2007c), ontology learning from folksonomies

    (e.g., Specia and Motta 2007), etc.

    We conclude this section by mentioning two technologies that could be defined

    middle-ground standards: microformats and RDFa. Both of them are based on the

    idea of using XHTML attributes to add meaning to resources. RDFa is the W3C rec-

    ommendation for embedding RDF semantics into Web documents. Mircoformats are

    focused on re-using existing standards to represent data in different domains (e.g.,

    contact information, geographical coordinates, calendar events, etc.). Even though

    they are not quite as powerful, expressive and extensible as RDF or OWL, they are

    a way to enable data items to be indexed, searched, cross-referenced and automati-

    cally exported (Khare and elik 2006). This evolutionary or, better, non-revolutionary

    approach to semantization of the Web has been defined by elik and Marks (2004)

    as lowercase semantic web to distinguish it from the uppercase Semantic Web, which

    requires revolutionary changes of information representation and formalization.

    As we have seen, there are various ways to the semantization of the current Web.

    Following Searles definition of weak versus strong AI (Searle 1980), which distin-

    guishes systems that are intelligent only in their output (i.e., act intelligently) from

    those systems that are intelligent also in the modalities of reasoning (i.e., which have

    a mind), we could say that there is a weak semantic web and a strong semantic web.

    The former has the goal of adding meaning to resources to improve services to users,

    independently of the kind of semantic annotations (folksonomies, microformats and

    mixed approaches) used and of the kind of techniques used to obtain them. The latter

    aims at data understanding by machines to improve services to users. The former isless extensible and powerful but more effective and short term applicable. The latter, in

    6 The usabilityreusability problem states that if concepts are very abstract and general they will be hardly

    used in real contexts while if they are very specific, the reusability is limited ( Beys et al. 1996).

    123

  • 8/8/2019 Sematic and Social Web

    6/54

    438 I. Torre

    contrast, is more powerful and not limited in knowledge representation, but typically

    it is expensive and complex to develop, especially for the actual Web, where it risks

    to only be powerful in limited domains.

    If we place weak and strong semantic techniques on an axis, we can represent them

    as a discontinuous-continuum from weak to strong semantic technologies, as on thex -axis in Fig. 2. In the following, we will use this weak/strong dichotomy to ana-

    lyze the adaptive systems that use techniques based on the weak and strong semantic

    web. We will call these systems weak and strong semantic adaptive systems. As a

    typographical convention for this paper, with the expression semantic web, in the

    lowercase format and without adjectives, we refer to both weak and strong semantic

    web, while the uppercase format Semantic Web will be used with its traditional

    meaning.

    3 A functional model of tasks and sub-tasks for SASs

    The goal of this study is to analyze how semantic techniques are used, and can be

    used, to accomplish the different tasks and sub-tasks in semantic adaptive systems.

    Therefore, according to the methodology sketched in the introduction, the next step is

    to decompose the functionality of adaptive systems into a set of tasks and sub-tasks.

    To this aim, we started from the classic papers of Brusilovsky (1996) and Kobsa et al.

    (2001) that analyze tasks, methods and techniques of adaptive hypermedia systems.Both of them identify three tasks (that Brusilovsky calls stages) in the overall adap-

    tation process: the acquisition task (collecting data about the user, for Brusilovsky),

    the representation and secondary inference task (processing the data to build or update

    the user model, for Brusilovsky) and the production task (applying the user model

    to provide the adaptation, for Brusilovsky). Each task can be split into sub-tasks and

    there exist several methods and techniques to accomplish each of them.

    It is important to notice that our analysis focuses on tasks and sub-tasks that can be

    supported and improved by using semantic techniques, and not on the whole model

    of adaptation. Therefore, some sub-tasks have not been taken into account (as not

    relevant to our analysis), while other sub-tasks have been introduced into the model

    (such as Adaptation to support annotation, Trust modeling and social networks,

    etc.), as a consequence of the new issues addressed by the adaptive systems and by

    the SASs in particular. Moreover, it is worth noting that the model is the result of an

    iterative process that redefined it several times, according to the issues and approaches

    that emerged from the reviewed papers.

    We now explain the SAS functional model, displayed in Fig. 1, that will guide our

    analysis. The easiest way to describe the model is to start from the goal of the overall

    adaptation process, that is to produce an adaptation effect that takes into account the

    user and context features, and, in most of the current adaptive systems, respects pri-

    vacy criteria and, usually, takes care of the controllability of the adaptation process.

    Given this goal, we can follow a bottom-up description of the model, starting from the

    production task (at the bottom of the figure) and moving up to the tasks and sub-tasks

    that allow it to be accomplished.

    123

  • 8/8/2019 Sematic and Social Web

    7/54

    Adaptive systems in the era of the semantic and social web 439

    Fig. 1 Functional model of tasks and sub-tasks specifically suited for SASs

    (i) Adaptation and personalization is the most important sub-task of the Produc-

    tion task. It is responsible for producing the adaptation effect, which can be

    achieved in different ways. In our model we identify three not mutually exclu-

    sive ways:

    Adaptation of content and recommendation, by using adaptation techniques

    and methodologies based on different approaches, including content-based

    approaches, collaborative approaches and hybrid approaches (Burke 2002,

    Herlocker et al. 2004). We note that, when we will classify SASs with respect

    to this task, we will just take into account SASs which use semantic

    techniques to support these approaches. The same is true for all the other

    tasks.

    Adaptation of structure and presentation, regards the task in charge of adapting

    the link structure of hypermedia documents or its presentation to users (Kobsa

    et al. 2001). In our model, this task also includes what Kobsa calls Adaptation

    of the Presentation and Modality, which refers to the adaptation of the format,

    layout and modality with which objects are presented.

    Adaptation to support annotation is a new task, compared to Kobsas classifi-

    cation. It is more specific than the previous two tasks, but we introduced and

    placed it here for its relevance in many SASs. Most of the systems that address

    this task provide support in the form of tag recommendation. We could have

    included this under Adaptation of content and recommendation, since sug-

    gesting a tag is a form of content recommendation, instead we picked it off

    123

  • 8/8/2019 Sematic and Social Web

    8/54

    440 I. Torre

    because some SASs7 use a more complex mechanism of annotation support

    that is not just content adaptation.

    (ii) Privacy and controllability is defined as an auxiliary task since it supports other

    tasks to achieve the overall goal outlined above. Privacy and controllability are

    issues which are addressed by an increasing number of systems and are becom-ing particularly relevant in the context of interoperability, as discussed below.

    Controllability is intended to be the ability of the user to control her own data. It

    includes user awareness and explanation mechanisms for transparent and user-

    controllable personalization. This is an auxiliary task for Adaptation and per-

    sonalization, since it clearly impacts on the personalized output (what the user

    can see, can do, can be aware of, etc.), but it is auxiliary to User and context

    modeling and management as well, since constraints and conditions for data

    scrutability and exchange are connected with this task.

    (iii) Trust modeling and social networks is another auxiliary task which can sup-port the Adaptation and personalization and User and context modeling and

    management tasks. The phenomenon ofsocial networks, originally studied by

    sociologists in the real world, grew on the Web as well, pushed by the tools

    made available by social software discussed in Sect. 2. The consequence is a

    heavy need for associated trust- and reputation-management systems, advocated

    both in Web 2.0 and in the Semantic Web. Web 2.0 allows everybody to add

    content, typically without any control over what users enter. Building distrib-

    uted mechanisms to verify the reliability of users and their content has become a

    critical issue. In the so-called Web of Trust, strategies are under developmentto semantically annotate resources by using trust and reputation markers. The

    Semantic Web specifically introduced a proof layer that aims to provide guar-

    antees about distributed semantic resources. The focus, in this case, is on both

    the authors of content and the processes of exchange, inference and integration

    carried out by intelligent agents.

    The attention paid to trust and social networks is growing in adaptive hypermedia sys-

    tems, as well as on the Web, and in SAS it is particularly relevant. For this reason we

    introduced a specific task that addresses these issues. In current systems, trust values

    and relationships between users are typically used to define the users social context,which is modeled in terms of the users social networks. These relationships between

    users are then usually used to estimate the users trust in other users, and these trust

    values can be, and are often, used by recommendation algorithms.

    (iv) User and context modeling and management is a complex task that deals with

    representing and managing user and context models. It has two sub-tasks that

    correspond to two modalities to extend the knowledge about users and context.

    These sub-tasks are particularly relevant in our Functional Model since both

    of them can be greatly improved by the use of semantic techniques.

    User and context models extension by reasoning, is the sub-task that aug-ments user and context models by making inferences from data in such models.

    7 For instance, the systems described in (e.g., Bateman et al. 2006; van der Sluijs and Houben 2008), that

    will be analyzed in Sect. 7.3.2.

    123

  • 8/8/2019 Sematic and Social Web

    9/54

    Adaptive systems in the era of the semantic and social web 441

    Kobsa et al., in the work mentioned above, identify three main types of rea-

    soning (deductive, inductive and analogical8 reasoning), and all of them can

    employ weak or strong semantic techniques. A typical application is making

    class-membership inferences by reasoning over ontologies; other applications

    include inferring similarity between users by computing the semantic distancebetween their models or between their tag clouds.

    User and context models exchange and integration, is a sub-task that allows a

    SAS to extend the models of users and context, by acquiring these data from

    other applications. It may acquire unknown values of features or refine a value

    with fresher or more reliable data. But it can also update the user and context

    models by introducing new features and corresponding values.

    (v) Domain modeling and management is a task for the representation of the domain

    knowledge and its eventual update and extension.

    Domain knowledge and service exchange and integration is a sub-task that

    regards the extension of the content (and even services) that will be person-

    alized, by using external sources. The possibility of using semantic techniques

    and, in particular, shared ontologies described with standard markup languages,

    initially attracted several researchers in the adaptive hypermedia community.

    It gave the opportunity of extending the adaptive techniques outside of a local

    context, opening the way to the recommendation of open corpora of resources

    besides closed sets of already known resources. Brusilovsky (2001) and Henze

    and Nejdl (2002) provide a definition of Open Adaptive Hypermedia System(OAHS) as an adaptive hypermedia system that operates on an open corpus of

    documents.

    Finally, to complete the overview of the Functional Model of tasks and sub-tasks, we

    notice that each task makes use of some type of knowledge to accomplish its goals.

    Considering the systems we reviewed and the main reference models for Adaptive

    Hypermedia Systems, such as AHAM (Bra et al. 1999) and the Munich Reference

    Model (Koch and Wirsing 2002), we identified ten main types of knowledge, often

    called model in the literature. In Fig. 1 they are very roughly represented within

    rounded black boxes. Typical SASs use the semantic approaches seen in the previous

    section to represent these models or a set of them. Therefore, for example, we can

    have a strong SAS that represents the user model as a collection of RDF statements

    and a weak SAS that annotates the domain items by using social tags.

    In more detail, the types of knowledge (models) we have identified are about: the

    domain (including knowledge related to domain concepts, contents and also services),

    the user (e.g., user characteristics, user knowledge, usually an overlay of the domain

    model, interests, preferences, etc.), the context and device model (the definition of

    context is discussed in Sect. 6). Besides these, we have identified three other types

    of knowledge that could have been included in the definition of user knowledge, butwe decided not to include there, since several SASs represent them as independent

    8 The authors use the term analogical reasoning to mean all the approaches which are based on recog-

    nizing similarities between users (Kobsa et al. 2001).

    123

  • 8/8/2019 Sematic and Social Web

    10/54

    442 I. Torre

    models: knowledge about the users interaction with the system, her actions and behav-

    ior (similar to Kobsas usage data), knowledge about the users social network, and

    knowledge about users goals and plans. Other types of knowledge are specifically

    related to the adaptation process and regard the model of presentation and naviga-

    tion of the hypermedia, which can be static or dynamic, the model named activity,describing activities that the user has to perform (e.g., models to represent learning

    activities), and the model ofadaptation, which, according to the main reference mod-

    els for adaptive hypermedia systems, specifies the rules that describe the systems

    adaptive behavior. It is worth noting that rules could be used to implement the adap-

    tation behavior or just to describe the model of the adaptive behavior of the system,

    letting the real implementation use other approaches. For example, the representation

    by means of rules could be useful to the goal of trusting the exchanged resources, as we

    will see in Sect. 8. Finally, we identified a last type of knowledge named integration,

    which refers to the integration models that can be used to support the integrationof heterogeneous user models or domain content or services from different SASs. It

    can be likened to the component referred, in Berkovsky et al. (2008), as an auxiliary

    knowledge base used for the integration of knowledge (integration of User Models,

    in their approach), allowing, for example, reconciliation of the ontologies used by

    various recommender systems.

    4 Classification of strong and weak SASs

    In this section, we describe the matrix we have defined to analyze SASs, which is

    shown in Fig. 2. The x-axis (columns) denotes the semantic techniques used by the

    adaptive system (as defined in Sect. 2) and the y-axis (rows) gives the tasks and

    sub-tasks defined in the SAS Functional Model (Sect. 3). Therefore, the x-axis repre-

    sents a discontinuous-continuum from weak to strong semantic technologies.9 Recall

    that strong semantic techniques are those using languages and technologies for the

    Semantic Web. It includes ontologies, ontological languages, rule languages, query and

    reasoning engines. The specific technologies we considered are described in Appen-

    dix A. Weak semantic techniques are those techniques and approaches that somehow

    add meaning to web resources. They include Web 2.0 social annotations, with partic-

    ular attention to tags and folksonomies, and what we called middle-ground and mixed

    approaches, namely approaches that lie between the two extremes of the discontinu-

    ous-continuum and encompass approaches that integrate Semantic Web (SW, in the

    matrix) and Web 2.0 and emerging technologies such as microformats and RDFa. On

    the y-axis, we have placed the SAS Tasks and Sub-Tasks described in Sect. 3, with a

    slight difference regarding Trust and Social Networks. Given that trust and social

    networks are currently mainly used in SASs as an approach to model the user social

    context and to improve recommendation algorithms using this model, we decided

    to simplify the matrix by including this task just under Adaptation of content andrecommendation.

    9 It is a continuum since these technologies are on a line from low to high semantics but this line is

    discontinuous since the steps toward the higher level are not simple upgrades.

    123

  • 8/8/2019 Sematic and Social Web

    11/54

    Adaptive systems in the era of the semantic and social web 443

    Fig. 2 Matrix representing the distribution of SASs with respect to the tasks they perform and the semantic

    technique they use

    Each cell contains the SASs that exploit the technology specified on the x -axis to

    accomplish the task specified on the y-axis.10 Some remarks:

    (a) To be placed in a cell, a SAS must accomplish the specified task by using the

    specified semantic technology. Thus, for example, a system that provides user

    modeling functionality but mainly uses the semantic representation of domain

    knowledge for interoperability (not for user modeling) is placed under Domain

    10 SASs have been selected using, as sources, the main journal and conferences in the Adaptive Hypermedia

    and User Modeling community (see www.um.org) and subsequently workshops, conferences and journals

    regarding semantic technologies and the combination of semantic technologies and personalization. The

    list of reviewed systems is in Appendix B.

    123

    http://www.um.org/http://www.um.org/
  • 8/8/2019 Sematic and Social Web

    12/54

    444 I. Torre

    knowledge and service exchange and integration and not under User and con-

    text modeling and management.

    (b) Similarly, if a SAS uses semantic techniques to accomplish a task and this choice

    indirectly improves another task, the system is classified according to the task

    that, in our opinion, most benefits from the technique.(c) SASs that use mixed approaches, integrating Semantic Web techniques and Web

    2.0 social annotations, are placed in the column named middle-ground and mixed

    approaches and the specific semantic techniques they use are indicated in brack-

    ets. However, they can also be placed in other columns if, for a specific task, they

    use folksonomies or ontologies alone.

    Note that the goal was not to produce an exhaustive classification but, rather, to give

    an overview of the main trends in this field of research and an indication of promising

    approaches. In particular, it is important to observe that:

    (i) the non-homogeneous description of systems and, clearly, the subjectivity of the

    author of this review, influenced the classification,

    (ii) this review focuses special attention on weak semantic adaptive systems, since

    this topic has not, to date, received much attention.

    Figure 3 integrates the matrix, providing a snapshot of the distribution of SASs with

    respect to the type of knowledge they represent using the semantic techniques on the

    (x -axis). Therefore, on the y-axis we placed the main types of knowledge that canbe used by an adaptive system to accomplish its tasks and which a semantic adaptive

    system can represent by means of semantic techniques. In order to place these various

    types of knowledge on the y-axis of the map, we have approximately used the criterion

    of descriptive vs operative knowledge, making reference to the specific use of these

    types of knowledge in typical adaptive systems. Thus, very roughly, we have placed

    the domain knowledge at the upper edge of the y-axis (descriptive knowledge) and the

    knowledge regarding adaptation at the lower edge (operative knowledge). Between

    these extremes, we have all the other types of knowledge, and all of them can be

    represented by using weak or strong semantic techniques. For example, SASs thatannotate the domains items by means of semantic technologies (such as OntoAIMS,

    PRF, OntobUM), have been placed in the upper right part of the map, while SASs that

    annotate them by means of social annotations (such as AnnotatED, TBCF, etc.) have

    been placed on the upper left part of the map.11

    Notice that all the SASs in Fig. 3 are represented in this map too, placed in corre-

    spondence of at least one type of knowledge. The reason is that semantic techniques

    can only be used for a task if a semantic representation of the required knowledge is

    available .

    The following of the paper will analyze each task and sub-task with respect to bothstrong and weak semantic techniques.

    11 We intentionally use the term map, and not Cartesian plane, to emphasize that the position of the various

    types of knowledge is just very approximately based on the criterion above.

    123

  • 8/8/2019 Sematic and Social Web

    13/54

    Adaptive systems in the era of the semantic and social web 445

    Fig. 3 Distribution of SASs with respect to the semantic techniques they use to represent the different

    types of knowledge (domain model, user model, adaptation model, etc.) in order to accomplish the differenttasks

    5 Domain modeling and management

    As explained, this task concerns the representation and management of the domain

    knowledge, including domain concepts and specific content items. It represents the

    concepts that will be used to annotate and index resources. With respect to our focus of

    analysis, resources can be annotated by using concepts that refer to an ontology/taxon-

    omy or by annotations defined by end-users. The sub-task most relevant to this analysis

    regards the possibility to extending the content (and even services) that will be person-

    alized, by using external sources, building the so called Open Adaptive Hypermedia

    System (OAHS) introduced in Sect. 3.

    5.1 Domain knowledge and service exchange and integration

    Figure 4 provides an overview ofissues relevant to this sub-task in strong (upper part

    of the figure) and weak (lower part of the figure) SASs. It is important to notice that

    the table includes only issues to which strong or weak semantic techniques have been

    applied in the SASs we analyzed. Issues that can be relevant to the task itself but are

    not related to the use of semantic web techniques and issues not addressed by any

    SAS are not included in the figure. More in detail, the first column identifies relevant

    issues for the task, the second column identifies possible approaches to face them and

    the third column provides examples of SASs that follow such approaches.

    123

  • 8/8/2019 Sematic and Social Web

    14/54

    446 I. Torre

    Fig. 4 Domain knowledge and service exchange and integration sub-task

    5.1.1 Strong semantic techniques

    In the past, different XML-based interoperability standards have been developed to

    allow organizations to work together, in different domain fields. In the field of eLearn-

    ing, for example, several standards describe the learning objects (LO), such as IMS

    LIP, IEEE LOM and SCORM. Appendix A provides details of the most used stan-

    dards in adaptive systems and a discussion can be found in Dolog and Nejdl (2003),

    Aroyo et al. (2006), and Kravcik and Gasevic (2007). These standards, which specify

    import and export format, are very useful for enabling interoperability between stable

    and centrally managed systems, but they are not the solution in many other situations,

    since they do not admit flexibility in knowledge representation. The Semantic Web,

    on the other hand, allows this flexibility mainly because, while XML Schemas are

    used for modeling XML documents, RDF is used to model Knowledge: XML/XML

    Schema is a data modeling language, while RDF is a meta-data modeling language

    (Nilsson 2001).

    However, representing data with ontologies has several problems too, when the

    objective is to automatically and dynamically exchange and integrate data. In a very

    flexible and open scenario, each application can define its own ontology. As a conse-

    quence, to exchange knowledge, these applications have to manage the semantic heter-

    ogeneity of the annotations that refer to the concepts of their different ontologies. This

    means facing the problem of ontology alignment, or matching, and, when necessary,

    of ontology merging. Several techniques have been proposed to face these problems

    in the Semantic Web community. Shvaiko and Euzenat (2005) provide a survey of

    123

  • 8/8/2019 Sematic and Social Web

    15/54

    Adaptive systems in the era of the semantic and social web 447

    methods and algorithms for the former problem, others, such as Stumme (2005), deal

    with the latter. Up to the time of writing, we did not find significant examples of

    SASs that perform an automatic alignment of domain ontologies. Instead, there are

    SASs that perform manual or semi-automatic mappings. HERA (Vdovjak et al. 2003),

    for example, defines an integration model ontology which is used by the designer tospecify the links between ontologies. Other SASs use ad hoc transformations. iFanzy

    (Bellekens et al. 2007), for example, integrates several sources transforming them into

    OWL/RDF and using SKOS to describe relations between concepts.

    Other approaches propose to bypass the problem of ontology alignment by using

    reference ontologies, onto which all applications that wish to interoperate must map

    their own ontology (Bodoff et al. 2005). Some of these ontologies could also be

    based on the RDF bindings of pre-existing interoperability standards.12 Using refer-

    ence ontologies (the most typical approach in current SASs), could seem similar to

    using XML-based standards. However, even if the flexibility of defining its ontologyis reduced, each application can describe instances and their relationships (e.g., RDF

    statements) freely. Moreover, mixed approaches are also possible, for example by

    extending the reference ontology to model specific domains.

    One of the most representative systems with respect to this issue is the Personal

    Reader Framework, PRF (Dolog et al. 2004a; Henze 2005). The authors explicitly

    state the necessity of using the existing standards as much as possible, to favor inter-

    operability. PRF is conceived to provide an environment for designing and running

    Personalization Services which are able to understand RDF-annotated resources

    using standard ontologies for learning materials, such as LOM and IMS, and alsospecific domain ontologies which describe the particular subject being taught.

    Besides the issue of knowledge representation and alignment, another issue con-

    cerns the exchange management of annotated resources. Interoperability is usually

    provided using Web Service technology13 and there are several examples in the field

    of adaptive systems as well. The combination with semantic representation of services

    is at the basis of the Semantic Web Services that can be considered a landmark tech-

    nology for semantic interoperability (Martin et al. 2004). In PRF, the Personalization

    Services are defined as Semantic Web Services registered in a service registry using

    the Web Services Description Language (WSDL, see Appendix A). Each one pro-

    vides some adaptive functionality, such as, recommending learning objects, suggest-

    ing exercises, etc. At run-time, Personal Reader instances are generated by plugging

    together a number of these Personalization Services. PRF was developed as a frame-

    work for personalized eLearning but, over time, the concept of Web Services which

    offer personalization functionality has been generalized to other domains such as pub-

    lication viewing and music recommendation. MyEar (Henze and Krause 2006), for

    example, provides a Personalization Service which searches the Web for podcasting

    RSS feeds that fit the users taste in music and combines items of this feed to produce

    12 Notice that the definition and improvement of the RDF binding of most of these standards is an ongoing

    work.

    13 A Web Service is defined by the W3C as a software system designed to support interoperable machine

    to machine interaction over a network. Frequently, they are implemented simply as Web APIs hosted on

    servers accessible to clients.

    123

  • 8/8/2019 Sematic and Social Web

    16/54

    448 I. Torre

    a personalized podcasting feed. Notice that RSS feeds are becoming a common way

    to collect and integrate frequently updated information on the Web.

    Other examples of projects that use Web Services are COMPASS and SmartWeb.

    COMPASS (Van Setten et al. 2004) is a mobile tourist application. It is defined as a

    Web Service based on an open platform, which allows third parties to integrate newservices into the platform, to publish them in the public web service registry and to cor-

    relate them to the registry ontologies. The descriptions of the Web Services, expressed

    in WSDL, are extended with OWL. SmartWeb (Sonntag et al. 2007), a follow-up of

    SmartKom (Wahlster 2006), is a wide-range project that focuses on the development

    of multimodal user interfaces to distributed and composable Semantic Web Services

    on mobile devices. The ontological infrastructure is composed of a Top-Level Ontol-

    ogy (SUMO plus DOLCE), coupled to domain-specific ontologies. The Web Services

    are described by using OWL-S. Ontologies are used for the semantic representation of

    user queries (ontology-driven question analysis) and for the composition of the answerby querying different Web Services.

    5.1.2 Weak semantic techniques

    In Sect. 2 we defined the weak semantic web to be those approaches that semantically

    enrich Web resources but, for different reasons, do not use strong semantic techniques.

    A typical reason for not using ontologies is the need to annotate resources in insta-

    ble domains, subject to frequent changes. In this case, instances could require to be

    annotated with new and evolving metadata categories. Another reason to choose weakapproaches is that the definition of ontologies is directly influenced by the authors sub-

    jectivity, which can conceive the domain concepts and their relationship in a different

    way from the users of the annotated resources. This gap can be a problem for informa-

    tion retrieval. For example, non-specialists often find it hard to retrieve resources from

    art collections that have been classified by specialists, so several researchers, such as

    Trant (2006), have started to introduce social tagging as a complementary approach.

    However, considering the specific task of interoperability, using only Web 2.0 social

    annotations is surely less powerful than Semantic Web techniques. Much more oppor-

    tunities come from middle-ground and mixed approaches, but, up to now, very few

    adaptive systems use them. As Fig. 4 shows, the main issues for this task are similar

    to the issues in the previous section. A difference is that in weak SASs, heterogeneity

    managementis not necessarily a problem but it can be a goal, consisting of obtaining

    multifaceted annotations for a given resource.

    As in the previous section, we provide a brief description of some systems that

    face the issues in the figure using different approaches. However, note in advance that

    we did not find any very representative Semantic Adaptive Systems. Some are very

    slightly semantic, others very slightly adaptive and others slightly related to the task

    of exchange and integration. Moreover, we did not find examples of social annota-

    tions imported into the SAS and used to enrich resources (on the contrary, we will

    see examples of this approach in Sect. 6.1.2, concerning the interoperability of user

    models). We only found examples of systems where social annotations are used to

    index the imported resources (as in Knowledge Sea II) or are managed to be exported

    using strong semantic formalisms (as in the other two examples).

    123

  • 8/8/2019 Sematic and Social Web

    17/54

    Adaptive systems in the era of the semantic and social web 449

    In more detail, Knowledge Sea II (Ahn et al. 2006; Brusilovsky and Henze 2007)

    is a slightly semantic open adaptive system (a very representative OAHS), which auto-

    mates the hypertext construction and the indexing of learning resources on the Web.

    Social annotations, together with other statistical techniques, are used to achieve this

    integration of resources. The automatic indexing is accomplished by calculating andvisualizing, for each page, some indicators based on social annotations provided by

    users to pages. In this way, new documents can be added to the system as soon as a

    user encounters them during navigation, providing, at the same time, adaptive naviga-

    tion support. GroupMe! (Abel et al. 2007c) is an open slightly adaptive system which

    allows users to import multimedia Web resources, tag and group them. The peculiarity

    of the system is that resources, tags and groups are transformed and stored as RDF

    descriptions and made available for export. For example, a photo imported by Flickr

    is integrated into GroupMe!, transforming the Flickr image title into the Dublin Core

    element title and annotating the user tags using a public Tag ontology. Like GroupMe!,Revyu (Heath and Motta 2008) is an example of bridging between Web 2.0 and the

    Semantic Web. It is a recommender that is slightly related to the issue of exchange

    and integration. The main reason to mention it here is that it is an example of SAS that

    uses the microformats technology (the hReview microformat, in particular) to export

    data, while it uses an RDF binding of hReview to store data in its RDF repository. In

    this way, applications that use the hReview microformat can import such data and data

    in the Revyu system can be internally managed using strong semantic techniques.

    Concluding, we note that most of the social tagging systems have APIs for import-

    ing and exporting tagged resources. In this way, it could be possible to achieve severalforms of data integration and exchange between applications. Besides APIs, a simpler

    way to exchange data is using RSS feeds from tagging systems. These data can then

    be imported and integrated with other resources and tags, but currently in this task we

    have only examples of SASs that export RSS feeds.

    6 User and context modeling and management

    The distinction between User and Context models is widely debated. Some authors

    include some features regarding users, in particular short-term features, into the Con-

    text model, while others include all features regarding the user into the User Model.

    Still others model the user as a component of the Context. A recent discussion of this

    issue is in Brusilovsky and Milln (2007). The authors state that there is no definite

    agreement about what should be included into the area of context. It depends on the

    community to which the researcher belongs. The user modeling community has a

    user-centric perspective while the mobile and ubiquitous computing community has

    a context/device-centric perspective. According to the Functional Model in Sect. 3,

    the User and context modeling and management task is structured in two sub-tasks

    that specify the main ways of extending knowledge about users and context by using

    semantic techniques, namely: acquiring knowledge from external sources and appli-

    cations and reasoning on the currently available knowledge to infer new knowledge.

    In the Ubiquitous Computing community, the main approach is the former, while in

    the User Modeling community the foremost approach is the latter.

    123

  • 8/8/2019 Sematic and Social Web

    18/54

    450 I. Torre

    6.1 User and context models exchange and integration

    The term syndication, which generically is used in the sense of making available a

    feed for an information source, has been applied to the field of user modeling too, in

    an environment of cooperating and decentralized user and context models (Houbenet al. 2005). An overview of issues and approaches to face this task using strong and

    weak semantic techniques is provided in Fig. 5.

    Fig. 5 User and context models exchange and integration sub-task

    123

  • 8/8/2019 Sematic and Social Web

    19/54

    Adaptive systems in the era of the semantic and social web 451

    6.1.1 Strong semantic techniques

    In addition to the issues related to the exchange of Domain Knowledge, seen in the

    previous section, we find two new problems in this task. The first is resolving possible

    conflicts and contradictions between values of the user models of different systemsfor the same user, due, for example, to the different update time, different inference

    method, imprecise manipulation, etc. (Aroyo et al. 2006). The second is managing

    privacy related to the exchange and visibility of user data (Kay et al. 2002). Notice

    that, according to the specific application scenario, some issues could be more or

    less critical. For example, the issue related to user identification is less relevant if the

    applications have a common identification system, if the identification is carried out

    by a client-side agent or, finally, if a User Model Server is used to share user models

    (e.g.,Heckmann and Krger 2003; Mehta et al. 2005; Henze 2005).

    Considering the issue of representing users features, we can find both examplesof SASs that use reference ontologies to annotate the user features that have to be

    exchanged and examples that use their own ontologies. The proposal of a general

    reference ontology for modeling users comes from Heckmann and Krger (2003),

    who defined a General User Model Ontology, GUMO, to represent the user model

    in a ubiquitous environment in a standard way. To this aim they introduce the defini-

    tion ofsituational statements. A situational statement carries information about user

    model features, context data or low-level sensor data. A specific language, UserML,

    is defined to represent and exchange these statements. It extends the RDF triples with

    predefined semantic slots, so that the triple is in fact a 7-tuple which includes temporaland spatial restrictions, meta-data about ownership, privacy, evidence and confidence.

    The idea is to allow more powerful, but still structured, statements about situations.

    They also define the UbisWorld model which, besides the GUMO ontology, includes

    a set of other six specific ontologies that aim to model context. The PEACH museum

    guide, as described in Kruppa et al. (2005), implements the situational statements

    model and the U2M UserModelService of UbisWorld which is used to inspect the

    user model.

    Following the approach of Heckmann and colleagues, most problems that may

    happen when integrating partial user models concern conflicts14 about the exchanged

    values of the user models between the systems involved, while the problem of seman-

    tic heterogeneity management of annotations is not relevant.15 The main drawback of

    this approach, also acknowledged in Heckmann (2005), is the necessity of

    domain-specific features to properly model the user. Each domain may require specific

    features to model the user and, moreover, the same user feature can be perceived and

    classified in different ways in different domains.

    Compromise solutions have been investigated to use well-established, but non omni-

    comprehensive, ontologies using already existing standards, when available, in differ-

    ent domains. The Learner API in the ELENA project (Dolog and Schaefer 2005), for

    14 An accurate categorization of conflicts which may occur at different levels of the user models interop-

    erability process is provided in Heckmann (2005).

    15 Actually, the model includes the possibility of managing semantic conflicts due to annotations based on

    different ontologies, however this issue is not deeply analyzed in their approach.

    123

  • 8/8/2019 Sematic and Social Web

    20/54

    452 I. Torre

    example, uses a RDFS learner profile ontology, configured from fragments based on

    three specifications: the learning performance of a student is defined with the IEEE

    PAPI standard, the competencies that have to be achieved are represented with the

    IMS RDCEO standard and, finally, the learner assessment is defined with the IMS

    QTI standard. The problem the authors highlight is that none of the referenced stan-dards present their metadata in a way that makes it possible to use them in combination

    as depicted above. Therefore, an RDF translation of these standards had to be devel-

    oped, which made it possible to use them in combination. Moreover they add that this

    RDF translation is unofficial, and they therefore view it as an important direction

    for future standardization work that the standards use a common framework such as

    RDF and the Semantic Web, to enable the added value of using the standards together

    (Aroyo et al. 2006).

    A different approach to user model interoperability is allowing each user modeling

    system to define the user model ontology that better fits its needs. In this scenario,techniques are necessary to manage the semantic heterogeneity of the distributed user

    models. A general framework, non-specific for SASs, for the integration of heter-

    ogeneous User Models is provided in (Berkovsky et al. 2008). The authors define

    this approach mediation of user models and explain that it requires a UM mediator

    responsible for resolving conflicts and heterogeneities in the obtained user mod-

    eling data using various reasoning and inference mechanisms. GUC, the Generic

    User model Component of van der Sluijs and Houben (2006), is an example of this

    approach. It creates mappings between the user models schemas of the SASs involved

    in the exchange. These mappings are created manually by designers and expressedas SWRL rules that are executed every time user model data are exchanged between

    these SASs. SWRL rules are also used to specify how to manage conflicts between

    data imported from the user model of other SASs. In Carmagnola and Dimitrova

    (2008), the exchange of user data is automatically performed through an algorithm

    which computes the semantic similarities among classes of different users and domain

    ontologies, combining measures borrowed from Word Sense Disambiguation theory

    with other syntactic similarity measures.

    Considering now the user identification issue, we note that it is critical only for

    SASs that operate in open scenarios which require discovering the available adaptive

    systems the user interacted with, and in particular those which stored some informa-

    tion about that user. The framework ofCarmagnola and Dimitrova (2008), mentioned

    above, automatically identifies users over different systems by considering them to

    be a collection of properties and then comparing the values of common properties.

    Such a comparison is held by the identification algorithm described in Carmagnola

    and Cena (2009). A simpler approach to user identification uses reference ontologies

    to represent the user. It is adopted, for example, in ELENA and works by matching

    the annotations that identify a user on the basis of the ontology that describe learners

    in a standard way. FOAF (Appendix A), could be another reference ontology useful

    to this aim.

    Considering the exchange management issue, we found two main approaches that

    regard, respectively, the use of (Semantic) Web Services with public APIs that export

    user features/values and the use of a semantic infrastructure for querying user fea-

    tures/values, such as Edutella. These solutions are described in the work of Dolog

    123

  • 8/8/2019 Sematic and Social Web

    21/54

    Adaptive systems in the era of the semantic and social web 453

    and Schaefer (2005). The approach of using Web Services to support interoperability

    has been proposed also in CHIME (Chepegin et al. 2003), a framework for adaptive

    systems which extends AHA! Each component is seen as a service and each service

    is defined on the basis of a specific ontology. Of particular relevance to this task is the

    implementation of a Broker-based Discovery Service (Chepegin et al. 2005), whichis defined as an adaptive Web Service that allows AHA! applications to exchange

    user models, and that uses a specific markup language for user modeling (MLUM).

    Finally, another example of application that makes use of Web Services is the work of

    Cena and Furnari (2008). They propose an approach for achieving user model inter-

    operability by means of formally described semantic dialogues. In case of different

    ontologies, the exchange is performed by means of negotiation techniques based on

    dialogue models (Cena and Aroyo 2007).

    Not only user model data, but also context data can be exchanged. In this case,

    besides all the problems already seen for user models exchange, another critical prob-lem is the rapid update of data. These issues are faced, for example, in the MobiLife

    project (Kernchen et al. 2006), where different recommenders share and exchange

    information about the current context conditions of the user. The objective is to inte-

    grate these data, cover any lack of information and make this kind of knowledge more

    precise and complete. In Luther et al. (2005), the authors explain how OWL ontologies

    and OWL-DL reasoning methods are used to enrich the description of contextual situa-

    tions defined in terms of location, time, people and sensor information. Previously, also

    other researches proposed service architectures based on context ontologies (e.g.,Chen

    et al. 2003; Power et al. 2004).Related to context, another approach to managing interoperability and cross-

    systems personalization is the definition of a Context Passport (Mehta et al. 2005)

    that accompanies users on their travels through the information space. When interact-

    ing with a system, the relevant context of use is extracted from this context passport,

    related to a Unified User Context Model (UUCM), and is used for improving the

    support to the respective information-related activity. In a similar way to GUMO,

    the ontology at the basis of the UUCM tries to provide an integrated representation

    of the user and the context model. In the proposed scenario, all applications must use

    the same unified user and context model (UUCM).

    6.1.2 Weak semantic techniques

    Up to now, only a few weak semantic adaptive systems exchange and integrate user

    and context features and all use middle-ground and mixed approaches. In more detail,

    they exchange user-generated tags and use strong semantic techniques to map them

    with respect to the data in their applications. User generated-tags can concern the user

    herself or resources the user tagged. In the second case, tags describe the resource and

    not the user, but they will be used by the importer SAS as a some kind of evidence

    about the user, for example as a feedback about her knowledge or her interest in the

    resource.

    Considering the issue of automatic identification of the user on different SASs,

    we found just one SAS (Szomszor et al. 2008) that tries to compare features of the

    user on different systems, to uniquely identify it. In more detail, the authors use the

    123

  • 8/8/2019 Sematic and Social Web

    22/54

    454 I. Torre

    Google Social Graph API, which includes a matching technique for cross-profiling

    based on the user homepage, when available. Other SASs, such as CHIP and iCITY,

    use a practical, semi-automatic approach, that is asking users a subscription to the

    RSS feeds made available by the systems that interoperate.

    RSS feeds are also an economical approach for the exchange management ofresources and annotations. iCITY (Carmagnola et al. 2007b), for instance, is a social

    mobile guide with Importer and Exporter modules that allow the exchange of

    user tags in the form of RSS feeds. Notice, however, that the authors have recently

    developed some public APIs to increase the flexibility of the exchange. An exam-

    ple of interoperability between iCITY and another, completely independent, adaptive

    system, CHIP (Aroyo et al. 2007), is provided in (Wang et al. 2008). It presents an

    approach to using iCITY user tags about cultural events in order to integrate dis-

    tributed user profiles, with the ultimate goal of addressing the cold-start problem

    in adaptive systems. The basic approach involves the re-use of user interaction data(tags) generated by one application in another application in a similar domain, for

    providing cross-system recommendations. In this respect, it follows the previously

    mentioned approach defined as mediation of user models (Berkovsky et al. 2008).

    CHIP is not a tagging system. Artworks are annotated by using external vocabu-

    laries and thesauri and some its own concepts. Therefore, a mapping with iCITY

    tags is necessary. This heterogeneity of annotations is managed by the authors by

    using the Simple Knowledge Organization System (SKOS) Core Mapping Vocabu-

    lary Specification, created for linking thesauri to each other with relationships such

    as equivalent concept, broader, etc. The mapping between the iCITY user tags andthe CHIP concepts is realized in two steps: (i) identification of the type (e.g., creator,

    place, material, etc.) of tags; (ii) tag mapping by using SKOS Core mapping relations.

    Another approach to face this issue is to use a reference ontology to understand

    and use the exchanged resources. This approach is at the basis of the proposal of

    Heckmann et al. (2007) of extending their user and context ontology GUMO by tak-

    ing into account Web 2.0 user annotations, and in particular tags, adopting approaches

    of tag-space mining described in Sect. 6.2.2. Instead, Szomszor et al. (2008) col-

    lect the user tag cloud on different systems, unify them in a unique profile and use

    the Wikipedia oltology as the reference for the mapping process, as we will see in

    Sect. 6.2.2.

    Finally, regarding privacy management, to date, we have not found any SASs that

    implement privacy management. Some ideas come from Schwarzkopf et al. (2007),

    that will be presented in Sect. 8.2, while we did not find any SAS that describe

    approaches for conflict resolutions of user data (as on the contrary we saw in the

    previous section).

    6.2 User and context models extension by reasoning

    An overview of issues and approaches to face this task using strong and weak semantic

    techniques is provided in Fig. 6.

    123

  • 8/8/2019 Sematic and Social Web

    23/54

    Adaptive systems in the era of the semantic and social web 455

    Fig. 6 User and context models extension by reasoning sub-task

    6.2.1 Strong semantic techniques

    The issue of inferring user and context knowledge by reasoning over ontologies is

    tricky and contentious, even from a terminological point of view. The basic way of

    reasoning over an ontology is to perform basic deductions to make explicit some

    implicit ontological knowledge. However, this kind of reasoning is not always suf-

    ficient. For example, it can be necessary to define complex relationships among

    concepts, requiring composition operators for properties, or specify rules that infer

    non-implicit knowledge. In these cases a rule language is necessary. Semantic rule

    languageshave been defined; however, when the interoperability of rules is not required,

    another possibility is to use common non-semantic rule languages. Hatala and Wakkary

    (2005), for example, explain that, in ec(h)o, they load their DAML+OIL ontologies

    into Jess to infer all the missing relations in the RDF graph and note that semantic

    rule languages like RuleML were not considered since other requirements such as

    performance had a priority over the interoperability (Hatala et al. 2005).

    In the field of eLearning, ontological reasoning has been often used to infer the

    users level of knowledge of concepts. In these systems the user model is usually rep-

    resented as an overlay of the domain ontology. A discussion about this issue can be

    found in Brusilovsky and Milln (2007). The ontological representation is particularly

    suitable to model concepts and relationships between concepts and to reason about

    them to update the user model and allow the SAS to present the student suitable activ-

    ities. Kay and Lum (2004) provide examples and interesting experiments on inferring

    core terms from the finer grained terms in an ontology.

    123

  • 8/8/2019 Sematic and Social Web

    24/54

    456 I. Torre

    An example of a SAS that uses ontologies and ontological reasoning to build the

    user model is OntoAIMS (Denaux et al. 2005) and its interactive user modeling com-

    ponent OWL-OLM. OntoAIMS is an ontology-based version of a previous system,

    AIMS, and provides an information searching and browsing environment for learners

    that recommends the most appropriate task to work on and aids them to explore domainconcepts and read resources related to the task. The user model, the knowledge about

    concepts, resources and learning tasks are represented by means of ontologies. The

    user model is built through a graphical dialog with the user. Each utterance is analyzed

    to infer if concepts and relationships between concepts are used correctly or if there

    is a discrepancy with the domain ontology. Reasoning over the domain ontology and

    the conceptual model is also used to determine how to continue the dialog with the

    user.

    So far, we have dealt with the use of ontologies to model the users domain knowl-

    edge, but ontologies and ontological reasoning can be also used to model other features.For example in ec(h)o, an immersive audio museum guide, they are used to model

    interests, as a set of weighted concepts from the concept ontology. Ontologies are espe-

    cially useful when the concepts to be modelled are related to each other and knowledge

    of one concept can be used to make inferences about another. They can be also used to

    resolve conflicts between evidences. Niu and Kay (2008, 2009) provide an interesting

    example of this in the domain of context modeling. This work represents a significant

    shift in the community of mobile and ubiquitous computing, as it uses ontologies for

    reasoning, rather than for interoperability. PERSONAF is a framework for ontological

    reasoning about smart buildings and the people in them. Reasoning is beased on PECO,a three layered-ontology. The first two layers model the inside of a building, using

    mainly is-a and part-of relationships, while the third layer models the dynamics of

    people, devices, sensors and location inside such building and is automatically accreted

    using information from a range of sources, such as sensors. ONCOR is the reasoning

    module, built as a set of resolvers, for interpreting all this knowledge. In particular, it

    includes a set of resolvers that exploit subsumption relationships to tackle the problem

    of uncertainty and sensor evidence conflicts in location resolution.

    A final consideration concerns the modeling of social context, which has grown

    in importance in recent years. To this aim, some systems, such as CHIP use FOAF

    (Appendix A) to represent the relationships between users. However, in the reviewed

    systems, we did not find examples of reasoning over social networks using FOAF. In

    this respect, the recent initiative to enhance FOAF with a Web Rule Language such as

    RuleML is interesting.

    6.2.2 Weak semantic techniques

    Considering that social annotation-based systems store tags and annotations inserted

    by each user, we could observe that these systems are all intrinsically user modeling

    systems. However, more specifically, there are examples of systems that further pro-

    cess this knowledge to extend the user model and provide personalized services. In

    most of the SASs we analyzed, users tags are used to infer the user interests as a direct

    association with them. However, this inference can be compromised if: (i) users tag

    resources in which they are not especially interested or (ii) they tag resources by using

    123

  • 8/8/2019 Sematic and Social Web

    25/54

    Adaptive systems in the era of the semantic and social web 457

    personal tags, opinion tags and so on, rather than tags that describe the resource. The

    first problem is not very relevant if resources are bookmarks, since, typically, a user

    bookmarks resources she is interested in. The second problem is not very relevant if

    we consider the whole set of tags of a resources,16 but it can become a problem if

    we consider the specific users tags. In this case, it would be necessary to identify thecategory the tags belong to. If a song about violence is tagged with music, crime

    and sad, we cannot automatically assume that the user is interested in these things.

    This issue probably represents the main problem in this task, however, up to now, it

    has not been analyzed in depth with results that can be generalized. Sometimes it has

    been simply ignored, other times it has been partially bypassed by adopting collab-

    orative approaches. It has been recognized as a basic problem in only a few cases

    (e.g.,Szomszor et al. 2008) and studies have been done to validate the approach.

    The main approaches we found in current SASs to reason on tags and extend the

    user and context models are summarized in Fig. 6. They can be split in approachesthat use cognitive analysis of tags and approaches that use tag space mining tech-

    niques. Considering the first approach, Szomszor et al. (2008) semantically model

    user interests by mapping user tags on a multi-domain model. The basic assumption,

    which is supported by the authors studies, is that the tags used to describe resources

    mostly reflect their interests. Given that, firstly they clean the users tag cloud collected

    from different tagging systems (see Sect. 6.1.2 for details), then they attempt to match

    each tag to a Wikipedia category. Finally, they generate a FOAF file to express the

    users interests using references to Wikipedia category URIs. In our view, choosing

    the Wikipedia ontology as a multi-domain model is a pragmatic and effective solutionsince it is constantly updated by the community and includes acronyms and slang

    terms, typically used in tags. Moreover, it allows to exploit the relationships between

    categories and thus to make inferences that generalize or specialize the user interests.

    While the above work used tags to infer user interests, Carmagnola et al. (2007a)

    analyse how other user properties can be inferred from tagging. In particular, the

    authors consider tagging from two points of view. The act of tagging gives informa-

    tion about the users interest in the tagged content, her inclination to interact with the

    system, her tendency to organize contents, etc. Instead, the meaning of tags (analyzed

    by means of WordNet), is used to infer other user properties, such as her knowledge of

    concepts, her trust in the community and her creativity. The approach is under devel-

    opment in iCITY (Carmagnola et al. 2008), where these features have to be combined

    with a user model represented as an RDF ontology, where the user interests are an over-

    lay of the domain ontology and the user tags are an overlay of tagged instances of the

    domain. Nauerz et al. (2007) also sketch ways to infer users interests, preferences and

    knowledge from their annotations. They also introduce the idea of building or refining

    user models by using user-to-user annotations, which can be useful to model passive

    users. In contrast, Schwarzkopf et al. (2007) use mining algorithms to automatically

    learn a taxonomy of tags and then to determine how strongly the user is associated

    with a specific tag of the taxonomy. The taxonomy groups several tags around a set of

    16 As we observed in Sect. 2, after some time, a stable pattern of tags tend to emerge for each tagged

    resource (Golder and Huberman 2006) so that the entire set of tags provides a multifaceted description of

    the resource, where personal and non-descriptive tags tend to have lower occurrence.

    123

  • 8/8/2019 Sematic and Social Web

    26/54

    458 I. Torre

    concepts, therefore by mapping her tags to the taxonomy, they can infer the concepts

    the user is interested in, even if these concepts are not explicit in the users tags. Even

    though the authors admit that the approach simplifies the complexity of the linguistic

    variations, the results seem promising. Another approach to mining the tag space is

    to use collaborative techniques to find similarities between users, as sketched in vanSetten et al. (2006). The basic idea is using similarities between tags entered by differ-

    ent users to find similarities between these users. This idea is interesting, though there

    are risks of basing recommendations solely on these similarities: classical collabora-

    tive filtering infers sets of users with common interests, based on the ratings they give

    to express their interest in items; differently, this approach obtains sets of users who

    categorize items in the same way, showing a common view of items. Depending on

    the meaning and category of the inserted tags, this common mental view could also

    reveal common interests, but this could be not always the case.

    7 Adaptation and personalization

    The objective of this section is to provide some examples of systems that enhance

    the adaptation and personalization task by using semantic technologies. Therefore,

    it does not include references to SASs that mainly use semantic techniques for user

    modeling, even if this results in better personalization. Such systems were discussed in

    the previous section. Before analyzing this task, we refer the reader to other analyses

    of personalization in the Semantic Web such as Baldoni et al. (2005) and Dolog and

    Nejdl (2007).

    7.1 Adaptation of content and recommendation

    As Fig. 7 shows, we have identified two issues relevant with respect to the use of

    semantic techniques: resource retrieval and information filtering. Resource retrieval

    is not considered to be the same as general information retrieval, rather, it is a step in

    the process of content adaptation, which can be enhanced using specific techniques.

    Considering information filtering, the so-called content-based approach (see Sect. 3)

    is the natural form of content adaptation when domain items are represented semanti-

    cally. Despite that, semantic techniques have also been used to enhance collaborative

    algorithms. In addition, we describe a new approach, which we call the social network-

    based approach, that performs content personalization by exploiting relationships in

    social networks.

    7.1.1 Strong semantic techniques

    The first step of content adaptation is typically querying the domain ontology (Dolog

    and Nejdl 2007), often based on an explicit user query. The query can be expanded or

    restricted, as in the Personal Reader Framework PRF, introduced above, depending

    on the user and context profile and on the retrieval strategy (e.g., looking for super-

    concepts if the query does not return enough results; looking for instances with the

    lowest deviation from the user preferences, as in PPSS).

    123

  • 8/8/2019 Sematic and Social Web

    27/54

    Adaptive systems in the era of the semantic and social web 459

    Fig. 7 Adaptation of content and recommendation sub-task

    Before the content can be selected or specific items recommended, reasoning can be

    performed. This may be ontological reasoning (i.e., reasoning upon an ontology), or,

    more generally, ontology-based reasoning (i.e., reasoning on knowledge that refers to

    an ontology). CHIP uses ontology querying and ontological reasoning to exploit the

    ontological relationships between concepts. Users are asked to rate as many artworks

    as they wish on a five-point scale. Each artwork is described by several properties,

    such as creator and theme. When the user rates the item, the ratings are propagated

    to the properties. These ratings are used by the system to recommend other artworks

    of potential interest to the user and related topics in the semantic network. A similar

    approach is followed in PNS (Conlan et al. 2006).

    The Personal Reader Framework, PRF, offers several examples of content adapta-

    tion provided by its Personalization Services. For instance, the Personal Curriculum

    Planner (Baldoni et al. 2006) is a Personalization Service that uses ontology-based

    reasoning. Users knowledge, learning goals and courses are represented semantically.

    The curriculum is planned by a reasoning engine whose input are RDF statements about

    user knowledge and learning goals and which uses course prerequisites and effects

    as a set of rules whose terms refer to the domain knowledge ontology. The result is

    an RDF document that contains the sequences of courses that fit the users learning

    goals and profile. Further examples of SASs can be found in Abel et al. (2007a) which

    123

  • 8/8/2019 Sematic and Social Web

    28/54

    460 I. Torre

    describes several personalized applications developed within the REWERSE Network

    of Excellence.

    7.1.1.1 Content adaptation using trust modeling and social networks Following the

    Functional Model in Sect. 3, Trust modeling and social networks is an auxiliarytask which can support the Adaptation and personalization and User and context

    modeling and management tasks. In current SASs, this task is mostly used to support

    content adaptation. Typically, it is used to define the users trust in other users and these

    trust values are then used by recommendation algorithms. For this reason we decided

    to address it as an approach of the Adaptation of content and recommendation task.

    The principle ofpersonalized trustis that users of social software do not care much

    about whether the user community trusts a particular user. Rather, users care about

    whether people they trust (i.e., people in their own social networks) trust that user. By

    propagating trust values through social networks, the system can calculate how muchone user should trust another. Personalized trust has been used to define recommen-

    dation algorithms. The TrustProject (Golbeck 2005; Golbeck and Mannes 2006), for

    example, uses a TrustOntology based on the FOAF vocabulary and calculates person-

    alized trust for each user and specific topics. Effectiveness of its recomendations has

    been tested in FilmTrust.17 Similar works are Ziegler and Lausen (2004) and Massa

    and Avesani (2004)

    7.1.2 Weak semantic techniques

    Similarly to strong SASs, folksonomy querying is often the a preliminary step for con-

    tent adaptation in weak SASs. Tag-based queries can be processed by using, for exam-

    ple, meaning disambiguation techniques, tag clustering algorithms (as in Shepitsen

    et al. 2008) and query expansion (as in De Meo et al. 2009). As we noted in Sect. 7.1,

    content-based approaches are the most natural form of information filtering for sys-

    tems that use semantic techniques. Tags are used as keywords that, together, describe a

    resource item; user profiles are typically based on tags as well and are matched against

    the content profile with different approaches, as summarized in Fig. 7.

    The first approach, used for example in the Movie Recommender of Szomszoret al. (2007), calculates the users interest in instances using a sort of folksonomy

    reasoning. Users ratings for a movie are propagated to the tag-cloud describing that

    movie. In this way, the system can compute the personal tag-cloud of the user associ-

    ated to each rating.18 The other approach in the figure, applied for example in Shepitsen

    et al. (2008), defines the users interests on the basis of her tags (see the discussion in

    Sect. 6.2.2 about the use of such approaches). In this way, both resources and the user

    profiles can be represented as vectors of tags and classical techniques of information

    retrieval, such as vector space models can be adopted (with the advantage that they are

    applied only to the social annotations and not to the whole content of the resource).

    17 http://trust.mindswap.org/FilmTrust/.

    18 Notice the similarly of this technique to CHIPs ontological reasoning. Notice also that the Movie Rec-

    ommender defines an ontology to represent movies and tags associated to each movie, but the technique

    described above does not use it.

    123

    http://trust.mindswap.org/FilmTrust/http://trust.mindswap.org/FilmTrust/http://trust.mindswap.org/FilmTrust/
  • 8/8/2019 Sematic and Social Web

    29/54

    Adaptive systems in the era of the semantic and social web 461

    So far, we have only mentioned content-basedapproaches. However we also found

    an interesting case which uses social annotations and semantic analysis of such anno-

    tations to improve collaborative approaches. TBCF, ofZhao et al. (2008), uses social

    annotations to improve the effectiveness of neighbor searching in classical collabora-

    tive filtering. Therefore, they do not use users annotations to directly compare users(with the problems discussed in Sect. 6.2.2), but to refine the collaborative filtering

    algorithm. Two users are considered similar not only if they rated the items similarly,

    but also if they tagged the items similarly. The idea is that similar tags reveal that

    the users have similar cognitions over these items and similar


Recommended