by Semantic Web Technologies Social Media Monitoring … - Fabian Gasser... · Social Media...

Post on 13-Jun-2018

214 views 0 download

transcript

Social Media Monitoring Tools enhanced by Semantic Web Technologies

Presentation of the Master ThesisFabian Gasser

Contents

1. Main Concepts2. Challenges3. Research Question4. Social Media Monitoring Tool Application Landscape5. Comparison of the Semantic Approach6. Prototype - Semantic Social Media Monitoring Tool7. Use Cases8. Conclusion

Main Concepts:Social Media for Brands

Many new challenges through social media (SM) and web 2.0.Most important factors [1]:

○ Confidence○ Time○ Amount

Companies want to achieve benefits for their business:

Source: [2]

Main Concepts:Social Media Monitoring Tools… “enable companies to gather, categorize, analyze, monitor and possibly engage in online conversations about companies, brands, products, competitors, industry and other topics across different social media platforms.” [3, p. 15]

Main Concepts:Semantic Web Technologies● Semantic web (SW) technologies are

intended as “an extension of the World Wide Web that allows computers to intelligently search, combine and process Web content based on the meaning that this content has to humans”. [4, p. 11]

● Ontologies describe an area of concern.

● Resource description framework (RDF) for the description of resources and creation of ontologies via relations(triples):○ subject - predicate - objects (- context)

Challenges (1/2)

Many different Tools and Providers

Challenges (2/2)

Possible reasons:● Different needs from companies and their departments.● Different tools focus on different abilities.● Emerging market.

Challenges:● The tool should fit to many different applications.● The buyers should have high benefits.● Development and adaptation efforts should not be too extensive.

Research Question (1/2)Research Question:Can SW technologies help to overcome existing challenges and limitations of traditional SMM tools?

Proceeding:1. Elaboration of a SMM tool application landscape.

2. Comparison of the semantic approach with traditional approaches.

3. Development of a semantic SMM tool prototype.

4. Evaluation within two use cases.

Social Media Monitoring Tool Application Landscape

main impact of SW technologies

SMM Tool Application Landscape:Social Media Data GatheringTraditional SMM tools:● Own data sets ⇨ e.g. csv, relational database● Open source repositories ⇨ e.g. World Bank, Wikipedia● Data accessible via tools ⇨ free (Google Trends) or commercial (Gnip)● Data accessible via APIs ⇨ e.g. Social networks, Wikimedia, RSS feeds, blogs

Semantic SMM tools:● Own data sets can be represented in ontologies● Open source repositories ⇨ RDF dumps from e.g. Wikidata (dbpedia)● SPARQL-endpoints for querying ⇨ e.g. from participants of the LOD2 project

SMM Tool Application Landscape:Data Preprocessing (1/5)

1. Cleaning data (missing, incorrect, inconsistent data)

2. Structuring and tagging3. Storage

SMM Tool Application Landscape:Data Preprocessing (2/5)

Most important information requirements:● Categories and topics● Main phrases and keywords (and connections)● Sentiments

SMM Tool Application Landscape:Data Preprocessing (3/5)to 2. Structuring and tagging:➢ in traditional tools [5]:

○ symbolic/linguistic approaches → rule sets○ statistical/empirical approaches → training data○ combined approaches

SMM Tool Application Landscape:Data Preprocessing (4/5)

➢ in semantic tools

Source: [6]

SMM Tool Application Landscape:Data Preprocessing (5/5)1. Cleaning data2. Structuring and tagging3. Storage:

➢ in traditional tools: flat files, relational databases, NoSQL databases

➢ in semantic tools: focus on triple stores

Comparison of the Semantic SMM Tool Approach

Comparison focus on:

● additional abilities

● additional requirements

Prototype - Semantic Social Media Monitoring Tool

Use Cases

1. Tourism domain:➢ Identification of the needs against a SMM

tool.

2. Television producer:➢ Application of the prototype to one of their

social networking channels.

Use Cases:1. Tourism DomainMain goals on SM:1. Attract customers outside of the busy seasons.2. Increase reach and awareness of the hotel

(especially in foreign countries, e.g. Russia,France) [8].

3. Ensure that no claims are overseen.

Findings:1. Just specific topics are relevant (e.g. skiing or hiking news, claims about

rooms and facilities).2. Multiple languages should be supported.3. Bad sentiments/claims should be identified together with their topics (e.g.

to check their relevance).

Data source: [7]

Use Cases:2. Television ProducerData extracted from social networking channel:● over 500 posts and 1,800 comments

Ontologies used in the annotation pipeline:● DBpedia (remote via DBpedia Spotlight)● Consumer Electronics Ontology (CEO)

2,118 instances extracted:● CEO: 603 hits (belonging to 48 different concepts)● DBpedia: 1,515 hits (belonging to 621 different concepts)

Use Cases:2. Television Producer (1/2)

Use Cases:Application of a semantic SMM Tools - Evaluation

Source of the main application fields of SMM tools: [1]

Agenda

++ good results, important

+ good results, not so important

0 no impact/not relevant

- poor results, not so important

- - poor results, important

Conclusion● Companies want to monitor different information and

need different features.● Adaptations in traditional approaches require the

creation of comprehensive rule sets or training corpora.● Ontologies allow to flexibly adapt the tool and to localize

the most interesting topics in different languages.● More generic disambiguation techniques are needed (e.

g. through evaluating the annotations from related posts and comments).

Thank you!

Questions?

Sources:[1] Kasper, H., Stuttgart Fraunhofer-Institut für Arbeitswirtschaft und Organisation: Marktstudie Social Media Monitoring

Tools: IT-Lösungen zur Beobachtung und Analyse unternehmensstrategisch relevanter Informationen im Internet. Fraunhofer-Verlag (2010)

[2] Etlinger, S., Li, C.: A framework for social analytics. Altimeter Group. USA. Published on 10, 2011 (2011)

[3] Milic, L.: Social media monitoring tools and services report. Tech. rep., Ideya Business and Marketing Consultancy (2013)

[4] Hitzler, P., Krötzsch, M., Rudolph, S.: Foundations of Semantic Web Technologies. Chapman & Hall/CRC (2009)

[5] Jackson, P., Moulinier, I.: Natural language processing for online applications: Text retrieval, extraction and categorization, vol. 5. John Benjamins Publishing (2007)

[6] Wimalasuriya, D.C., Dou, D.: Ontology-based information extraction: An introduction and a survey of current approaches. Journal of Information Science (2010)

[7] Landesinstitut für Statistik ASTAT - Datenbanken - Self Service (2015), http://www.provinz.bz.it/astat/de/service/daten-online.asp

[8] Agentur Südtirol Marketing: Mehr internationale gäste und stimmige produktentwicklung für südtirol (2014), http://www.smg.bz.it/de/unternehmen/presse/mehr-internationale-gaeste-und-stimmige-produktentwicklung-fuer-suedtirol