Grouper: A Dynamic Clustering Interface to Web Search Results Fatih Çalı ş ır Tolga Çekiç Elif...

Post on 14-Dec-2015

227 views 2 download

Tags:

transcript

Grouper: A Dynamic Clustering Interface

to Web Search Results

Fatih ÇalışırTolga Çekiç

Elif DalAcar Erdinç

07.03.2013

1/9

Introduction

Ranked List vs. Clustering

Pre-retrieval Clustering vs. Post-retrieval Clustering

2/9

Methodology

STC (Suffix Tree Clustering)

Interface

Cluster Representation

3/9

STC

Three steps: Document cleaning Identifying base clusters using a suffix tree Merging base clusters

4/9

STC

Characteristics of STC: Incremental Overlapping Unspecified number of clusters Clustering based on phrases Robust to noise

5/9

Interface

6/9

Interface

7/9

Cluster Representation

Selecting the phrases for clustering representation: Word Overlapping Sub- and Super-Strings Most-general phrase with low coverage

8/9

Thanks for Listening

9/9