Lecture Notes in Computer Science 7224
Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board
David Hutchison Lancaster University, UK
Takeo Kanade Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler University of Surrey, Guildford, UK
Jon M. Kleinberg Cornell University, Ithaca, NY, USA
Alfred Kobsa University of California, Irvine, CA, USA
Friedemann Mattern ETH Zurich, Switzerland
John C. Mitchell Stanford University, CA, USA
Moni Naor Weizmann Institute of Science, Rehovot, Israel
Oscar Nierstrasz University of Bern, Switzerland
C. Pandu Rangan Indian Institute of Technology, Madras, India
Bernhard Steffen TU Dortmund University, Germany
Madhu Sudan Microsoft Research, Cambridge, MA, USA
Demetri Terzopoulos University of California, Los Angeles, CA, USA
Doug Tygar University of California, Berkeley, CA, USA
Gerhard Weikum Max Planck Institute for Informatics, Saarbruecken, Germany
Ricardo Baeza-Yates, Arjen P. de Vries, Hugo Zaragoza, B. Barla Cambazoglu, Vanessa Murdock, Ronny Lempel, Fabrizio Silvestri (Eds.)
Advances in Information Retrieval
34th European Conference on IR Research, ECIR 2012
Barcelona, Spain, April 1-5, 2012
Proceedings
Volume Editors
Ricardo Baeza-Yates
B. Barla Cambazoglu
Vanessa Murdock
Yahoo! Research, Barcelona, Spain
E-mail: [email protected]; {barla,vmurdock}@yahoo-inc.com

Arjen P. de Vries
CWI, Amsterdam, The Netherlands
E-mail: [email protected]

Hugo Zaragoza
Websays, Barcelona, Spain
E-mail: [email protected]

Ronny Lempel
Yahoo! Labs, Haifa, Israel
E-mail: [email protected]

Fabrizio Silvestri
ISTI-CNR, Pisa, Italy
E-mail: [email protected]
ISSN 0302-9743 e-ISSN 1611-3349
ISBN 978-3-642-28996-5 e-ISBN 978-3-642-28997-2
DOI 10.1007/978-3-642-28997-2
Springer Heidelberg Dordrecht London New York
Library of Congress Control Number: 2012933583
CR Subject Classification (1998): H.3, H.2, I.2, H.4, H.2.8, I.7, H.5
LNCS Sublibrary: SL 3 – Information Systems and Application, incl. Internet/Web and HCI
© Springer-Verlag Berlin Heidelberg 2012
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law.
The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India
Printed on acid-free paper
Springer is part of Springer Science+Business Media (www.springer.com)
Preface
These proceedings contain the high-quality papers, posters, and demonstrations presented at the 34th European Conference on Information Retrieval (ECIR 2012), held during April 1–5, 2012, in Barcelona, Spain. The conference was jointly organized by Yahoo! Research Barcelona, Universitat Pompeu Fabra and the Barcelona Media Foundation. It was supported by the Information Retrieval Specialist Group at the British Computer Society (BCS-IRSG) and in cooperation with the Special Interest Group in Information Retrieval of the Association for Computing Machinery (ACM SIGIR).
ECIR 2012 received a total of 261 submissions across four categories: 163 full-paper submissions, 78 poster submissions, 11 demonstration submissions and 9 industry track submissions. Of these submissions, 66% were from Europe, 17% from Asia, 14% from America and 3% from the rest of the world. All submissions were reviewed, across all categories, by at least three members of a large international Program Committee to whom we express all our gratitude. Of the 167 full papers submitted to the main research and industry tracks, 37 were selected for oral presentation (a 22% acceptance rate). In addition, 28 posters and 7 demonstrations were accepted. The accepted contributions represent the state of the art in information retrieval and cover a diverse range of topics, including many novel applications that were of interest to all the audience.
ECIR 2012 also included two invited talks. The first talk was given by Paolo Boldi (University of Milan), who presented graph algorithms for Web search and social networks. The second talk was on the impact of usage data on Web search and was given by Yoelle Maarek (Yahoo! Labs Haifa). In addition, the conference included five tutorials (expert finding and entity search in the Web, question answering, quantum IR, music IR, and search experience), three workshops (task-based and aggregated search, IR over query sessions and searching for fun) and a student-mentoring program. We wish to thank Alvaro Barreiro, David Losada and Mounia Lalmas, respectively, for these tasks. We also want to thank the Organizing Committee, represented by Roi Blanco, Aristides Gionis and Mari-Carmen Marcos, especially the latter who was in charge of the local arrangements. Finally, special thanks to our sponsors, Google (platinum), Microsoft Research (gold) and Yahoo! Labs (gold).
January 2012
Ricardo Baeza-Yates
Arjen de Vries
Hugo Zaragoza
B. Barla Cambazoglu
Vanessa Murdock
Ronny Lempel
Fabrizio Silvestri
Organization
The conference was jointly organized by Yahoo! Research Barcelona, Universitat Pompeu Fabra and the Barcelona Media Foundation. It was supported by the Information Retrieval Specialist Group at the British Computer Society (BCS-IRSG) and in cooperation with the Special Interest Group in Information Retrieval of the Association for Computing Machinery (ACM SIGIR).
Conference Chair
Ricardo Baeza-Yates Yahoo! Research, Spain
Program Committee Co-chairs
Arjen de Vries Centrum Wiskunde & Informatica, The Netherlands
Hugo Zaragoza Websays, Spain
Poster Chair
B. Barla Cambazoglu Yahoo! Research, Spain
Demonstrations Chair
Vanessa Murdock Yahoo! Research, Spain
Industry Track Co-chairs
Ronny Lempel Yahoo! Labs, Israel
Fabrizio Silvestri ISTI-CNR, Italy
Local Arrangements Chair
Mari-Carmen Marcos Universitat Pompeu Fabra, Spain
Workshop Chair
David Losada Universidade de Santiago de Compostela, Spain
Tutorial Chair
Alvaro Barreiro Universidade da Coruna, Spain
Student Mentoring Chair
Mounia Lalmas Yahoo! Research, Spain
Proceedings Chair
B. Barla Cambazoglu Yahoo! Research, Spain
Registration Chair
Aristides Gionis Yahoo! Research, Spain
Advertising Chair
Roi Blanco Yahoo! Research, Spain
Accounting
María Laura Llapur Barcelona Media Foundation, Spain
Website
Eduardo Graells Universitat Pompeu Fabra, Spain
Conference System
Michele Trevisiol Universitat Pompeu Fabra, Spain
Official Email
Luca Chariandini Universitat Pompeu Fabra, Spain
Social Media
Ruth García Universitat Pompeu Fabra, Spain
Social Events
David Nettleton Universitat Pompeu Fabra, Spain
Georgina Ramírez Universitat Pompeu Fabra, Spain
Program Committee
Giambattista Amati Fondazione Ugo Bordoni, Italy
Massih-Reza Amini LIP6, France
Leif Azzopardi University of Glasgow, UK
Ricardo Baeza-Yates Yahoo! Research, Spain
Alvaro Barreiro University of A Coruna, Spain
Nicholas Belkin Rutgers University, USA
Bettina Berendt Katholieke Universiteit Leuven, Belgium
Roi Blanco Yahoo!, Spain
Wray Buntine NICTA, Australia
Claudio Carpineto Fondazione Ugo Bordoni, Italy
Carlos Castillo Yahoo!, Spain
Paul-Alexandru Chirita Adobe Systems Inc., Romania
Paul Clough University of Sheffield, UK
Fabio Crestani University of Lugano, Switzerland
W. Bruce Croft University of Massachusetts, USA
Arjen P. de Vries CWI, The Netherlands
Fernando Diaz Yahoo! Research, USA
Juan M. Fernandez-Luna University of Granada, Spain
Norbert Fuhr University of Duisburg-Essen, Germany
Eric Gaussier University J. Fourier/Grenoble 1, France
Julio Gonzalo UNED, Spain
Gregory Grefenstette Exalead, France
Cathal Gurrin Dublin City University, Ireland
Donna Harman NIST, USA
Djoerd Hiemstra University of Twente, The Netherlands
David A. Hull Google, USA
Gabriella Kazai Microsoft Research, UK
Ronny Lempel Yahoo! Labs, Israel
David E. Losada University of Santiago de Compostela, Spain
Dunja Mladenic Jozef Stefan Institute, Slovenia
Jan O. Pedersen Microsoft, USA
Fabrizio Sebastiani Consiglio Nazionale delle Ricerche, Italy
Fabrizio Silvestri ISTI-CNR, Italy
Alan Smeaton Dublin City University, Ireland
Emine Yilmaz Koc University, Turkey
Hugo Zaragoza Websays, Spain
Reviewers
Ahmet Aker University of Sheffield, UK
Elif Aktolga University of Massachusetts Amherst, USA
Omar Alonso Microsoft, USA
Ismail Sengor Altingovde L3S, Germany
Robin Aly University of Twente, The Netherlands
Avi Arampatzis Democritus University of Thrace, Greece
Jaime Arguello University of North Carolina at Chapel Hill, USA
Javed A. Aslam Northeastern University, USA
Cevdet Aykanat Bilkent University, Turkey
Krisztian Balog Norwegian University of Science and Technology, Norway
Barry Smyth University College Dublin, Ireland
Roberto Basili University of Rome, Tor Vergata, Italy
Srikanta Bedathur IIIT-Delhi, India
Michel Beigbeder Ecole des Mines de Saint-Etienne, France
Alejandro Bellogín Universidad Autonoma de Madrid, Spain
Klaus Berberich Max Planck Institute for Informatics, Germany
Toine Bogers RSLIS, Denmark
Gloria Bordogna CNR, Italy
Mohand Boughanem IRIT - UMR 5505, France
Marc Bron University of Amsterdam, The Netherlands
Peter Bruza Queensland University of Technology, Australia
Daragh Byrne Arizona State University, USA
Fidel Cacheda University of A Coruna, Spain
Fazli Can Bilkent University, Turkey
Mark Carman Monash University, Australia
Marc-Allen Cartright University of Massachusetts Amherst, USA
James Caverlee Texas A&M University, USA
Max Chevalier IRIT - UMR 5505, France
Kevyn Collins-Thompson Microsoft Research, USA
Bin Cui Peking University, China
Alfredo Cuzzocrea ICAR-CNR and University of Calabria, Italy
Na Dai Lehigh University, USA
Pablo de la Fuente Universidad de Valladolid, Spain
Maarten de Rijke University of Amsterdam, The Netherlands
Gianluca Demartini University of Fribourg, Switzerland
Giorgio Maria Di Nunzio University of Padua, Italy
Shuai Ding Polytechnic Institute of New York University, USA
Pavlos Efraimidis Democritus University of Thrace, Greece
David Craig Elsweiler University of Regensburg, Germany
Yi Fang Purdue University, USA
Henry Field University of Massachusetts Amherst, USA
Paul Ferguson Dublin City University, Ireland
Nicola Ferro University of Padua, Italy
Edward Fox Virginia Tech, USA
Ingo Frommholz University of Bedfordshire, UK
Patrick Gallinari University Pierre et Marie Curie, Paris 6, France
Giorgos Giannopoulos National Technical University of Athens, Greece
Ayse Goker City University London, UK
David Adam Grossman IIT, USA
Antonio Gulli Microsoft Bing, UK
Qi Guo Emory University, USA
Allan Hanbury Vienna University of Technology, Austria
Preben Hansen SICS - Swedish Institute of Computer Science, Sweden
Morgan Harvey University of Erlangen-Nuremberg, Germany
Claudia Hauff Delft University of Technology, The Netherlands
Jer Hayes IBM Dublin Research Lab, Ireland
Ben He Graduate University of Chinese Academy of Sciences, China
Jiyin He Centrum Wiskunde & Informatica, The Netherlands
Yulan He Open University, UK
Katja Hofmann University of Amsterdam, The Netherlands
Andreas Hotho University of Wurzburg, Germany
Yuexian Hou Tianjin University, China
Yunhua Hu Microsoft Research Asia, China
Gilles Hubert IRIT - University of Toulouse, France
Theo Huibers University of Twente, The Netherlands
Tereza Iofciu Xing AG, Germany
Jagadeesh Jagarlamudi University of Maryland, USA
Richard Johansson University of Gothenburg, Sweden
Frances Johnson Manchester Metropolitan University, UK
Hideo Joho University of Tsukuba, Japan
Kristiina Jokinen University of Helsinki, Finland
Simon Jonassen Norwegian University of Science and Technology, Norway
Gareth Jones Dublin City University, Ireland
Joemon M. Jose University of Glasgow, UK
Jaap Kamps University of Amsterdam, The Netherlands
Evangelos Kanoulas University of Sheffield, UK
Rianne Kaptein Oxyme, The Netherlands
Maryam Karimzadehgan University of Illinois at Urbana-Champaign, USA
Jussi Karlgren SICS, Sweden
Mostafa Keikha Universita della Svizzera Italiana, Switzerland
Liadh Kelly Dublin City University, Ireland
Marijn Koolen University of Amsterdam, The Netherlands
Manolis Koubarakis National and Kapodistrian University of Athens, Greece
Udo Kruschwitz University of Essex, UK
Tayfun Kucukyilmaz Bilkent University, Turkey
Jerome Kunegis University of Koblenz–Landau, Germany
Oren Kurland Technion University, Israel
Dmitry Lagun Emory University, USA
James Lanagan Technicolor, France
Monica Angela Landoni USI University of Lugano, Switzerland
Birger Larsen Royal School of Library and Information Science, Denmark
Fotis Lazarinis University of Western Greece, Greece
Hyowon Lee Dublin City University, Ireland
Kyumin Lee Texas A&M University, USA
Wang-Chien Lee Pennsylvania State University, USA
Johannes Leveling Dublin City University, Ireland
Xuelong Li Chinese Academy of Sciences, China
Chao Liu Microsoft Research, USA
Elena Lloret University of Alicante, Spain
Yuanhua Lv University of Illinois - Urbana Champaign, USA
Andrew MacFarlane City University London, UK
Marco Maggini University of Siena, Italy
Thomas Mandl University of Hildesheim, Germany
Mauricio Marin Yahoo! Research Latin America, Chile
Yosi Mass IBM Research, Israel
Edgar Meij University of Amsterdam, The Netherlands
Wagner Meira Jr. Universidade Federal de Minas Gerais, Brazil
Massimo Melucci University of Padua, Italy
Marcelo Mendoza Yahoo! Research, Chile
Donald Metzler University of Southern California, USA
Alessandro Micarelli “Roma Tre” University, Italy
Marie-Francine Moens Katholieke Universiteit Leuven, Belgium
Henning Muller University of Applied Sciences Western Switzerland, Switzerland
Vanessa Murdock Yahoo! Research, Spain
Wolfgang Nejdl University of Hannover, Germany
Jian-Yun Nie Universite de Montreal, Canada
Neil O’Hare Dublin City University, Ireland
Michael O’Mahony University College Dublin, Ireland
Michael Philip Oakes University of Sunderland, UK
Iadh Ounis University of Glasgow, UK
Monica Paramita University of Sheffield, UK
Gabriella Pasi Universita degli Studi di Milano Bicocca, Italy
Virgil Pavlu Northeastern University, USA
Ari Pirkola University of Tampere, Finland
Benjamin Piwowarski CNRS, France
Vassilis Plachouras Presans, France
Barbara Poblete University of Chile, Chile
Johan Pouwelse Delft University of Technology, The Netherlands
Georgina Ramirez Universitat Pompeu Fabra, Spain
Andreas Rauber Vienna University of Technology, Austria
Thomas Roelleke Queen Mary, University of London, UK
Dmitri Roussinov University of Strathclyde, UK
Stefan Rueger The Open University, UK
Ian Ruthven University of Strathclyde, UK
Rodrygo Santos University of Glasgow, UK
Markus Schedl Johannes Kepler University, Austria
Ralf Schenkel Saarland University and Max-Planck-Institut fur Informatik, Germany
Falk Scholer RMIT University, Australia
Florence Sedes Universite Paul Sabatier, France
Giovanni Semeraro University of Bari “Aldo Moro”, Italy
Jangwon Seo Google Inc., USA
Pavel Serdyukov Yandex, Russia
Jialie Shen Singapore Management University, Singapore
Milad Shokouhi Microsoft Research, UK
Mario J. Silva Instituto Superior Tecnico / INESC-ID, Portugal
Mark D. Smucker University of Waterloo, Canada
Vaclav Snasel VSB-Technical University of Ostrava, Czech Republic
Thomas Sodring HiOA, Norway
Min Song New Jersey Institute of Technology, USA
Yang Song Microsoft Research, USA
Amanda Spink Loughborough University, UK
L. Venkata Subramaniam IBM Research India, India
Krysta Marie Svore Microsoft Research, USA
Idan Szpektor Yahoo! Research, Israel
Oscar Tackstrom Swedish Institute of Computer Science, Sweden
Lynda Tamine-Lechani IRIT, France
Martin Theobald Max Planck Institute for Informatics, Germany
Bart Thomee Yahoo! Research, Spain
Anastasios Tombros Queen Mary University of London, UK
Thanh Tran Karlsruher Institut fur Technologie, Germany
Dolf Trieschnigg University of Twente, The Netherlands
Manos Tsagkias University of Amsterdam, The Netherlands
Ming-Feng Tsai National Chengchi University, Taiwan, R.O.C.
Theodora Tsikrika University of Applied Sciences Western Switzerland, Switzerland
Ata Turk Bilkent University, Turkey
Andrew Turpin University of Melbourne, Australia
Pertti Vakkari University of Tampere, Finland
Emre Varol Bilkent University, Turkey
Sergei Vassilvitskii Yahoo! Research, USA
Olga Vechtomova University of Waterloo, Canada
Sumithra Velupillai Stockholm University, Sweden
Suzan Verberne Radboud University Nijmegen, The Netherlands
Stefanos Vrochidis CERTH-ITI, Greece
Xiaojun Wan Peking University, China
Fei Wang IBM, USA
Kai Wang Institute for Infocomm Research, Singapore
Lidan Wang University of Maryland, USA
Wouter Weerkamp University of Amsterdam, The Netherlands
Ryen William White Microsoft Research, USA
Kam-Fai Wong The Chinese University of Hong Kong, Hong Kong
Jun Xu Microsoft Research, China
Shuang-Hong Yang Georgia Institute of Technology, USA
Tao Yang UCSB and Ask.com, USA
Xing Yi Yahoo! Labs, USA
ChengXiang Zhai University of Illinois at Urbana-Champaign, USA
Dan Zhang Purdue University, USA
Dell Zhang Birkbeck, University of London, UK
Lanbo Zhang UC Santa Cruz, USA
Peng Zhang Robert Gordon University, UK
Dong Zhou Trinity College Dublin, Ireland
Ke Zhou University of Glasgow, UK
Guido Zuccon CSIRO, Australia
Industry Track Program Committee
Sihem Amer-Yahia Qatar Computing Research Institute, Qatar
Ron Bekkerman LinkedIn, USA
David Carmel IBM Research, Israel
Paolo Ferragina University of Pisa, Italy
Antonio Gulli Microsoft, UK
Rosie Jones Akamai, USA
Irwin King Chinese University of Hong Kong, Hong Kong
Udo Kruschwitz University of Essex, UK
Christina Lioma University of Stuttgart, Germany
Gilad Mishne Twitter, USA
Raffaele Perego ISTI-CNR, Italy
Diego Puppin Google, USA
Arjen de Vries Delft University of Technology, The Netherlands
Workshops Program Committee
Enrique Alfonseca Google, Switzerland
Omar Alonso Microsoft Bing, USA
Pablo Castells Universidad Autonoma de Madrid, Spain
Norbert Fuhr University of Duisburg-Essen, Germany
Cathal Gurrin Dublin City University, Ireland
Djoerd Hiemstra University of Twente, The Netherlands
Jian-Yun Nie University of Montreal, Canada
Andrew Trotman University of Otago, New Zealand
Justin Zobel University of Melbourne, Australia
Tutorials Program Committee
Gianni Amati Fondazione Ugo Bordoni, Italy
Mouhand Boughanem Paul Sabatier University, France
Fabio Crestani University of Lugano, Switzerland
W. Bruce Croft University of Massachusetts, USA
Eric Gaussier Joseph Fourier University, France
Evangelos Kanoulas The University of Sheffield, UK
Raffaele Perego ISTI-CNR, Italy
Yi Zhang University of California, Santa Cruz, USA
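Platinum Sponsor
Google
Gold Sponsors
Microsoft Research
Yahoo! Labs
Organizers
Yahoo! Research Barcelona
Universitat Pompeu Fabra
Barcelona Media Foundation
In Cooperation With
Special Interest Group in Information Retrieval of the Association for Computing Machinery (ACM SIGIR)
Supporter
Information Retrieval Specialist Group at the British Computer Society (BCS-IRSG)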
Table of Contents
Query Representation
Explaining Query Modifications: An Alternative Interpretation of Term Addition and Removal . . . 1
Vera Hollink, Jiyin He, and Arjen de Vries
Modeling Transactional Queries via Templates . . . 13
Edward Bortnikov, Pinar Donmez, Amit Kagian, and Ronny Lempel
Exploring Query Patterns in Email Search . . . 25
Morgan Harvey and David Elsweiler
Interactive Search Support for Difficult Web Queries . . . 37
Abdigani Diriye, Giridhar Kumaran, and Jeff Huang
Blog and Online-Community Search
Predicting the Future Impact of News Events . . . 50
Julien Gaugaz, Patrick Siehndel, Gianluca Demartini, Tereza Iofciu, Mihai Georgescu, and Nicola Henze
Detection of News Feeds Items Appropriate for Children . . . 63
Tamara Polajnar, Richard Glassey, and Leif Azzopardi
Comparing Tweets and Tags for URLs . . . 73
Morgan Harvey, Mark Carman, and David Elsweiler
Geo-Location Estimation of Flickr Images: Social Web Based Enrichment . . . 85
Claudia Hauff and Geert-Jan Houben
Semi-structured Retrieval
A Field Relevance Model for Structured Document Retrieval . . . 97
Jin Young Kim and W. Bruce Croft
Relation Based Term Weighting Regularization . . . 109
Hao Wu and Hui Fang
A New Approach to Answerer Recommendation in Community Question Answering Services . . . 121
Zhenlei Yan and Jie Zhou
On the Modeling of Entities for Ad-Hoc Entity Search in the Web of Data . . . 133
Robert Neumayer, Krisztian Balog, and Kjetil Nørvag
Result Disambiguation in Web People Search . . . 146
Richard Berendsen, Bogomil Kovachev, Evangelia-Paraskevi Nastou, Maarten de Rijke, and Wouter Weerkamp
Evaluation
On Smoothing Average Precision . . . 158
Stephen Robertson
New Metrics for Meaningful Evaluation of Informally Structured Speech Retrieval . . . 170
Maria Eskevich, Walid Magdy, and Gareth J.F. Jones
On Aggregating Labels from Multiple Crowd Workers to Infer Relevance of Documents . . . 182
Mehdi Hosseini, Ingemar J. Cox, Natasa Milic-Frayling, Gabriella Kazai, and Vishwa Vinay
Applications
How Random Walks Can Help Tourism . . . 195
Claudio Lucchese, Raffaele Perego, Fabrizio Silvestri, Hossein Vahabi, and Rossano Venturini
Retrieving Candidate Plagiarised Documents Using Query Expansion . . . 207
Rao Muhammad Adeel Nawab, Mark Stevenson, and Paul Clough
Reliability Prediction of Webpages in the Medical Domain . . . 219
Parikshit Sondhi, V.G. Vinod Vydiswaran, and ChengXiang Zhai
Automatic Foldering of Email Messages: A Combination Approach . . . 232
Tony Tam, Artur Ferreira, and Andre Lourenco
Retrieval Models
A Log-Logistic Model-Based Interpretation of TF Normalization of BM25 . . . 244
Yuanhua Lv and ChengXiang Zhai
Score Transformation in Linear Combination for Multi-criteria Relevance Ranking . . . 256
Shima Gerani, ChengXiang Zhai, and Fabio Crestani
Axiomatic Analysis of Translation Language Model for Information Retrieval . . . 268
Maryam Karimzadehgan and ChengXiang Zhai
An Information-Based Cross-Language Information Retrieval Model . . . 281
Bo Li and Eric Gaussier
Extended Expectation Maximization for Inferring Score Distributions . . . 293
Keshi Dai, Virgil Pavlu, Evangelos Kanoulas, and Javed A. Aslam
Top-k Retrieval Using Facility Location Analysis . . . 305
Guido Zuccon, Leif Azzopardi, Dell Zhang, and Jun Wang
Image and Video Retrieval
An Interactive Paper and Digital Pen Interface for Query-by-Sketch Image Retrieval . . . 317
Roman Kreuzer, Michael Springmann, Ihab Al Kabary, and Heiko Schuldt
Image Abstraction in Crossmedia Retrieval for Text Illustration . . . 329
Filipe Coelho and Cristina Ribeiro
A Latent Variable Ranking Model for Content-Based Retrieval . . . 340
Ariadna Quattoni, Xavier Carreras, and Antonio Torralba
Text and Content Classification, Categorisation, Clustering
Language Modelling of Constraints for Text Clustering . . . 352
Javier Parapar and Alvaro Barreiro
A Framework for Unsupervised Spam Detection in Social Networking Sites . . . 364
Maarten Bosma, Edgar Meij, and Wouter Weerkamp
Classification of Short Texts by Deploying Topical Annotations . . . 376
Daniele Vitale, Paolo Ferragina, and Ugo Scaiella
Cluster Labeling for Multilingual Scatter/Gather Using Comparable Corpora . . . 388
Goutham Tholpadi, Mrinal Kanti Das, Chiranjib Bhattacharyya, and Shirish Shevade
Systems Efficiency
Adaptive Time-to-Live Strategies for Query Result Caching in Web Search Engines . . . 401
Sadiye Alici, Ismail Sengor Altingovde, Rifat Ozcan, B. Barla Cambazoglu, and Ozgur Ulusoy
Intra-query Concurrent Pipelined Processing for Distributed Full-Text Retrieval . . . 413
Simon Jonassen and Svein Erik Bratsberg
Industry Track
Usefulness of Sentiment Analysis . . . 426
Jussi Karlgren, Magnus Sahlgren, Fredrik Olsson, Fredrik Espinoza, and Ola Hamfors
Modeling Static Caching in Web Search Engines . . . 436
Ricardo Baeza-Yates and Simon Jonassen
Posters
Integrating Interactive Visualizations in the Search Process of Digital Libraries and IR Systems . . . 447
Daniel Hienert, Frank Sawitzki, Philipp Schaer, and Philipp Mayr
On Theoretically Valid Score Distributions in Information Retrieval . . . 451
Ronan Cummins and Colm O’Riordan
Adaptive Temporal Query Modeling . . . 455
Maria-Hendrike Peetz, Edgar Meij, Maarten de Rijke, and Wouter Weerkamp
The Design of a Visual History Tool to Help Users Refind Information within a Website . . . 459
Trien V. Do and Roy A. Ruddle
Analyzing the Polarity of Opinionated Queries . . . 463
Sergiu Chelaru, Ismail Sengor Altingovde, and Stefan Siersdorfer
Semi-automatic Document Classification: Exploiting Document Difficulty . . . 468
Miguel Martinez-Alvarez, Sirvan Yahyaei, and Thomas Roelleke
Investigating Summarization Techniques for Geo-Tagged Image Indexing . . . 472
Ahmet Aker, Xin Fan, Mark Sanderson, and Robert Gaizauskas
Handling OOV Words in Indian-language – English CLIR . . . 476
Parin Chheda, Manaal Faruqui, and Pabitra Mitra
Using a Medical Thesaurus to Predict Query Difficulty . . . 480
Florian Boudin, Jian-Yun Nie, and Martin Dawes
Studying a Personality Coreference Network in a News Stories Photo Collection . . . 485
Jose Devezas, Filipe Coelho, Sergio Nunes, and Cristina Ribeiro
Phrase Pair Classification for Identifying Subtopics . . . 489
Sujatha Das, Prasenjit Mitra, and C. Lee Giles
Full and Mini-batch Clustering of News Articles with Star-EM . . . 494
Matthias Galle and Jean-Michel Renders
Assessing and Predicting Vertical Intent for Web Queries . . . 499
Ke Zhou, Ronan Cummins, Martin Halvey, Mounia Lalmas, and Joemon M. Jose
Predicting IMDB Movie Ratings Using Social Media . . . 503
Andrei Oghina, Mathias Breuss, Manos Tsagkias, and Maarten de Rijke
Squeezing the Ensemble Pruning: Faster and More Accurate Categorization for News Portals . . . 508
Cagri Toraman and Fazli Can
A General Framework for People Retrieval in Social Media with Multiple Roles . . . 512
Amin Mantrach and Jean-Michel Renders
Analysis of Query Reformulations in a Search Engine of a Local Web Site . . . 517
M-Dyaa Albakour, Udo Kruschwitz, Nikolaos Nanas, Ibrahim Adeyanju, Dawei Song, Maria Fasli, and Anne De Roeck
Temporal Pseudo-relevance Feedback in Microblog Retrieval . . . 522
Stewart Whiting, Iraklis A. Klampanos, and Joemon M. Jose
Learning Adaptive Domain Models from Click Data to Bootstrap Interactive Web Search . . . 527
Deirdre Lungley, Udo Kruschwitz, and Dawei Song
A Little Interaction Can Go a Long Way: Enriching the Query Formulation Process . . . 531
Abdigani Diriye, Anastasios Tombros, and Ann Blandford
Learning to Rank from Relevance Feedback for e-Discovery . . . 535
Peter Lubell-Doughtie and Katja Hofmann
When Simple is (more than) Good Enough: Effective Semantic Search with (almost) no Semantics . . . 540
Robert Neumayer, Krisztian Balog, and Kjetil Nørvag
Evaluating Personal Information Retrieval . . . 544
Liadh Kelly, Paul Bunbury, and Gareth J.F. Jones
Applying Power Graph Analysis to Weighted Graphs . . . 548
Niels Bloom
An Investigation of Term Weighting Approaches for Microblog Retrieval . . . 552
Paul Ferguson, Neil O’Hare, James Lanagan, Owen Phelan, and Kevin McCarthy
On the Size of Full Element-Indexes for XML Keyword Search . . . 556
Duygu Atilgan, Ismail Sengor Altingovde, and Ozgur Ulusoy
Combining Probabilistic Language Models for Aspect-Based Sentiment Retrieval . . . 561
Lisette García-Moya, Henry Anaya-Sanchez, and Rafael Berlanga-Llavori
In Praise of Laziness: A Lazy Strategy for Web Information Extraction . . . 565
Rifat Ozcan, Ismail Sengor Altingovde, and Ozgur Ulusoy
Demos
LiveTweet: Monitoring and Predicting Interesting Microblog Posts . . . 569
Arifah Che Alhadi, Thomas Gottron, Jerome Kunegis, and Nasir Naveed
A User Interface for Query-by-Sketch Based Image Retrieval with Color Sketches . . . 571
Ivan Giangreco, Michael Springmann, Ihab Al Kabary, and Heiko Schuldt
Crisees: Real-Time Monitoring of Social Media Streams to Support Crisis Management . . . 573
David Maxwell, Stefan Raue, Leif Azzopardi, Chris Johnson, and Sarah Oates
A Mailbox Search Engine Using Query Multi-modal Expansion and Community-Based Smoothing . . . 576
Amin Mantrach and Jean-Michel Renders
EmSe: Supporting Children’s Information Needs within a Hospital Environment . . . 578
Leif Azzopardi, Doug Dowie, Sergio Duarte, Carsten Eickhoff, Richard Glassey, Karl Gyllstrom, Djoerd Hiemstra, Franciska de Jong, Frea Kruisinga, Kelly Marshall, Sien Moens, Tamara Polajnar, Frans van der Sluis, and Arjen de Vries
Retro: Time-Based Exploration of Product Reviews . . . 581
Jannik Strotgen, Omar Alonso, and Michael Gertz
Querium: A Session-Based Collaborative Search System . . . 583
Abdigani Diriye and Gene Golovchinsky
Author Index . . . 585