Enterprise Search con Liferay 6.1

Post on 06-Dec-2014

4,622 views 3 download

Tags:

description

Presentazione di Marcello Torriani e Filippo Frignocca al Liferay Symposium italiano 2012

transcript

Advanced Search & IndexingIn Liferay 6.1

External sources integration

Filippo Frignocca | Marcello TorrianiAriadne

November 16th, 2012

Intro & Agenda

Search in intranet portals Business requirements: clustering,

advanced indexing configuration, external sources integration

Ariadne experience Liferay 6.1 indexing and searching

solutions Our solution: Liferay 6.1, Solr 4.0,

ManifoldCF

Search in intranet portal

Large amount of structered and unstructured content

Content can be anywhere: DB, filesystem, repository…

Users have to find information quickly and efficiently

Business requirements

Indexing & Searching in an enterprise environment means:

Seamless integration in SERP External sources indexing Clustered environment Scalability Advanced indexing features needs

Our past work in Ariadne

On Liferay 5.x Deep customization: a new Web CMS Integration of a Content Repository ElasticSearch Integration of external contents: Google Search Appliance

Liferay 6.1 and search

Opensearch Faceted search Document & Media search:

Liferay 6.1 supports searching both local and remote repositories.

The search capabilities, however, are limited to the facilities that is provided by the protocol

Solr 4

Schema aware Search functions Plug handlers and function Cache Replication Statistics

SolrCloud

High scale, fault tolerant, distributed indexing and search

Collections (indexes) may span through different SolrCores

Zookeeper manage cluster configuration and coordination

ManifoldCF

Crawler framework for connecting source content repositories (Sharepoint, Documentum, web sites, shares,...) to indexers (Solr, Elasticsearch, …)

Resilient, incremental Integrated with document security Connectors:

● Authority connectors: map user name to access to a repository

● Repository connectors: fetch documents from a repository

● Output connectors: push documents to (an indexer)

ManifoldCF input connectors

Portal Assets and external CMIS-compliant repositories

UNIFIED INDEX(Solr)

External sources crawling(ManifoldCF)

Liferay 6.1 & Solr 4

Liferay integrates Solr 1.4 Upgrade to 4.0:

● Index Schema Definition

● Solr plugin fix to handle void snippet results

● Solr plugin configuration to connect to external Index Engine/Cluster

Schema Definition for SolR 4.0 – Liferay 6.1 integration

primary key: uid

Fix to handle results with matches in all metadata, not only Content or Title

External Index Configuration

ManifoldCF output connector

ManifoldCF repository connector

ManifoldCF job scheduler

ManifoldCF job monitor

ManifoldCF output configuration

Custom plugin

To show external source search results in the Liferay Faceted Search Portlet we need to:

Define one (or more) Asset Model Class Configure a custom Indexer Class Implement proper Hook to override the result page jsps Properly configure the Faceted Search Portlet

Indexed and Model classes

Faceted Search jsp Hook

Faceted Search Configuration

Faceted Search Results

Work in progress

Indexing and searching protected resources Optimize External Asset Management Tags and Categories consistence check Optimize Index schema mapping Entry detail link management for external results

Thank you!

www.ariadne.it

info@ariadne.it

@ariadnesrl

MarcelloTorrianitorriani@ariadne.it

FilippoFrignoccafrignocca@ariadne.it