Date post: | 28-Nov-2014 |
Category: |
Technology |
Upload: | rinky25 |
View: | 3,195 times |
Download: | 2 times |
Marketplace Overview: Text Analytics Vendor Options
Nick PatienceResearch Director, Information Management
The 451 Group
Choosing a vendor: things to consider
► YOUR REQUIREMENTS� Corpus size and growth
� Scalability
� On-site vs. SaaS
� Languages
� Interoperability with existing systems
� Compatibility with future tech (OWL, RDF, XML, etc)
Choosing a vendor: things to consider
► VENDOR ISSUES� Are there other
customers in your specialty?
� Viability of vendor
� Willingness for pilot or proof of concept
� Help available for configuration and installation
Issues affecting the text analytics market
1. Mergers and acquisitions
2. Regulatory mandates (FRCP, SarbOx)
3. On-premise licensing or SaaS
4. Economic uncertainty – discretionary or must-have?
5. Does a market even exist?
M&A to Date
Boost Sharepoint search1/08$1.24bMicrosoftFAST
Customer buying supplier4/07$25M*ReutersClearforest
Need to understand text5/07$76MBiz ObjectsInxight
E-discovery10/07$158MIron MountainStratify
Acquire own text analytics3/08$10-15m*SAS InstituteTeragram
Why?WhenDeal ValueAcquirerTarget
*451 estimate; Source: 451 M&A KnowledgeBase
►eDiscovery
Business Drivers
December 1, 2006:
Effective date of the Electronic Discovery Amendments to the Federal Rules of Civil Procedure
Business Drivers
►Electronic Publishing
“There will be no media consumption left in ten years that is not delivered over an IP network. There will be no newspapers, no magazines that are delivered in paper form. Everything gets delivered in an electronic form.”
Steve Ballmer, CEO Microsoft, June 6, 2008
Business Drivers
► Security / Fraud Detection / Risk Mgmt.
Market Map – End of 2008
Govt/Military Intelligence
Pharma & Life Sciences
Early Warning (Mfg)
Banking & Insurance
Media & Publishing
SAP [Inxight], IBM Cognos, SAS, SPSS, Attensity, Autonomy, Infonic
Temis, IBM, SPSS
SPSS, SAS, Attensity
IBM, SAS, SPSS, Autonomy, FAST, Megaputer
FAST, Temis, Nstein, Infonic,Autonomy, ClearForest, Lexalytics
Market Map – End of 2008, cont.
Market Research & Surveys
Customer Analytics
Business Intelligence
Security
General OEM
SPSS, SAS
SAS, SPSS, Autonomy, Attensity, IXReveal
SAP [Inxight], Cognos, Clarabridge
Autonomy, IBM, Cyveillance
Basis, Lexalytics, SAS [Teragram]
Records Management IBM
►Travelocity►Whirlpool►JetBlue
the Attensity Text Analytics suite
►Law enforcement►Travel and hospitality
►Intelligence►Customer analytics
Key Customers:
►Auto-categorization
►Anaphora resolution
►Output in OWL ontology language
►Statistical extraction
Base: Palo Alto, CA Funding: $28m venture capital
Key verticals:
�SaaS
►Avg deal size: $250,000 before services
►“Exhaustive extraction”
►Targeted extraction
�On premise: Windows, Linux
Founded: 2000
►Halliburton►Gillette
►Standard & Poors►Cisco►Sony
►Banking and insurance
►Media and publishing
►Law
►Government / military intelligence
►Security
►Customer analytics
Key Customers:
►“Conceptual retrieval”
►Automatic taxonomy generation
Base: Cambridge, UK Funding: Public
Key verticals:
►Avg deal size: not available
►Automatic categorization
�SaaS: XXXX�On Premise: Windows, Linux, Solaris, AIX
IDOL Server 7
Founded: 1996
►HP►Yahoo
►Siebel►FAST
►Name matching
►Name Translation
►Multilingual text analytics
►Language identification
►Entity extraction
►Google►Oracle
►General OEM
►Government / military intelligence
►Commercial Search Engines
Key Customers:
Base: Cambridge, MA Funding: < $10 million, In-Q-Tel
Key verticals:
►Avg deal size: $250-300,000
�SaaS: XXXXXXX�On Premise: Windows, Linux, Solaris,
Rosette Linguistics Platform
Founded: 1995
�SaaS: XXXXXX�On premise: Windows, Solaris, Linux, HPUX, UIX
►Entity extraction
►Document-level classification
►Document summarization
►32 languages
►Segmentation
►Stemming
►Part-of-speech tagging
►Federal Agencies (DOA, DAA, DHS)
►OEM: SAS, IBM, Oracle
►Business intelligence►Government & military intelligence
Key Customers:
Base: Walldorf, Germany Funding: Public
Key verticals:
►Avg deal size: $$$$
BusinessObjects Text Analysis
Founded: 1972
►BI-tool friendly►Categorization
►Gaylord Hotels►Intuit►H&R Block
►Business intelligence
Key Customers:
Base: Reston, VA Funding: $10.2m, venture capital
Key verticals:
�SaaS
►Avg deal size: $150-300,000, $10,000 / month SaaS
�On-premise: ???????
Content Mining Platform
Founded: 2005
Calais
►Air Force
►Entity, fact and event extraction
►Packaged extraction modules
►Statistical and semantic tagging
►Tagging concepts
►Categorization
►Semantic tagging
►Elsevier►Dow Jones
►Media and publishing
Key Customers:
Base: Waltham, MA Funding: Public
Key verticals:
�SaaS: available
►Avg deal size: Not available
�On Premise
Founded: 1998
�SaaS: XXXXXXX�On Premise: Windows, Linux, HP UX, Solaris, UIX
►Autotrader.com
►Thesaurus
►Phrase detection
►Spell-checking
►Anti-phrasing
►Language detection
►Lemmatization
►Synonyms
►WeightWatchers.com
►National Instruments
►Media & publishing►Banking & insurance
Key Customers:
Base: Needham, MA Funding: MSFT, public
Key verticals:
►Avg deal size: $$$
FAST ESP
Founded: 1997
►Large financial data provider
►Keyword search
►Semantic search
►Drill down search
►Trend analysis
►Delta analysis
►Automated alerting
►Large Japanese auto manufacturer
►Large Japanese telcoprovider
►Security
►Records management
►Banking & insurance
►Military / govt intelligence
►Pharma & life sciences
Key Customers:
Base: Armonk, NY Funding: Public
Key verticals:
►Avg deal size: $$$$
�SaaS: XXXXX�On Premise: Windows, AIX, Linux
Omnifind Analytics Edition
Founded: 1889
►Sentiment analysis of print media
►Dow Jones factiva►Thomson Reuters
►Media and publishing
Key Customers:
Base: London, UK Funding: Public
Key verticals:
►Avg deal size: Not available
�SaaS:XXXXXXXX�On premise: Windows
Sentiment
Founded: 2000
►Fireman’s fund
►Categorization – Bayesian, SVD, Keyword, concept search
►Clustering
►Classification
►Concept extraction
►Thesaurus
►Relationship discovery
►Jacksonville Sheriff’s office
►Security►Law enforcement
Key Customers:
Base: Jacksonville, FL
Funding: Private
Key verticals:
�SaaS: available
►Avg deal size: $
�On Premise: Windows
uReveal
Founded: 2000
�SaaS: XXXXX
►SmartBrief
►Cisco Systems
►Sentiment extraction
►Tailored sentiment toolkit
►Entity extraction
►Entity relationships
►Document summarization
►FT.com
►Cymfony
►Marketing & surveys►Media & publishing
Key Customers:
Base: Amherst, MA Funding: Private
Key verticals:
►Avg deal size: $125-150,000
�On premise: Windows
Salience Engine w/ Sentiment Toolkit
Founded: 2003
►DVA
►FAA
•Taxonomy-based categorization►Taxonomy creation
►Entity extraction
►Clustering
►Ernst & Young
►Pfizer
►Pharmaceuticals
►Insurance
►Defense
►Aviation
Key Customers:
Base: Bloomington, IN Funding: Private
Key verticals:
�SaaS: XXXXXX
►Avg deal size: $300,000
�On premise: Windows
Polyanalyst
Founded: 1997
Text Mining Engine (TME)
�SaaS: XXXXX�On premise: Windows, Linux
►Reader’s Digest
►Time, Inc.
►Optional summarizer
►Sentiment analysis engine
►Automated entity extraction
►Categorizer
►Concept extraction
►Taxonomy management
►Le Monde
►Conde Nast
►Reed Business
►Media & publishing
Key Customers:
Base: Montreal, Quebec Funding: Public
Key verticals:
►Avg deal size: $750,000
Founded: 2001
SAS Text Miner►Clustering►POS tagging
►Concept extraction
�SaaS: XXXXX�On premise: Windows, Solaris, AIX
►Eli Lilly
►Department of the Treasury
►Multiple languages
►Stemming
►Multi-lingual
►Entity extraction
►Ford
►Pitney Bowes
►Customer analytics
►General OEM
►Market research & surveys
►Government / military
►Early warning (mfg)
►Banking & insurance
Key Customers:
Base: Cary, NC Funding: Private
Key verticals:
►Avg deal size: $200-300,000 (Inxight 2005)
Founded: 1976
�SaaS: XXXXX
Clementine 12►recency, frequency and monetary
►survival analysis
�On Premise: Windows, Linux, Solaris, HP-UX, IBM AIX
►Support Vector Machines algorithms
►Bayesian Networks algorithms
►Multi-lingual sentiment analysis
Fortune 500
►Customer analytics
►Early warning (mfg)
►Customer analytics
►Banking & insurance
►Market research & surveys
►Govt / military intelligence
►Pharma & life sciences
Key Customers:
Base: Chicago, Illinois
Funding: Public
Key verticals:
►Avg deal size: $$$
Founded: 1968
�SaaS: Hosted version available
►Concept-based searching
►Keyword searching
�On premise: Windows, Linux
BASF
►Entity extraction
►Categorization
►Information clustering
Novartis
Pfizer
►Industrial►Govt / military intelligence
►Pharma & life sciences
Key Customers:
Base: Paris, FR Funding: €7m, private equity
Key verticals:
►Avg deal size: €3,000-10,000 per user per year. On-premise version is priced on a per CPU basis and typically costs €200,000-300,000
Luxid
Founded: 2000
�SaaS: XXXXX
►Entity extraction and stemming
►Classification
►Discovery
�On Premise: Linux
►Base set of 10 taxonomies
►Statistical and NLP techniques
►“Frame of Reference”
►Federal agencies
►Govt / military intelligence
Key Customers:
Base: Mclean, VA Funding: Private - undisclosed
Key verticals:
►Avg deal size: ????????
Viziant 1.0
Founded: 2003
Sentiment Analysis
► Andiamo Systems
► Biz360, a veteran of the space
► BrandIntel
► Buzzlogic, a recent startup
► Collective Intellect – about a year old
► Jodange – media-based opinion tracking for chosen topics or influencers
► Monitor110, aimed at institutional investors
► MotiveQuest, tweaks its linguistic model depending on the domain being analyzed
► Nielsen Media Research's BuzzMetrics – the 800-pound gorilla that rolled up some of the early players
► Northern Light - veteran search company, with its MI Analyst sentiment analysis product
► Perception Metrics, claims to be able to do phrase-level sentiment analysis, aimed at PR and marketing professional
► RavenPack International, counts Dow Jones & Company as a partner Sentiment Metrics, a British-based brand monitoring company
► SAS – offers the service
► SPSS – offers the service
► Sentiment Metrics
► SentiMetrix – still in stealth, apparently
► ScoutLabs, is in beta and uses Lexalytics technology
► SkyGrid, aggregates and analyzes financial news
► Summize, analyzes online product reviews for sentiment
► Umbria, focused on online sentiment analysis of social media, such as blogs
Nick Patience
Research Director, Information Management
http://blogs.the451group.com/information_management/