Date post: | 05-Aug-2015 |
Category: | Data & Analytics |
Upload: | seth-grimes |
Text Analytics 2015
Text Analytics Today
Seth Grimes, Alta Plana Corporation
@sethgrimes
IIeX – Atlanta, June 16, 2015
“Who controls the past, controls the future. Who controls the present, controls the past.”
-- 1984, George Orwell
Let’s start with the past…
Document input and processing
Knowledge handling is key
Desk Set (1957): Computer engineer Richard Sumner (Spencer Tracy), television network librarian Bunny Watson (Katharine Hepburn), and the "electronic brain" EMERAC.
Hans Peter Luhn, “A Business Intelligence System,” IBM Journal, October 1958
2005: “The bulk of information value is perceived as coming from data in relational tables. The reason is that data that is structured is easy to mine and analyze.”
-- Prabhakar Raghavan, now Google VP Engineering
2007: “Organizations embracing text analytics all report having an epiphany moment when they suddenly knew more than before.”
-- Philip Russom, the Data Warehousing Institute
2010: “The Web has dramatically changed the way that people express their views and opinions.”
-- Prof. Bing Liu, Univ. of Illinois, Chicago
“The future is clearly about analyzing feedback in any form that your customers give it. That’s a trend that won’t go away.”
-- Bruce Temkin
2011: 2015:
When?
http://www.fastcompany.com/3028106/innovation-agents/watsons-next-challenge-smarter-cancer-treatments
HPCCsystems.com
Ava applies Affective, Cognitive & Psychomotor methods (per Bloom’s Taxonomy of educational objectives).
Drivers and Trends
For insights, technology drives method.
• Data science, data monetization.
• Big data: Social, online & enterprise.
• Volume and velocity mean new analytical approaches.
• Variety: new types and a new fusion imperative.
• Algorithms… cognitive and affective.
• Stats.
• Language engineering.
• Deep learning; Unsupervised, semi-, supervised & active methods.
• Via-API cloud services… the API economy.
Do you currently need (or expect to need) to extract or analyze:
Events | Current 33% | Expect 21%
Semantic annotations | Current 31% | Expect 24%
Other entities (phone numbers, part/product numbers, e-mail & street addresses, etc.) | Current 34% | Expect 23%
Metadata such as document author, publication date, title, headers, etc. | Current 47% | Expect 23%
Concepts, that is, abstract groups of entities | Current 51% | Expect 28%
Named entities (people, companies, geographic locations, brands, ticker symbols, etc.) | Current 56% | Expect 25%
Relationships and/or facts | Current 47% | Expect 33%
Sentiment, opinions, attitudes, emotions, perceptions, intent | Current 54% | Expect 28%
Topics and themes | Current 66% | Expect 22%
http://altaplana.com/TA2014
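Pattern-based entity types from the survey above, such as phone numbers and e-mail addresses, are often pulled out with regular expressions. A minimal Python sketch follows; the function name and both patterns are illustrative assumptions, not taken from the talk, and real-world formats vary far more than these patterns cover.

```python
import re

# Illustrative patterns only (assumptions, not from the talk).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Assumes North American style numbers written like 404-555-0123.
PHONE_RE = re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b")


def extract_entities(text):
    """Return the e-mail addresses and phone numbers found in text."""
    return {
        "email": EMAIL_RE.findall(text),
        "phone": PHONE_RE.findall(text),
    }


if __name__ == "__main__":
    sample = "Contact jane.doe@example.com or call 404-555-0123."
    print(extract_entities(sample))
```

Named entities, concepts, and sentiment generally need lexicons or statistical models rather than patterns like these.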
“The sharp rise in users who selected Arabic… coincided with much of the civil unrest… in Middle Eastern countries.”
http://bits.blogs.nytimes.com/2014/03/09/the-languages-of-twitter-users/
Non-English language support (survey chart; series: Current, Within 2 years)
Languages labeled on the chart: Arabic; Chinese; French; Greek; Italian; Korean; Portuguese; Scandinavian or Baltic; Turkish or Turkic; Other Arabic script (including Urdu, Pashto, Farsi, Dari); Other European or Slavic/Cyrillic
Rules, Lexical & Semantic Nets, Learning, Stats
http://eecs-newsletter.mit.edu/articles/2009-fall/climbing-the-tower-of-babel-advances-in-unsupervised-multilingual-learning/
http://theanalyticsstore.ie/deep-learning/
http://courses.washington.edu/hypertxt/cgi-bin/book/maps/semantic.html
https://www.brandwatch.com/2013/08/the-importance-of-a-custom-search-functionality-in-a-monitoring-tool/
Word2Vec
https://code.google.com/p/word2vec/
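Word2Vec represents each word as a dense numeric vector, and words used in similar contexts end up with similar vectors. The Python sketch below shows only the comparison step, cosine similarity. The three-dimensional vectors are hand-made toy values, not output from the Word2Vec tool, and the function names are assumptions for illustration.

```python
import math

# Hand-made toy vectors (an assumption for illustration). Trained
# Word2Vec embeddings typically have hundreds of dimensions.
TOY_VECTORS = {
    "king": [0.9, 0.8, 0.0],
    "queen": [0.8, 0.9, 0.1],
    "apple": [0.0, 0.1, 0.9],
}


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def most_similar(word, vectors=TOY_VECTORS):
    """Return the other word whose vector is closest to the given word."""
    target = vectors[word]
    others = (w for w in vectors if w != word)
    return max(others, key=lambda w: cosine(target, vectors[w]))


if __name__ == "__main__":
    print(most_similar("king"))  # prints: queen
```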
Emoji
http://instagram-engineering.tumblr.com/post/117889701472/emojineering-part-1-machine-learning-for-emoji
Facebook Topic Data
http://datasift.com/products/pylon-for-facebook-topic-data/
sentimentsymposium.com
IIEX code = $300 off