
Text Analytics Today

Date post: 05-Aug-2015
Upload: seth-grimes
Text Analytics 2015 Text Analytics Today Seth Grimes Alta Plana Corporation @sethgrimes IIeX – Atlanta June 16, 2015
Transcript
Page 1: Text Analytics Today

Text Analytics 2015

Text Analytics Today

Seth Grimes
Alta Plana Corporation

@sethgrimes

IIeX – Atlanta
June 16, 2015

Page 2: Text Analytics Today

Page 3: Text Analytics Today

“Who controls the past, controls the future. Who controls the present, controls the past.”

-- 1984, George Orwell

Let’s start with the past…

Page 4: Text Analytics Today

Document input and processing

Knowledge handling is key

Desk Set (1957): Computer engineer Richard Sumner (Spencer Tracy), television network librarian Bunny Watson (Katharine Hepburn), and the "electronic brain" EMERAC.

Hans Peter Luhn, “A Business Intelligence System,” IBM Journal, October 1958

Page 5: Text Analytics Today

2005: “The bulk of information value is perceived as coming from data in relational tables. The reason is that data that is structured is easy to mine and analyze.”

-- Prabhakar Raghavan, now Google VP Engineering

Page 6: Text Analytics Today

2007: “Organizations embracing text analytics all report having an epiphany moment when they suddenly knew more than before.”

-- Philip Russom, the Data Warehousing Institute

Page 7: Text Analytics Today

2010: “The Web has dramatically changed the way that people express their views and opinions.”

-- Prof. Bing Liu, Univ. of Illinois, Chicago

“The future is clearly about analyzing feedback in any form that your customers give it. That’s a trend that won’t go away.”

-- Bruce Temkin

Page 8: Text Analytics Today

Page 9: Text Analytics Today

Page 10: Text Analytics Today

2011:

2015:

When?

http://www.fastcompany.com/3028106/innovation-agents/watsons-next-challenge-smarter-cancer-treatments

HPPCsystems.com

Page 11: Text Analytics Today

Ava applies Affective, Cognitive & Psychomotor methods (per Bloom’s Taxonomy of educational objectives).

Page 12: Text Analytics Today

Drivers and Trends

For insights, technology drives method.
• Data science, data monetization.
• Big data: Social, online & enterprise.
• Volume and velocity mean new analytical approaches.
• Variety: new types and a new fusion imperative.
• Algorithms… cognitive and affective.
• Stats.
• Language engineering.
• Deep learning; Unsupervised, semi-, supervised & active methods.
• Via-API cloud services… the API economy.

Page 13: Text Analytics Today

Do you currently need (or expect to need) to extract or analyze –

Events: Current 33%; Expect 21%
Semantic annotations: Current 31%; Expect 24%
Other entities – phone numbers, part/product numbers, e-mail & street addresses, etc.: Current 34%; Expect 23%
Metadata such as document author, publication date, title, headers, etc.: Current 47%; Expect 23%
Concepts, that is, abstract groups of entities: Current 51%; Expect 28%
Named entities – people, companies, geographic locations, brands, ticker symbols, etc.: Current 56%; Expect 25%
Relationships and/or facts: Current 47%; Expect 33%
Sentiment, opinions, attitudes, emotions, perceptions, intent: Current 54%; Expect 28%
Topics and themes: Current 66%; Expect 22%

http://altaplana.com/TA2014

Page 14: Text Analytics Today

“The sharp rise in users who selected Arabic… coincided with much of the civil unrest… in Middle Eastern countries.”

http://bits.blogs.nytimes.com/2014/03/09/the-languages-of-twitter-users/

Page 15: Text Analytics Today

Non-English language support (bar chart, percentages: Current vs. Within 2 years)

Languages labeled on the chart: Arabic; Chinese; French; Greek; Italian; Korean; Portuguese; Scandinavian or Baltic; Turkish or Turkic; Other Arabic script (including Urdu, Pashto, Farsi, Dari); Other European or Slavic/Cyrillic

Page 16: Text Analytics Today

Rules, Lexical & Semantic Nets, Learning, Stats

http://eecs-newsletter.mit.edu/articles/2009-fall/climbing-the-tower-of-babel-advances-in-unsupervised-multilingual-learning/

http://theanalyticsstore.ie/deep-learning/

http://courses.washington.edu/hypertxt/cgi-bin/book/maps/semantic.html

https://www.brandwatch.com/2013/08/the-importance-of-a-custom-search-functionality-in-a-monitoring-tool/

Page 17: Text Analytics Today

Word2Vec

https://code.google.com/p/word2vec/

Page 18: Text Analytics Today

Emoji

http://instagram-engineering.tumblr.com/post/117889701472/emojineering-part-1-machine-learning-for-emoji

Page 19: Text Analytics Today

Facebook Topic Data

http://datasift.com/products/pylon-for-facebook-topic-data/

Page 20: Text Analytics Today

sentimentsymposium.com

IIEX code = $300 off

Page 21: Text Analytics Today

Text Analytics Today

Seth Grimes
Alta Plana Corporation

@sethgrimes

IIeX – Atlanta
June 16, 2015

