+ All Categories
Home > Software > II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and...

II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and...

Date post: 11-May-2015
Category:
Upload: dr-haxel-congress-and-event-management-gmbh
View: 459 times
Download: 0 times
Share this document with a friend
Popular Tags:
31
Dr. Philipp Daumke Analyze Text, Gain Answers
Transcript
Page 1: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

Dr. Philipp Daumke

Analyze Text, Gain Answers

Page 2: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

ABOUT AVERBIS

Founded: 2007

Location: Freiburg im Breisgau

Team: Domain- & IT-Experts

Focus: Leverage structured & unstructured information

Current Sectors: Pharma, Health, Automotive, Publishers & Libraries

Page 3: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

PORTFOLIO

PRODUCTS:

CORE TECHNOLOGIES:

Page 4: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)
Page 5: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

CHALLENGE

Exponential growth of data

• need for data-driven decisions

• limited human resources for analysis

New analytics tools needed for

• Semantic search and discovery

• Competitor analysis

• Identification of market trends

• IP landscaping

• Portfolio analysis

• …

Patent applications:

Medline articles:

Page 6: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

� (Semi-)Automate patent categorization

with high precision

� Learning system

imitates the behavior of IP professionals

� Semantic search

Search for meanings, not just keywords

Page 7: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

PATENT ANALYTICS

Page 8: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

PATENT ANALYTICS

TerminologiesText Mining Rules

Text Mining Machine Learning

Patent Collection

Page 9: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

TERMINOLOGY MANAGEMENT

Define the ‚semantic space‘ of your technology fields• Keywords

• Categories

• Hierarchies

• ….

Include relevant word lists from your company• Products

• Devices

• Companies

• Components

• Indications

• …

Reuse already existing terminologies on the market

Page 10: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

TEXT MINING

Lung metastasis lung metastasis

lung metastases

metastases in the lung

metastases in the lower lobe of the lung

pulmonal metastates

pulmonal relapse of a metastasis

pulmonal filia

pulmonal filiae

lung filiae

lower lobe filiae

Page 11: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

TEXT MINING

tumors tumour

cancer

carcinoma

lymphoma

endometrioma

astrocytoma

glioblastoma

seminoma

ALL

leukemia

Page 12: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

TEXT MINING

Page 13: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

PATENT CLASSIFICATION – MACHINE LEARNING

System learns how to fine-classify patents

�Observes and imitates human decision making

Advantages

• No explicit externalization of knowledge needed

• No rule-writing

• Better results

• System generalizes (higher recall)

• Statistical model can handle „noise“ better than rules

• Ambiguity and textual variations better handled

Page 14: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

THE PROCESS OF MACHINE LEARNING

Labeling

• Up to 100 categories

• ~10-50 patents per category

• Hierarchical categories

• Multi-labeling

Learning

• Learn characteristic patterns in labeled data

• Lots of different classification algorithms

Prediction & Review

• Automatically map new patents to categories

• Confidence value for each category

• Different selection criteria

14

Page 15: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

POWERFUL FRONTEND

Linguistic full text search

Lingustic

Filters

Patent Summary

Additional info, e.g. picture

Multilabel Classification

Page 16: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE1: LARGE-SCALE PATENT LANDSCAPING

• Goal: to semi-automatically categorize patents to the

company‘s technology landscape

• Technology Landscape: 35 Classes (8 main classes, 27 sub-

classes)

• 7.000 patents, 10 competitors

• Evaluation

– between automated judgement with expert judgement

– between two expert judgements (Interrator-Agreement)

Page 17: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE1: LARGE-SCALE PATENT LANDSCAPING

Page 18: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

CONFUSION MATRIX

Page 19: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE1: LARGE-SCALE PATENT LANDSCAPING

Results Accuracy Time Savings

Automated, Scenario I 85% 70%

Automated, Scenario II 82% 80%

Manual (2 expert judges) 80%

Averbis Patent Analytics save up to 80% of time with

accuracy being on par with manual judges!

Page 20: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE2: RESEARCH LITERATURE RELEVANCY

• Goal: to automatically identify company‘s relevant

literature

• Rule set:

– Mentionings of company‘s indications, products, etc.

– Competitor products and indications

– „Testosterone, but only given externally“

– „Products shall not be found in an enumeration“

– …

Page 21: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

PATENT ANALYTICS

Rule SetText Mining,

Machine LearningSearch, Analysis

Medline, Embase

Page 22: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

VERAPAMIL

Page 23: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE2: RESEARCH LITERATURE RELEVANCY

Rule: Testosterone, but only given externally

Page 24: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE2: RESEARCH LITERATURE RELEVANCY

Rule: Ignore products listed in enumerations

Page 25: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE 3: SOCIAL MEDIA ANALYTICS

Page 26: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE 3: SOCIAL MEDIA ANALYTICS

Page 27: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE 3: SOCIAL MEDIA ANALYTICS

Main Challenge: what is positive, what is

negative?

– „Could somebody please remove the dead bird from the

balcony“?

– „From the breadcrumbs lying under the bed one could live for

ages“

– „The hotel is situated in the crowdiest party district of the town“

– „The toilets were that big that I couldn‘t sit down for …“

Page 28: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE4: PATIENT RECRUITMENT/DIAGNOSIS SUPPORT

Disease ProfilesInclusion/Exclusion Criteria

Categorization Visualization

Electronic Health Records

Page 29: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE4: PATIENT RECRUITMENT/DIAGNOSIS SUPPORT

Page 30: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

USE CASE4: PATIENT RECRUITMENT/DIAGNOSIS SUPPORT

Page 31: II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (Katrin Tomanek and Philipp Daumke - Averbis, Germany)

For further questions, please contact

Dr. Philipp Daumke

[email protected]

+49 761 - 203 9769 0


Recommended