+ All Categories
Home > Documents > Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Date post: 22-Dec-2015
Category:
View: 213 times
Download: 0 times
Share this document with a friend
Popular Tags:
39
– from Re- engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France
Transcript
Page 1: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Graphics Recognition – from Re-engineering to RetrievalKarl Tombre, Bart Lamiroy

LORIA, France

Page 2: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Document Analysis in the IR era

Information is at the core of industrial strategies

A lot of digital or digitized information, but often in very “poor” formats

The challenge: not necessarily re-engineering of documents, but enrich poorly structured information, add (limited) amount of semantics, build indexes

Purposes: browsing, navigation, indexing DAR methods and tools useful, but must

be adapted

Page 3: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Specific challenges of large-scale IR applications Genericity: we cannot necessarily build a

complete and exhaustive a priori model of contextual knowledge (ontology)

Adaptability: various input data – scanned paper, PDF, DXF, HTML, GIF… – various resolutions

Robustness: “back-office” applications Efficiency: online searching in

heterogeneous data Scaling: methods have to scale to

increasing number of symbols/features

Page 4: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

DAR and IR

Media without (or with very little) contextual knowledge

Image-based indexing and retrieval, indexing of video sequences

Documents do explicitly convey information from one person to another person

Much more structure, syntax and semantics

Page 5: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

DAR and IR – some examples

Indexing and/or searching scanned text without OCR

Similarities, signatures Query or index on layout structure Table spotting Keyword spotting …

Page 6: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

What about Graphics Recognition? Subfield of DAR, for graphics-rich

documents Numerous methods for various analysis

and recognition problems Raster-to-vector conversion Text/graphics separation Symbol recognition

Many specific technical areas: maps, architectural drawings, engineering drawings, diagrams and schematics, …

Page 7: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Graphics recognition methods Text/graphics separation

Page 8: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Vectorization

Graphics recognition methods

Page 9: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Graphics recognition and IR applications Usual text-based indexing and retrieval

still useful But need for access to other kinds of

information: Symbols Text-drawing connections Description-illustration connections

Page 10: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Some contributions Syeda-Mahmood – maintenance drawings

IEEE Trans. On PAMI 21(8):737-751, Aug. 1999

Page 11: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Some contributions Arias et al., Najman et al. – use of information

contained in legend / title block

Proc. GREC’01, Kingston (Ontario, Canada), p.19-26, Sept. 2001

Page 12: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Some contributions Samet & Soffer – symbols from legend

IEEE Trans. On PAMI 18(8):783-798, Aug. 1996

Page 13: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Some contributions Müller & Rigoll – graphical retrieval in database

of engineering drawings

Proc. ICDAR’99, Bangalore (India), pp. 697-700, Sept. 1999

Page 14: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Some contributions Boose et al. (Boeing) – Generation of Layered

Illustrated Parts Drawings (GREC’ 03)

Proc. GREC’03, Barcelona, pp. 139-144

Page 15: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Wishful thinking?

Symbol DB

Or even better…

Page 16: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Symbol recognition Natural features for indexing and retrieval Most methods work with known databases

of reference symbols – what about interactive querying of arbitrary symbols?

From segmentation followed by recognition, to segmentation-free recognition, or segmenting while recognizing

Scalability Efficiency / complexity Discrimination power

Signatures

Before we move on:

1st contest on

symbol recognition

held last week

See IAPR TC10 homepage

for further details

Page 17: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Image-based signatures

Compute invariant signatures on binary document image F-signatures (ICDAR’01) Radon transform: R-signatures [Tabbone

& Wendling] Ridgelets [Ramos Terrades & Valveny –

GREC’03] – aka wavelet transform of Radon transform

Page 18: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

R-signaturesDetection of arrowheads [Girardeau & Tabbone]

DEA degree thesis, INPL, Nancy, Jul. 2002

Page 19: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

R-signaturesAnother example [Girardeau & Tabbone]

Page 20: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Ridgelets[Ramos Terrades & Valveny – GREC’03]

Proc. GREC’03, Barcelona,

pp. 202-211

Page 21: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Vector-based signatures

[Dosch & Lladós – GREC’03] Based on set of basic graphical features:

Parallelism Overlap Collinearity T- and V-junctions

Quality factor associated with the various relations

Match signatures of reference symbols with signatures of buckets

Page 22: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Vector-based signatures

Proc. GREC’03,

Barcelona,

pp. 159-169

Page 23: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Towards symbol spotting

Pre-compute – or compute on the spot – a set of basic signatures

Can be sufficient for symbol spotting and retrieval

Followed by classical symbol recognition if more discrimination is needed

Page 24: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Symbol spotting [Jabari & Tabbone] : graph matching through

probabilistic relaxation, with nodes=segments and vertices=relations

DEA degree thesis, INPL, Nancy, Jul. 2003

Page 25: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Symbol spotting [Jabari & Tabbone] : another example

Page 26: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Combining Text and Graphics

Extracting Text/Graphics relationships within document

Using Text matching for inter-document relationships

Transitive inter-document Graphics matching

No need for complex graphics matching Restricted to well known document types

Page 27: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Example: continuation of Wiring Diagrams (Boeing) [Baum et al. – GREC’03]

Proc. GREC’03, Barcelona, pp. 132-138

Page 28: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Scan2XML Example

Proc. GREC’01, Kingston (Ontario, Canada), pp. 312-325

Page 29: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Indexing and Semantics

Signature + metric Semantics = measured distance to signature Applies only to homogenous contexts

Pre-segmented images Pre-determined image classes Implicit application of domain kowledge ...

Semantics = Syntax

Page 30: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Example

Signature type AMetric M

Semantics1 = (1, 1)Semantics2 = (, 2)

Signature value M(M(

semantics = measurement to reference value

Page 31: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Heterogenous Document Bases Semantics do not have a unique syntax

anymore Syntax metrics may be context sensitive Semantics = Syntax + Context

Context needs to be considered

Page 32: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Two different contexts from the automobile industry

Page 33: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Example

Context 1:Signature type AMetric M

(1, 1) = Semantics1 = (1, 1) (, 2) = Semantics2 = (, 2)

Context 2:Signature type BMetric N

Signature value What if

M( and N(

Page 34: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

A step to taking into account context(while consolidating existing approaches)

Component Algebra : Image Analysis = Pipeline Syntax + algorithm = semantics

AlgorithmAlgorithmDataData

(syntax)

DataData

(semantics)

AlgorithmAlgorithmDataData

(semantics)

Syntax and semantics need not be distinguished

Page 35: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Component Algebra

Components :Known and implemented document analysis

algorithms, taking input data from one domain, and producing data into another domain.

Application Context :Set of all available Components.

Semantics :Data sets needed by or produced by Components.

Page 36: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Component Algebra is a Graph

ComponentComponentDataData

DataData

ComponentComponent

DataDataDataData

DataData DataData

DataData

ComponentComponent

Page 37: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Advantages

Each node is a semantic concept, semantic relationships are explicitly expressed.

Structure may support automatic reasoning and knowledge inference.

Context is embedded in components, different contexts give different paths in the graph.

Highly scalable and open architecture. Bridge between signal-level document

analysis and high-level document representation.

Page 38: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

However ...

The formalism exists, the realization doesn't (yet)

What about parametrization ? How context independant can you get ? What about « guessing » context

appropriateness ? How to design fully interoperable components ?

Page 39: Graphics Recognition – from Re-engineering to Retrieval Karl Tombre, Bart Lamiroy LORIA, France.

Conclusion A lot of DA methods – and more specifically

GR methods – can be of direct use in IR, indexing and browsing applications

Specific challenges Scaling and efficiency Heterogeneous sets of documents Incomplete domain knowledge Symbol spotting On-the-fly symbol searching

Sketch of open framework for including document semantics when context can be heterogeneous


Recommended