+ All Categories
Home > Documents > Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf ·...

Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf ·...

Date post: 19-Apr-2018
Category:
Upload: nguyennhu
View: 214 times
Download: 0 times
Share this document with a friend
134
Soft Computing, Machine Intelligence and Granular Data Mining: Data to Knowledge Sankar K. Pal Indian Statistical Institute Calcutta http://www.isical.ac.in/~sankar ISI-SU Autumn School on Machine Intelligence and Applications, Sept 22-26, Sikim University, Gangtok, Sikim
Transcript
Page 1: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Soft Computing, Machine Intelligence and Granular Data Mining:

Data to Knowledge

Sankar K. Pal Indian Statistical Institute

Calcutta http://www.isical.ac.in/~sankar

ISI-SU Autumn School on Machine Intelligence and Applications, Sept 22-26, Sikim University, Gangtok, Sikim

Page 2: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Contents What is Soft Computing ? Pattern Recognition and Machine

Intelligence Relevance of Soft Computing Tools Data Mining from PR point of view

Fuzzy Sets and Flexibility ANN and GAs: Features Rough Sets and Information Granules

Example: Case mining Other applications

Page 3: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Integrations of SC Tools : Challenges

Rough-neural computing Neural-rough-fuzzy computing Rough-fuzzy computing

Generalized rough sets and entropy Examples

Object extraction in image/ video

Challenging issues Relevance to Big Data Conclusions

Page 4: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

SOFT COMPUTING (L. A. Zadeh)

Aim : • To exploit the tolerance for imprecision

uncertainty, approximate reasoning and partial truth to achieve tractability, robustness, low solution cost, and close resemblance with human like decision making

• To find an approximate solution to an imprecisely/precisely formulated problem.

Page 5: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Parking a Car Generally, a car can be parked rather easily because the final position of the car is not specified exactly. If it were specified to within, say, a fraction of a millimeter and a few seconds of arc, it would take hours or days of maneuvering and precise measurements of distance and angular position to solve the problem. ⇒ High precision carries a high cost

Page 6: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

⇒ The challenge is to exploit the tolerance for imprecision by devising methods of computation which lead to an acceptable solution at low cost. This, in essence, is the guiding principle of soft computing.

Page 7: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

• Soft Computing is a collection of methodologies (working synergistically, not competitively) which, in one form or another, reflect its guiding principle: Exploit the tolerance for imprecision, uncertainty, approximate reasoning and partial truth to achieve Tractability, Robustness, and close resemblance with human like decision making.

Foundation for the conception and design of high MIQ (Machine IQ) systems.

Page 8: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

• At this junction, the principal constituents of soft computing are Fuzzy Logic , Neurocomputing , Genetic Algorithms and Rough Sets . RS

• Within Soft Computing FL, NC, GA, RS are Complementary rather than Competitive

FL NC GA

Page 9: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

FL : the algorithms for dealing with imprecision and uncertainty NC : the machinery for learning and curve fitting GA : the algorithms for search and optimization

RS RS handling uncertainty arising from the granularity in the domain of discourse

Role of

Page 10: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Machine Intelligence

Knowledge-based Systems

Probabilistic reasoning Approximate reasoning Case based reasoning

Fuzzy logic Rough sets

Pattern recognition and learning

Hybrid Systems

Neuro-fuzzy Genetic neural Rough fuzzy Fuzzy neuro genetic

Non-linear Dynamics

Chaos theory Rescaled range analysis (wavelet) Fractal analysis

Data Driven Systems

Neural network system Evolutionary computing

Machine Intelligence: A core concept for grouping various advanced technologies with Pattern Recognition and Learning

IAS are physical embodiments of Machine Intelligence

Page 11: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Measurement → Feature → Decision Space Space Space – Uncertainties arise from deficiencies of information

available from a situation – Deficiencies may result from incomplete,

imprecise, ill-defined, not fully reliable, vague, contradictory information in various stages of a PRS

Pattern Recognition System (PRS)

10

Page 12: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

M : Height, Weight, Complexion, Diet….

Height

….. …. ……

xxxxx xxxxxx

xxxx B

P

F:

Weight

D : Straight Line

D ⇒ Classifier Design

Page 13: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance
Page 14: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Father Mother

Son Daughter

Sex-wise

Age-wise

Clustering

Blood group wise

Page 15: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Classification: Sampled data are given about the pattern space And the Challenge is to estimate the unknown regions of the pattern space based on the sampled data (incomplete information) Abstraction + Generalization

Clustering: Entire data is given And the

Challenge is to partition it into meaningful regions. Number of regions may be known or unknown

Tasks & Challenges

Page 16: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Image Classification Pixel Classification Supervised Image Segmentation Pixel clustering Unsupervised

15

Page 17: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Pattern Recognition and Machine Learning principles applied to a very large (both in size and dimension) heterogeneous database ≡ Data Mining Data Mining + Knowledge Interpretation ≡ Knowledge Discovery Process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data

Page 18: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Fuzzy Sets and Flexibility

Page 19: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

FUZZY SETS

Classical set Hard Fuzzy set Soft

1,0∈µ

µA(x) : degree of belonging of x to A or degree of possessing some imprecise property represented by A

Example : tall man, long street, large number, sharp corner, very young, etc.

• Fuzzy set is a Generalization of classical set theory

⇒ Greater flexibility in capturing faithfully various aspects of incompleteness or imperfection in a situation.

WHITE YES

1

0 BLACK

NO

]1,0[∈µ

A = (µA(x),x) : for all x ∈ X

Page 20: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Meeting at 5 PM

5 6 7 4 0

1 Crisp Set

Fuzzy Set Memb. Function

• Fuzzy Sets are nothing but Membership Functions • Membership Function: Context Dependent

Page 21: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Flexibility of fuzzy set theory is associated with the Concept of

• : A measure of compatibility of an object with the concept represented by fuzzy set.

• TALL = 0.3 means Compatibility of some one with the set ``TALL´´ NOT the prob. that some one is TALL

i.e., 0.3 is the extent to which the concept ``TALL´´ must be stretched to fit him

• As Amount of Stretching Concept

FUZZINESS IS ANALOGOUS TO ELASTICITY

µ ↓

µ

µ

µ

Page 22: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Concept of Flexibility & Uncertainty Analysis

Page 23: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Relevance of Fuzzy Sets in PR Representing linguistically phrased input

features for processing • Representing multi-class membership of

ambiguous patterns • Generating rules & inferences in linguistic form • Extracting ill-defined image regions,

primitives, properties and describing relations among them as fuzzy subsets 20

Page 24: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

µ3

µ1 µ2

X

Y

Page 25: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Combined choice

Correct Wrong

First choice Second choice Wrong

Conventional two states

Proposed four states

Page 26: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Samples under combined choice, second choice can be corrected at higher level under the control of a supervisory programme.

Example : Satellite Imagery Analysis

INFORMATION on a pixel is not only from the concrete, but also from vegetation or water body

Decision on a pixel should indicate its multi-class membership with certainty values Linking of Broken Roads can be guided with second / combined choice for their detection

Roads shaded with trees - Roads nearby water body or vegetation.

Page 27: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Calcutta (SPOT, Infrared) Enhanced Image

Page 28: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Null Water P.Wat. Con. Hab. Veg. Op.Sp. Classified Image

Page 29: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Second Combined First Choice Pure Water Class

Page 30: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Second Combined First Choice Concrete Structures Class

Page 31: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Linear Structures Segmented Image

Page 32: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Artificial Neural Networks (ANNs): Learning and Adaptation

Page 33: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Major Characteristics of ANN

• Adaptability to new data/environment • Robustness/ Ruggedness to failure of components • Speed via massive parallelism • Optimality w.r.t error Machinery for learning (abstraction and

generalization) and curve fitting

Page 34: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

ANNs provide Natural Classifiers having Resistance to Noise, Tolerance to Distorted Patterns /Images

(Ability to Generalize) Superior Ability to Recognize Overlapping

Pattern Classes or Classes with Highly Nonlinear Boundaries or Partially Occluded or Degraded Images

• Potential for Parallel Processing

• Non parametric

30

Page 35: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Genetic Algorithms (GAs): Search and Optimization

Page 36: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Why GAs in PR ?

Methods developed for Pattern Recognition and Image Processing are usually problem dependent.

Many tasks involved in analyzing/identifying a pattern need Appropriate Parameter Selection and Efficient Search in complex spaces to obtain Optimal Solutions

Makes the processes - Computationally Intensive - Possibility of Losing the Exact Solution

Page 37: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

•• GAs : Efficient, Adaptive and robust Search Processes, Producing near optimal solutions and have a large amount of Implicit Parallelism

GAs are Appropriate and Natural Choice

for problems which need – Optimizing Computation Requirements, and Robust, Fast and Close Approximate Solutions

Page 38: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Example of GA based Classification

Automatic selection of no. of hyper planes for approximating class boundaries for minimum miss-classification (VGA classifier) Chromosome (sexual) discrimination to reduce computation time (VGACD classifier) Robust Searching Ability (suitable when the search space is large)

33

Page 39: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

SPOT Image of Calcutta in the Near Infra Red Band

Garden Reach Lake

Hooghly

Howrah Bridge

Racecourse

Khidirpore Dockyard

Intl. J. Remote Sensing, 22(13), 2545-2569, 2001

(spatial resolution = 20m x 20m wavelength = 0.79µm-0.89µm)

IEEE Trans. Geosci. & Remote Sensing, 39(2), 303-308, 2001

Page 40: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Scatter plot of the training set of SPOT image of Calcutta, containing seven classes.

Page 41: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

(b) (c) (a)

(d) (e) (f)

Classified SPOT image of Calcutta (zooming the race course ‘R’ only) using (a) VGACD-Classifier, Hmax=15, final value of H=13, (b) VGA classifier, Hmax=15, final value of H=10, (c) Bayes maximum likelihood Classifier, (d) k-NN rule, k=1, (e) k-NN rule, k=3, (f) k-NN rule, k=sqrt(n).

IEEE Trans. Geosci. & Remote Sensing 39(2), 303-308, 2001

Page 42: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Variation of the number of points misclassified by the best Chromosome with generations for VGACD classifier and VGA classifier

IEEE Trans. Geosci. & Remote Sensing 39(2), 303-308, 2001

Page 43: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rough Sets and Granular Computing

Page 44: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rough Sets

. x

Upper Approximation BX

Set X

Lower Approximation BX

[x]B (Granules)

[x]B = set of all points belonging to the same granule as of the point x in feature space ΩB.

[x]B is the set of all points which are indiscernible with point x in terms of feature subset B.

UB ⊆ΩZ. Pawlak 1982, Int. J. Comp. Inf. Sci.

Page 45: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Approximations of the set UX ⊆

B-lower: BX = ][: XxUx B ⊆∈

B-upper: BX = ][: φ≠∩∈ XxUx B

If BX = BX, X is B-exact or B-definable Otherwise it is Roughly definable

Granules definitely belonging to X

w.r.t feature subset B

Granules definitely and possibly belonging to X

Rough Sets are Crisp Sets, but with rough description

Page 46: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rough Sets

Uncertainty Handling

Granular Computing

(Using lower & upper approximations) (Using information granules)

Two Important Characteristics

40

Page 47: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

IEEE Trans. Syst., Man and Cyberns. Part B, 37(6), 1529-1540, 2007

Cluster definition in terms of rough lower and upper approximations

Lower and upper approximate regions could be crisp or fuzzy

Page 48: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

In Real life problems – Sets and Granules can either or both be

fuzzy Generalized Rough Sets

Upper and Lower approx. regions could be crisp or fuzzy

- Stronger framework for uncertainty handling - Rough-fuzzy computing : New paradigm

Page 49: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Before I describe the application of rough-fuzzy computing, let me explain the concept of f-information granules

Relevance of integration in SC paradigm

Page 50: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Concept of - f- Information Granules using Rough Rules

Page 51: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

low medium high

low

m

ediu

m

high

F1

F2

Rule 21 MM ∧←• Rule provides crude description of the class using granule

Information Granules and Rough Set Theoretic Rules

Page 52: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rule characterizing the granule can be viewed as the Case or Prototype representing the class/ concept/ region

Elongated objects need multiple rules/

granules Unsupervised: No. of granules is

determined automatically Cases (prototypes) are granules, not

sample points case generation, NOT selection

Note:

Page 53: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

All the features may not appear in rules Dimensionality reduction Depending on topology, granules of

different classes may have different dimensions Variable dimension reduction

Less storage requirement Fast retrieval

Suitable for mining data with large dimension and size

Note:

Page 54: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Example: IRIS data case generation Three flowers: Setosa, Versicolor and Virginica No of samples: 50 from each class Features: sepal length, sepal width, petal length, petal width

IEEE Trans. Knowledge Data Engg., 16(3), 292, 2004

Page 55: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

(a) Sepal L- Sep W (b) Sepal L – Petal L (c) Sepal L – Petal W

Iris Folowers: Setosa, Versicolor and Virginica

(a) (b)

(c)

Page 56: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

(a) Petal L - Sepal W (b) Petal W - Sepal W (c) Petal W - Petal L

Iris Folowers: Setosa, Versicolor & Virginica

(a) (b)

(c)

Page 57: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Iris Flowers: 4 features, 3 classes, 150 samples

0

0.51

1.5

22.5

33.5

4

avg. feature/case

Rough-fuzzyIB3IB4Random

Number of cases = 3 (for all methods)

80%82%84%86%88%90%92%94%96%98%

100%

Classification Accuracy (1-NN)

Rough-fuzzyIB3IB4Random

00.5

11.5

22.5

33.5

44.5

tgen(sec)

Rough-fuzzyIB3IB4Random

00.0010.0020.0030.0040.0050.0060.0070.0080.009

0.01

tret(sec)

Rough-fuzzyIB3IB4Random

Page 58: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Information compression

Computational gain

Information Granules: A group of similar objects clubbed together by an indiscernibility relation Granular Computing: Computation is performed using information granules and not the data points (objects)

50

Page 59: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Applications of Rough Granules

Case based reasoning (evident is sparse) Case representation and indexing

Prototype generation and class representation involving datasets large in dimension and size Dimensionality reduction and Data mining

Data compression and storing Clustering & Image segmentation (k selected autom)

Knowledge encoding (NN structure formation)

Granular information retrieval in heterogeneous media (e.g., text, hypertext, image) like WWW

Page 60: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Applications of Rough Granules

Case Based Reasoning (evident is sparse) Prototype generation and class representation Clustering & Image segmentation (k selected autom)

Case representation and indexing Knowledge encoding Dimensionality reduction Data compression and storing Granular information retrieval

Page 61: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Certain Issues

Selection of granules and sizes Class dependent or independent Fuzzy granules

Fuzzy set over crisp granules Crisp set over fuzzy granules Fuzzy set over fuzzy granules

Granular fuzzy computing Fuzzy granular computing

These issues would be addressed, in one form or others, in the following examples - Nature of granules - Role of granules - GFC or FGC - Superiority of R-F computing

Page 62: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

• Individual Relevance of FL, ANN, GAs, RS to PR and mining Problems is Established adequately

Page 63: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Challenging Issues in Soft Computing Research: Judicious Integrations

Page 64: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

In late eighties scientists thought – Why NOT Integrations ?

Fuzzy Logic + ANN ANN + GA Fuzzy Logic + ANN + GA Fuzzy Logic + ANN + GA + Rough Set

Neuro-fuzzy hybridization is the most visible integration realized so far.

Page 65: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Why Fusion Fuzzy Set theoretic models try to mimic human reasoning and the capability of handling uncertainty – (SW) Neural Network models attempt to emulate architecture and information representation scheme of human brain – (HW)

NEURO-FUZZY Computing (for More Intelligent System)

Page 66: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

FUZZY SYSTEM

ANN used for learning and Adaptation NFS

ANN

Fuzzy Sets used to Augment its Application domain

FNN

Page 67: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rough-fuzzy Computing : A stronger Paradigm for Uncertainty Handling

Recently -

Page 68: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Merits and Challenges

GENERIC APPLICATION SPECIFIC

Page 69: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Certain Issues

Selection of granules and sizes Class dependent or independent Fuzzy granules

Fuzzy set over crisp granules Crisp set over fuzzy granules Fuzzy set over fuzzy granules

Granular fuzzy computing Fuzzy granular computing

These issues would be addressed, in one form or others, in the following examples - Nature of granules - Role of granules - GFC or FGC - Superiority of R-F computing

Page 70: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rough-fuzzy Computing : Applications

Page 71: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Example: Class-Dependent Rough-Fuzzy Granular Space and Classification

Granules’ shapes are class dependent Rough sets are used on fuzzy granulated space for feature selection Effectiveness of Neighborhood rough sets is studied Fuzzy granules & Crisp computation → FGC

Fuzzy granules in modeling overlapping classes

Pattern Recognition, 45(7), 2690-2707, 2012

59

Page 72: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Fuzzy (f) granulation

F1

F2

C1

C2 C3

C4

CD granulation

CI granulation

Fuzzy granulation of features F1 and F2 characterizing granules for four overlapping classes

Example:

# of granules: cn vs. 3n (l, m, h = 3)

Pattern Recognition, 45(7), 2690-2707, 2012

Page 73: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Schematic diagram for pattern classification

Page 74: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

• Model 1 : k-nearest neighbor (k-NN) classifier • Model 2 : CI fuzzy granulation + PaRS based feature selection + k-NN classifier • Model 3 : CI fuzzy granulation + NRS based feature selection + k-NN classifier • Model 4 : CD fuzzy granulation + PaRS based feature selection + k-NN classifier • Model 5 : CD fuzzy granulation + NRS based feature selection + k-NN classifier

Five classification models combining different granular feature spaces and feature selection methods

Page 75: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

+

+

+

+

+

+ +

+ +

+

+

+

+

+

+

+

+

+

* *

* *

*

*

*

*

*

*

* *

*

* *

*

*

*

* *

*

*

* *

F1

F2 +

+

*

+ +

*

*

x1

x2

ɸ

ɸ +

Two neighborhood granules centered at samples x1 and x2 in F1- F2 feature space. φ is the radius of the granules and ∆(xi, xj) ≤ φ. Granules’ shape & size are

determined by p norm distance function (∆) and threshold ɸ.

Neighborhood Granule Generation for two overlapping classes

Page 76: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Variation of classification accuracy with granule radius φ for three p-norm distances for model 5 and VOWEL data (Train set = 20%)

Optimum φ = 0.45 Beyond 0.5, NRS based model can’t select relevant features to distinguish patterns, since possibility of possessing irrelevant/ contradictory feature information by granules increases

Page 77: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Multi-Spectral IRS Image of Calcutta (Dim = 512x512, Spatial resolution = 36.25 m X 36.25 m, Wavelengths = 0.77-0.86µm, Major land covers = pure water, turbid water, concrete area, habitation, vegetation, open space)

Band 1 Band 2

Band 3 Band 4

Page 78: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Indices

Davies-Bouldin (DB) Index:

Dunn (D) Index:

S(vi): Variance d(.,.): Distance

• DB: for every i, it computes S & d values and . w.r.t. other k values, and then takes the max value of them; and then computes the average of c such values. (lower)

• D: for every i, it computes S & d values and . w.r.t. other k values, and then takes the min value of them; and then compute minimum of such c values. (higher)

Page 79: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Index βn : total number of pixels in image x : mean gray value of the image xi : number of pixels in the ith (I = 1,…,c) region obtained by a

segmentation method. xij : gray value of jth pixel (j=1,…, ni) in region i

ix : the mean of ni gray values of ith region. Then

∑∑

∑∑

∑ ∑

∑∑

= =

= =

=

= =

−×

= =

=c

i jixij

c

i jxij

c

i jixij

in

i

c

i jxijn

n

n

n

n

i

i

i

i

x

x

xnn

x

1 1

21 1

2

1 1

21

1 1

21

β

Int. J Remote Sensing, 21(11), 2269-2300, 2000

Page 80: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

DB β

models

0

1

2

3

4

5

6

7

8

9

10

1 2 3 4 5 60

0.2

0.4

0.6

0.8

1

1.2

1 2 3 4 5 6

Trainingsamples

1 2 3 4 5

models Trainingsamples

1 2 3 4 5

Multi-spectral IRS-1A image: Comparison of models Four bands and partially labeled data (3439 out of 512x512) for six classes

Model 1 : 1-NN classifier Model 2 : CI FG + PaRS FS + 1-NN classifier Model 3 : CI FG + NRS FS+ 1-NN classifier Model 4 : CD FG + PaRS FS + 1-NN classifier Model 5 : CD FG + NRS FS + 1-NN classifier

Pattern Recognition, 45(7), 2690-2707, 2012

(p=2, ɸ =0.45 )

Page 81: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Classified IRS-1A images (a) model 1 (β= 6.86, DB= 0.95) (b) model 5 (β= 8.41, DB= 0.73)

Pattern Recognition, 45(7), 2690-2707, 2012

Page 82: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Zoomed region (bridge) of classified IRS-1A image with (a) model 1 (b) model 5

Page 83: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

D(i) quantifies the dispersion of the misclassified patterns into different classes when the true class is i

Given an overlapping of a class with others, lower dispersion is desirable

Page 84: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Model 2

Model 1

Model 3

Model 4

Model 5

Dispersion score of R-F models for six classes of IRS-1A image1: pure water (PW), 2: turbid water (TW), 3: concrete (CON), 4: habitation (HAB), 5: vegetation (VEG), 6: open spaces (OS)

Model 1: 1-NN Model 2 : CI FG + PaRS FS + 1-NN Model 3 : CI FG + NRS FS + 1-NN Model 4 : CD FG + PaRS FS + 1-NN Model 5 : CD FG + NRS FS + 1-NN

Page 85: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

305

310

315

320

325

330

335

340

345

350

355

1 2 3 4 5

Models

Computation time (Sec)

Computation time of R-F models with IRS-1A image (512x512, 4-band image; # train samples 3439; p = 2, ɸ = 0.45; classes: PW, TW, concrete, habitation, vegetation, open space; MATLAB (matrix lab) environment in Pentium-IV with 3.19 GHz processor speed)

1: 1-NN 2: CI FG+PaRS FS+1-NN 3: CI FG+NRS FS+1-NN 4: CD FG+PaRS FS+1-NN 5: CD FG+NRS FS+1-NN

Page 86: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Summary

CD based f-granulation enables memb. functions to explore degree of belonging of features to different classes → better class label estimation

NRS based feature selection (requires no discretization) facilitates to gather local information through neighbor granules for better class discrimination

Classification performance of Model 5 with 10% training is even higher than models incorporating CI + (PaRS or NRS) with 50% training

Significant when scarcity of training samples

70

Page 87: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

So far FGC Now Crisp granules & Fuzzy computation →

GFC

So far Supervised Now Unsupervised

Page 88: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rough-Fuzzy Clustering & Uncertainty Analysis

Example:

Defining Class Exactness in terms of Granules (Clustering – a basic module for data analysis and mining)

Fuzzy sets enable handling of overlapping partitions Rough sets deal with vagueness and incompleteness in class definition Improved performance & faster than fuzzy clustering - GFC

IEEE Trans. Syst., Man and Cyberns., Part B, 37(6), 1529-1540, 2007 IEEE Trans. Knowledge Data Engg., 19(6), 859-872, 2007

71

Page 89: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

IEEE Trans. Syst., Man and Cyberns. Part B, 37(6), 1529-1540, 2007

Integrates the concepts of membership of fuzzy sets, and lower and upper approximations of rough sets into hard clustering

While fuzzy sets enable handling of overlapping partitions, rough sets deal with vagueness and incompleteness in class definition

Rough-Fuzzy Clustering

Page 90: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Only objects in boundary are fuzzified assign µij = 1 for objects in lower approx.

region, while µij in [0, 1] for those in boundary region

assign higher weight for objects in lower approx region as compared to boundary region in computing centroids

influence in computing centroids of own and other clusters (for lower – only on own centroid, for boundary – on all centroids)

Page 91: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Each cluster in rough-fuzzy clustering is represented by: a cluster prototype a crisp core (lower approximation) a fuzzy boundary

Page 92: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Provides a balanced mixture between restricptive partition of hard clustering descriptive partition of fuzzy clustering

Rough-Fuzzy Clustering

• Faster than fuzzy clustering • Better uncertainty handling/ performance

IEEE Trans. Syst., Man and Cyberns. Part B, 37(6), 1529-1540, 2007

Page 93: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Brain MR Images (AMRI, Kolkata)

original HCM FCM RCM RFCMMBP RFCM

IEEE Trans SMC-B, 37(6), 1529-1540, 2007

c = 4 Background, White matter, Gray matter, and Cerebrospinal fluid

Page 94: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Results on Brain MR Images

HCM: hard c-means; FCM: fuzzy c-means; RCM: rough c-means; RFCM(MBP): rough-fuzzy c-means of Mitra et al.; RFCM: rough-fuzzy c-means

DB Index of Different C-Means

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

1 2 3 4

Sample Images

DB In

dex

HCMFCMRCMRFCM(MBP)RFCM

IEEE Trans SMC-B, 37(6), 1529-1540, 2007 LNCS Trans. on Rough Sets, 5390, 114-134, 2008

Page 95: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Results on Brain MR Images

HCM: hard c-means; FCM: fuzzy c-means; RCM: rough c-means; RFCM(MBP): rough-fuzzy c-means of Mitra et al.; RFCM: rough-fuzzy c-means

Execution Time of Different C-Means

0

200

400

600

800

1000

1200

1400

1600

1800

2000

1 2 3 4

Sample Images

Exec

utio

n Ti

me

(in m

illi s

ec)

HCMFCMRCMRFCM(MBP)RFCM

(Pentium IV, 3.2 GHz, 1 MB cache, and 1 GB RAM)

80

Page 96: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Average difference δ between two highest memberships of pixels: δ = 0.145 (low) at 1st iteration; δ = 0.652 (high) at final iteration

Scatter plot of two highest membs. of pixels for MRI Segmentation (256x180 MRI image with 16 bit gray levels, # of pixels 46080; c = 4)

LNCS Trans. on Rough Sets, 5390, 114-134, 2008

Example: R-F c-means based segmentation

Page 97: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Rough-fuzzy -

C-means clustering (numerical data)

C-medoids clustering (string, relational data)

Page 98: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Protein Sequence Analysis &

Determination of Biobases (c-Medoids)

IEEE Trans. Knowledge Data Engg., 19(6), 1-14, 2007

Application to -

Page 99: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Selection of Relevant Genes from Microarray Data using R-F Sets in Information Measure

IEEE Trans, Syst., Man and Cyberns, Part B, 40(3), 741-752, 2010

Application to -

Class independent fuzzy granules for modeling Low, Medium & High from overlapping classes Gene selection → Maximization of relevance to decision attribute and minimization of redundancy with other genes Merits of FEPM Fuzzy granules & Crisp computation → FGC

IEEE Trans, Syst., Man and Cyberns, Part B, 40(3), 741-752, 2010

Page 100: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

So far Image segmentation based on Pixel classification

Now Image segmentation based on Gray level thresholding

We define Generalized Rough Sets as stronger paradigm of uncertainty handling

Page 101: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Generalized Rough Sets

Incorporate fuzziness in set & granules of rough sets

Concept of -

Page 102: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Generalized Rough Sets

. x

Upper Approximation BX

Set X

Lower Approximation BX

[x]B (Granules)

In practice, the Set and Granules, either or both, could be Fuzzy.

Generalized Rough Set Stronger Paradigm for Uncertainty Handling

UB ⊆Ω

IEEE Trans. Syst, Man and Cyberns. Part B, 39(1), 117-128, 2009

Page 103: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Generalized Rough Sets

[ ] RRX u u U u X= | ∈ : ⊆

[ ] RRX u u U u X= | ∈ : ∩ ≠ ∅

X is a crisp set & Granules have crisp boundaries

RX RX< , >The pair is referred to as the rough set of X.

[ ]Ru represents the granule that contains u.

When R is an equivalence relation

Page 104: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

X is a fuzzy set & Granules have crisp boundaries

RX RX< , >The pair is referred to as the rough-fuzzy set of X.

[ ]( inf ( ))

RXz u

RX u z u Uµ∈

= , | ∈

[ ]( sup ( ))

RX

z uRX u z u Uµ

∈= , | ∈

represents the membership function associated with X. Xµ

When R is an equivalence relation

Page 105: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

RX RX< , >The pair is referred to as the fuzzy rough set of X.

X is a crisp set & Granules have fuzzy boundaries R is an equivalence relation

Page 106: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

RX RX< , >The pair is referred to as the fuzzy rough-fuzzy set of X.

X is a fuzzy set & Granules have fuzzy boundaries

R is an equivalence relation

Page 107: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

a measure of inexactness of X

and are the lower and upper approxs. of X

( ) 1RRXXRX

ρ | |= −

| |

RX RX

Roughness Measure

Page 108: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Entropy Measures using Roughness Values

Entropy measures based on roughness values of a set X in U and its complement XC are:

( ) ( )1( ) [ ( ) log ( ) ( ) log ( )]2

CL CR RR R R

X XH X X Xβ βρ ρρ ρ

β β= − +

(1 ( )) (1 ( ))1( ) [ ( ) ( ) ]2

CR RX XE C

R R RH X X Xρ ρρ β ρ β− −= +

Measure using logarithmic gain function:

Measure using exponential gain function:

, eβ ≥

, 1 eβ< ≤

e = 2.71

Page 109: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Plot of logarithmic entropy Plot of exponential entropy

Plots of entropy for different values of base β and gain functions (e = 2.718)

A B

( )R Xρ

( )CR Xρ

90

Page 110: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Example: Image Analysis

R-F entropy takes care of - fuzzy boundaries of regions + rough resemblance between nearby gray levels + rough resemblance between nearby pixels (i.e., fuzziness + granulation)

Several Applications in Data Analysis

Nearby gray levels have limited discernibility Example: A region containing gray values separated by 6 gray levels.

Page 111: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

A grayscale image with sinusoidal gray value gradation. Boundaries can not be defined exactly due to gray value gradation Fuzziness

Example: A portion from the above

image where the pixels in ‘white’ area belong uniquely to a region.

Nearby gray levels have limited discernibility

Example: A small region in the grayscale image containing gray values separated by 6 gray levels.

Example

Granules

Page 112: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Set X is fuzzy & Granules are crisp

x-axis: 0-N gray levels partitioned in crisp granules y-axis: µ values of pixels

Fuzzy entropy: µ value of a pixel is entirely dependent on its own gray value Rough-fuzzy entropy: µ value is dependent on the 1-d gray granule to which it belongs

Entropy based Grayness Ambiguity: Pixel Membership

Pair < > is referred to as the rough-fuzzy set of X

RX RX

Page 113: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Segmentation (and edge extraction):

Minimize GA – w.r.t. crossover point of memb. function μ (assuming fuzzy set and fuzzy granules) Membership of a pixel is dependent on the 1-d gray granule to which it

belongs, and it is independent of its spatial location

Results are compared to those of a fuzzy entropy with no concept of granule. Membership of a pixel is entirely dependent on its own gray value, and it is

also independent of its spatial location

Difference is basically the effect of fuzzy granules

Example: Effect of fuzzy granules

Page 114: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Proposed r-f entropy Fuzzy entropy

Proposed r-f entropy Fuzzy entropy

Proposed r-f entropy Fuzzy entropy Baboon image

Brain MR image

Remote sensing image

Segm

enta

tion

Res

ults

Effect of granules

Page 115: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

β-index for segmentation results on 45 images

Significance of using the concept of f-granules is evident

IEEE Trans. Syst, Man and Cyberns. Part B, 39(1), 117-128, 2009

(window/ granule size ω = 6, Weber’s law)

95

Page 116: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

So far we considered granules of equal size Next, consider granules of unequal size

Page 117: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Formation of Unequal Granules and Spatial Segmentation

Spatial Ambiguity (SA) measure Crisp set and crisp granules Granules formed by quad-tree decomposition Effect of granules of unequal size vis-a-vis fixed size

Example:

Applied Soft Computing, 3(9), 4001-4009, 2013

Page 118: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Example: Quad-tree decomposition and granule formation

Page 119: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Example Comparison

Original

Otsu’s thresholding

RE with 4x4 granule

RE with 6x6 granule

Rough-fuzzy with crisp set and 6x6 granule

Proposed methodology

Page 120: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Variation of β-Index over sequence ‘a’

Homogeneous granules of unequal size reduce the formation of spurious segments → Reduce abrupt change of index-value over frames.

Page 121: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Video Tracking

• Spatial segmentation on each frame +

• Temporal segmentation based on 3 previous frames

Page 122: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Proposed

Sen+Pal Entropy (6x6) Otsu

Pal+Uma+Mitra RE (6x6) Pal+Uma+Mitra RE (4x4)

Page 123: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Relevance to BIG Data handling

Page 124: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Big-Data is

High volume (scalable), high velocity (dynamic), high variety (heterogeneous) information

Usually involves a collection of data sets so large and complex that it becomes difficult to process using conventional data analysis tools

Requires exceptional technologies to efficiently process within tolerable elapsed of times

NEED completely new forms of processing to enable enhanced decision making and knowledge discovery

New approaches – challenges, techniques, tools & architectures to solve new problems

Page 125: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Dealing with big data (Handling challenges lying with all Vs)

Veracity

Variability

Demands a revolutionary change both in Research Methodologies and Tools

Terabyte: 1012 = 10004 Zettabyte: 1021 = 10007

Page 126: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Example: PR (till 80’s) –-> DM (since late Ninties) • New approaches developed for different tasks of PR

to handle DM problems (large data both in size and dimension)

• Example: Feature Selection - where instead of clustering samples in conventional PR, you cluster features themselves in DM

Page 127: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Dealing with big data: Challenges

Challenges include - capture, preprocessing, storage, search, retrieval, analysis, and visualization

Page 128: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Dealing with big data: Tasks

Tasks like:

Data size and feature space adaptation Feature selection/ extraction in Big data Uncertainty modeling in learning, sample selection, and

classification/ clustering on Big data Granular computing (a clump of objects…) Distributed learning techniques in uncertain

environment Uncertainty in cloud computing - -

(Where SC methodologies can be used, in general)

Page 129: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Without “Soft Computing” Machine Intelligence and Data Mining Research Remains Incomplete.

In conclusion -

100

Page 130: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

S K Pal and S C K Shiu, Foundations of Soft Case-Based Reasoning, Wiley, N.Y., 2004

Page 131: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

SK Pal and P Mitra, Pattern Recognition Algorithms for Data Mining, CRC/ Chapman & Hall, Florida, 2004

Page 132: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

S Bandyopadhyay and S K Pal, Classification and Learning Using Genetic Algorithms: Applications in Bioinformatics and Web Intelligence, Springer, Heidelberg, 2007

Page 133: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

P Maji and S K Pal, Rough Fuzzy Pattern Recognition: Applications in Bioinformatics and Medical Imaging, John Wiley-IEEE, N.Y., 2012

Page 134: Soft Computing, Machine Intelligence and Data Miningmiune/LECTURES/SKP_Sikim Univ 2014.pdf · Contents What is Soft Computing ? Pattern Recognition and Machine Intelligence Relevance

Thank You!!


Recommended