Dwm Presentation

Post on 03-Dec-2014

154 views 6 download

Tags:

transcript

Clustering Technique & its Products

Presented By :Shikha Mishra-142Sonal Pal-149Vikram Singh-292

Clustering

It is the task of assigning a set of objects into groups (called clusters) so that the objects in the same cluster are more similar (in some sense or another) to each other than to those in other clusters.

Difference

O Freeware

O Shareware

O Commercial Software

WEKAO Waikato Environment for Knowledge

Analysis is a popular suit for machine learning software written in Java.

O Weka is a free software available under the GNU general public license.

Advantages of WekaO Free availability under the GNU general

public license.O Portability, since it is fully implemented

in the java programming language and thus runs on almost any modern computing platform.

O A comprehensive collection of data processing and modeling techniques.

O Ease of use due to its graphical user interfaces.

KEY FEATURES OF WEKA

Weka supports several standard data mining task-

Data processing.ClusteringClassificationRegression

VisualizationFeature selectionAccess to SQL databases using

JAVA database connectivity.It is not capable of multi-relational

data mining but there is separate software for converting a collection of linked database tables into a single table that is suitable for processing using Weka.

Weka productWeka product

Different algorithms for data mining and machine learning

Different algorithms for data mining and machine learning

Easily useable Easily useable

Platform-independe

nt

Platform-independe

nt

Open source and freely available

Open source and freely available

Flexible facilities for

scripting experiment

Flexible facilities for

scripting experiment

WEKA INTERFACE-O Explorer : An environment for

exploring data with WEKA .

O Experimenter : An environment for performing experiments and conducting statistical tests between learning schemes.

OKnowledge Flow : This environment supports essentially the same functions as the Explorer but with a drag-and-drop interface. One advantage is that it supports incremental learning.

OSimple CLI : Provides a simple command-line interface that allows direct execution of WEKA

commands for operating systems that do not provide their own command line interface.

ARFF FILEAttribute Relationship File Format

(ARFF) is the text format file used by weka to store data in data base.

The ARFF file contains two sections: the header and the data section. The first line of the header tells us the relation name. Then there is the list of the attributes (@attribute...).

BMW CLUSTER DATA IN WEKA

BMW CLUSTER ALGORITHM

CLUSTER ATTRIBUTES

THANK YOU