+ All Categories
Home > Documents > 42410 Data Mining Concepts and Techniques 3305

42410 Data Mining Concepts and Techniques 3305

Date post: 06-Apr-2018
Category:
Upload: pooja-dhanda
View: 221 times
Download: 0 times
Share this document with a friend

of 18

Transcript
  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    1/18

    By POOJA DHANDA

    MCA 2NDYEAR

    410104

    1

    Internet access via tv cable

    network

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    2/18

    y

    Introductiony How accessinternetusingcabletvnetwork

    y Cable modem

    y Connection

    y Workofcable modem

    y Isolationoftv& pc

    y Ethernetovercoaxial adapter

    y Hybrid accesssystem

    y

    Advantages & disadvantages 2

    Contents:

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    3/18

    INTRODUCTION

    y Internet data can betransferredthroughcablenetworkswiredtothe

    usercomputer

    y

    Accessingtheinternet at 10 megabits persecond.

    y Acable modem connectsto a pcusingthesameco-axial cablethat

    brings all channelsto yourtelevision

    y India has a cable penetrationof80 millionhomes,offering avast

    networkfor leveragingtheinternet access.

    3

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    4/18

    4

    How access the internet using

    the cable networky Usethedial-up telephoneservices provided by your

    cablecompanyinconjunctionwith a modem orISDN

    adapter.y Touse a Cable Modem.

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    5/18

    Cable Modem

    y A Cable Modem that allowshigh-speed accesstotheinternetvia a cabletv network.

    y Thiscable modem attachesto computerthrough anEthernet NetworkInterface Card.

    y Ittakes a signal from thecomputer andconvertitfortransmissionoverthecablenetwork.

    5

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    6/18

    how does it connect

    January 19, 2012 Data Mining: Concepts and Techniques 6

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    7/18

    DATA BASE vs. DATA MININGy QUERY

    -Well defined

    - SQL, Xquery

    y QUERY

    -Poorlydefined

    -no precise querylanguage

    7

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    8/18

    Data Mining: A KDD Process

    y Data mining:thecoreofknowledgediscovery process.

    8

    Data Cleaning

    Data Integration

    Databases

    Data Warehouse

    Task-relevant Data

    Selection

    Data Mining

    Pattern Evaluation

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    9/18

    Steps of a KDD Process

    y Learningthe applicationdomainy relevant priorknowledge andgoalsofapplication

    y Creating a targetdata set:data selection

    y Data cleaning and preprocessing: (maytake 60% ofeffort)y Data reduction andtransformation:

    y Finduseful features,dimensionality/variablereduction,invariantrepresentation.

    y Choosingfunctionsofdata miningy

    summarization,classification,regression, association,clustering.y Choosingthe mining algorithm(s)

    y Data mining:searchfor patternsofinterest

    y Patternevaluation andknowledge presentationy visualization,transformation,removingredundant patterns,etc.

    y Useofdiscoveredknowledge9

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    10/18

    Data Mining and Business

    Intelligence

    10

    Increasing potential

    to support

    business decisions End User

    BusinessAnalyst

    Data

    Analyst

    DBA

    Making

    Decisions

    Data Presentation

    Visualization Techniques

    Data Mining

    Information Discovery

    Data Exploration

    OLAP, MDA

    Statistical Analysis, Querying and Reporting

    Data Warehouses / Data Marts

    Data SourcesPaper, Files, Information Providers, Database Systems, OLTP

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    11/18

    Data Mining: On What Kind of Data?

    y Relational databases

    y Datawarehouses

    y Transactional databasey Spatial andtemporal data

    multimedia databases

    11

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    12/18

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    13/18

    Are All the Discovered Patterns

    Interesting?

    y Adata miningsystem/query maygeneratethousandsofpatterns,not

    all ofthem areinteresting.

    y Interestingness measures: A patternisinteresting ifitiseasily

    understood byhumans,validonnewortestdatawithsomedegreeof

    certainty, potentiallyuseful,novel,orvalidatessomehypothesis that a

    userseekstoconfirm

    y Objective vs. subjective interestingness measures:

    y Objective: basedonstatistics andstructuresofpatterns,e.g.,support,

    confidence,etc.

    y Subjective: basedonusers beliefinthedata,e.g.,unexpectedness,novelty,

    actionability,etc.

    13

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    14/18

    Data Mining: Confluence of Multiple

    Disciplines

    14

    Data Mining

    DatabaseTechnology

    Statistics

    OtherDisciplines

    InformationScience

    MachineLearning

    Visualization

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    15/18

    Major Issues in Data Mining (1)

    y Mining methodology anduserinteraction

    y Miningdifferentkindsofknowledgeindatabases

    y

    Incorporationofbackgroundknowledgey Data mining query languages.

    y Expression andvisualizationofdata miningresults

    y Handlingnoise andincompletedata

    y Patternevaluation:theinterestingness problem

    y Performance andscalability

    y Efficiency andscalabilityofdata mining algorithms

    y Parallel,distributed andincremental mining methods

    15

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    16/18

    Major Issues in Data Mining (2)

    y Issuesrelatingtothediversityofdata typesy Handlingrelational andcomplex typesofdata

    y

    Mininginformationfrom heterogeneousdatabases andglobalinformationsystems (WWW)

    y Issuesrelatedto applications andsocial impactsy Applicationofdiscoveredknowledge

    y Domain-specificdata miningtools

    y Intelligent query answering

    y Processcontrol anddecision making

    y Integrationofthediscoveredknowledgewithexistingknowledge: Aknowledgefusion problem

    y Protectionofdata security,integrity, and privacy16

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    17/18

    Summary

    y Data mining:discoveringinteresting patternsfrom large amountsof

    data

    y Anatural evolutionofdatabasetechnology,ingreatdemand,withwide

    applications

    y A KDD processincludesdata cleaning,data integration,data selection,

    transformation,data mining, patternevaluation, andknowledge

    presentation

    y Miningcan be performedin avarietyofinformationrepositories

    y Classificationofdata miningsystems

    y Majorissuesindata mining

    17

  • 8/3/2019 42410 Data Mining Concepts and Techniques 3305

    18/18

    Thank you

    January 19, 2012 Data Mining: Concepts and Techniques 18


Recommended