
SCDL - Data Mining

Date post: 13-Nov-2014
Upload: api-3733148

SCDL – 4th Semester – Data Mining

LIST OF ATTEMPTED QUESTIONS AND ANSWERS  

 Select The Blank  Question   Semantic integration of ________ genome database is the important task of DNA

analysis.  Correct Answer  

Heterogeneous and distributed

  Your Answer   Heterogeneous and distributed

 

 Multiple Choice Single Answer  Question   The main advantage of which of the following methods is its fast processing?

  Correct Answer  

Grid based

  Your Answer   Partitioning based

 

 Select The Blank  Question   With the widespread option of ________ real-time connection is viable for data

warehouse.  Correct Answer  

TCP/IP

  Your Answer   HTTP

  Select The Blank  Question   ________ are responsible for running queries and reports against data warehouse tables.

  Correct Answer  

End users

  Your Answer   End users

 

 Multiple Choice Multiple Answer  Question   Advantages of Wavelet transformation for clustering are :-

  Correct Answer  

Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast

  Your Answer   Unsupervised clustering , Clustering is fast , Decomposition of cluster for accuracy

 

 Multiple Choice Single Answer  Question   Query tool is meant for :-

  Correct Answer  

Data acquisition

  Your Answer   Information delivery

 

 Multiple Choice Single Answer  Question   Which of the following function involves data cleaning, data standardizing and

summarizing?  Correct Answer  

Transforming data

  Your Answer   Storing data

 

 Multiple Choice Multiple Answer  Question   Which of the following clustering analysis method uses multiresolution approach?

  Correct Answer  

STING , Wave Cluster

  Your Answer   STING , Wave Cluster

 

 Multiple Choice Single Answer  Question   Which of the following clustering methods computes an augmented cluster ordering?

  Correct Answer  

OPTICS

  Your Answer   CLIQUE

  Multiple Choice Multiple Answer  Question   Time variant nature of the data in data warehouse :-

  Correct Answer  

Allows for analysis of the past , Relate information to the present , Enables forecasts for the future

  Your Answer   Allows for analysis of the past , Relate information to the present , Enables forecasts for the future  

 True/False  Question   The Structure that brings all the components together is known as Architecture.

  Correct Answer  

True

  Your Answer   True

  Multiple Choice Multiple Answer  Question   Data compression is to compress the given data by encoding in terms of :-

  Correct Answer  

Association rule , Decision tree , Cluster

  Your Answer   Bytes , Cluster

 

 Multiple Choice Multiple Answer  Question   The different definitions of metadata are :-

  Correct Answer  

Data about data , Catalog of data , Data warehouse roadmap

  Your Answer   Data about data , Catalog of data , Data warehouse roadmap

 

 True/False  Question   A distinct feature of DB Miner is its data cube based online analytical mining.


  Correct Answer  

True

  Your Answer   False

  Multiple Choice Single Answer  Question   Association rules mining is based on :-

  Correct Answer  

Clustering and Employing rules for classification

  Your Answer   Clustering and Employing rules for classification

 

 True/False  Question   A distinguishing feature of Clementine is its object oriented extended module interface.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ includes Normalization and Aggregation as data preprocessing procedures.

  Correct Answer  

Data transformation

  Your Answer   Data transformation
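
To illustrate the normalization step named in the question above, here is a minimal Python sketch of min-max normalization, one common way to rescale an attribute. The function name `min_max_normalize` and the sample values are hypothetical and not taken from the quiz.

```python
def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so they fall in [new_min, new_max]."""
    low, high = min(values), max(values)
    if high == low:
        # All values are identical; map everything to new_min (an assumption).
        return [new_min for _ in values]
    scale = (new_max - new_min) / (high - low)
    return [(v - low) * scale + new_min for v in values]


# Hypothetical attribute values, chosen only for illustration.
incomes = [200, 300, 400, 600, 1000]
normalized = min_max_normalize(incomes)
```

Here the smallest value maps to 0.0 and the largest to 1.0, with the others spaced proportionally in between.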

 

 True/False  Question   To remove noise from data is called as Smoothing.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Data matrix is :-

  Correct Answer  

Object by variable structure

  Your Answer   Object by variable structure

  True/False  Question   Data updates are common place in an operational database.

  Correct Answer  

True

  Your Answer   True

 

 True/False  Question   In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by


rectangles  Correct Answer  

False

  Your Answer   True

  True/False  Question   From a data warehouse perspective, data mining can be viewed as an advanced stage of

Online Analytical Processing.  Correct Answer  

True

  Your Answer   True

 

 Match The Following

Question | Correct Answer | Your Answer
Disparate data | Production data | Query and analysis
Non volatile data | Query and analysis | Archive data
Data granularity | Level of detail | Level of detail
Data from external source | External data | External data

 

 Multiple Choice Multiple Answer  Question   In physical design of data warehouse administration provides features like :-

  Correct Answer  

Avoiding reorganizing of tables , Support backup and recovery , Query processing

  Your Answer   Support backup and recovery , Manage store area , Query processing

 

 Select The Blank  Question   ________ is the user who has system access privileges but no database administration

privileges as well as not for table and views.  Correct Answer  

Network administrator

  Your Answer   End user

 

 Multiple Choice Multiple Answer  Question   Data mining Functionalities are :-

  Correct Answer  

Characterization and Discrimination , Association Analysis , Cluster Analysis

  Your Answer   Association Analysis , Cluster Analysis , Time series Data Analysis

  Select The Blank  Question   ________ dimension of database in which primitive level data are spatial but

generalization becomes non spatial.  Correct Answer  

Spatial to non spatial


  Your Answer   Spatial to non spatial

 

 Multiple Choice Multiple Answer  Question   Source Data Component may be grouped into following categories :-

  Correct Answer  

Production Data , Internal External Data

  Your Answer   Internal External Data , Analyzed data , Non Analyzed data

 

 Select The Blank  Question   ________ technique is the statistical technique for analyzing data.

  Correct Answer  

Time series

  Your Answer   Time series

 

 Multiple Choice Multiple Answer  Question   The strategies for data reduction are :-

  Correct Answer  

Data aggregation , Dimension reduction , Numerosity reduction

  Your Answer   Data aggregation , Dimension reduction , Numerosity reduction

 

 Multiple Choice Single Answer  Question   Classification rules are extracted from

  Correct Answer  

Decision Tree

  Your Answer   Root-Node

  Match The Following

Question | Correct Answer | Your Answer
Data Mining | Knowledge discovery | Knowledge discovery
Metadata | Roadmap for user | Details of summary
Data storage | Data management | Data management
Data staging | Workbench for data | Workbench for data

 

 True/False  Question   Data cube stores multidimensional aggregate information.

  Correct Answer  

True

  Your Answer   True

 


 Select The Blank  Question   ________ is the method used to predict the value of a response variable from one or more

variables.  Correct Answer  

Regression

  Your Answer   Regression

  Select The Blank  Question   ________ databases are one of the most popularly available and rich information

repositories.  Correct Answer  

Relational

  Your Answer   Object oriented

 

 True/False  Question   COBWEB is a method of incremental conceptual clustering.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Many methods for data smoothing are also methods for data reduction involving :-

  Correct Answer  

Discretization

  Your Answer   Clustering

 

 Multiple Choice Single Answer  Question   Dimensionality reduction reduces the data set size by removing :-

  Correct Answer  

Irrelevant attributes

  Your Answer   Irrelevant attributes

 

 Multiple Choice Single Answer  Question   The effect of one attribute value on a given class being independent of the values of other attributes is

called  Correct Answer  

Value independence

  Your Answer   Class Conditional independence

  Multiple Choice Single Answer  Question   Which from the following are special programs that are stored on database and fired when

certain predefined action occurs?  Correct Answer  

Triggers

  Your Answer   Triggers


 

 Select The Blank  Question   A web server usually registers ________ entry for every access of a web page

  Correct Answer  

Weblog

  Your Answer   Log

 

 Multiple Choice Single Answer  Question   Bayes Theorem is :-

  Correct Answer  

P(H|X) = P(X|H)P(H)/P(X)

  Your Answer   P(H|X) = P(X|H)P(H)/P(X)
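
As a worked illustration of Bayes' theorem in its standard form, P(H|X) = P(X|H) P(H) / P(X), the following Python sketch computes a posterior probability. The function name `posterior` and all the probabilities are hypothetical values picked for the example; none of them come from the quiz.

```python
def posterior(likelihood, prior, evidence):
    """Return P(H|X) = P(X|H) * P(H) / P(X)."""
    if evidence <= 0:
        raise ValueError("P(X) must be positive")
    return likelihood * prior / evidence


# Hypothetical probabilities, for illustration only.
p_x_given_h = 0.9       # P(X|H): likelihood of the evidence under hypothesis H
p_h = 0.2               # P(H): prior probability of H
p_x_given_not_h = 0.3   # P(X|not H)

# P(X) by the law of total probability over H and not-H.
p_x = p_x_given_h * p_h + p_x_given_not_h * (1 - p_h)

p_h_given_x = posterior(p_x_given_h, p_h, p_x)
```

With these numbers P(X) is 0.42, so the posterior is 0.18 / 0.42, roughly 0.43: observing X raises the probability of H from 0.2 to about 0.43.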

 

 True/False  Question   Visual display can help user to give clear impression and overview of the data

characteristics in a database.  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Which of the following is based on set of density distribution function clustering?

  Correct Answer  

DBSCAN

  Your Answer   DBSCAN

 

 Multiple Choice Multiple Answer  Question   Metadata in a data warehouse falls into following categories :-

  Correct Answer  

Operational Metadata , Extraction and Transformation metadata , End-user Metadata

  Your Answer   Operational Metadata , Extraction and Transformation metadata , End-user Metadata

 

 Multiple Choice Multiple Answer  Question   Knowledge discovery process includes :-

  Correct Answer  

Data Cleaning , Data Integration , Data Selection

  Your Answer   Data Cleaning , Data Integration , Data Selection

 

 Select The Blank  Question   Human beings have around ________ genes.

  Correct Answer  

100000


  Your Answer   1000000

LIST OF ATTEMPTED QUESTIONS AND ANSWERS 

 True/False  Question   A distinguishing feature of Clementine is its object oriented extended module interface.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   Creating ________is violation of Normalization principles.

  Correct Answer  

Array

  Your Answer   Array

 

 True/False  Question   Data Mining refers to extracting knowledge from larger amount of data.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Which of the following grid-based clustering methods explores statistical information?

  Correct Answer  

STING

  Your Answer   CLIQUE

 

 Multiple Choice Multiple Answer  Question   The different definitions of metadata are :-

  Correct Answer  

Data about data , Catalog of data , Data warehouse roadmap

  Your Answer   Catalog of data , Data warehouse roadmap , Brain of data

 

 Select The Blank  Question   In ________ type smoothing, minimum and maximum values in given bin are identified as

bin boundaries.  Correct Answer  

Smoothing by bin boundaries

  Your Answer   Smoothing by medians
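
The idea of smoothing by bin boundaries can be sketched in Python as follows. This is a minimal sketch that assumes equal-frequency bins over sorted values and sends ties to the lower boundary; the function name `smooth_by_bin_boundaries` and the sample prices are hypothetical.

```python
def smooth_by_bin_boundaries(values, bin_size):
    """Replace each value by the closer of its bin's minimum and maximum."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        # The minimum and maximum of the bin act as the bin boundaries.
        low, high = bin_values[0], bin_values[-1]
        for v in bin_values:
            # Ties go to the lower boundary (an assumption of this sketch).
            smoothed.append(low if v - low <= high - v else high)
    return smoothed


# Illustrative sorted prices, partitioned into bins of three values each.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
smoothed_prices = smooth_by_bin_boundaries(prices, 3)
```

For the first bin (4, 8, 15), the value 8 is closer to 4 than to 15, so the bin becomes (4, 4, 15).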

 


 Multiple Choice Single Answer  Question   Query tool is meant for :-

  Correct Answer  

Data acquisition

  Your Answer   Information delivery

 

 True/False  Question   Data cube stores multidimensional aggregate information.

  Correct Answer  

True

  Your Answer   False

  Select The Blank  Question   ________ can store aggregate and detail data at varying levels of resolution or

abstraction.  Correct Answer  

Index tree

  Your Answer   R-Tree

 

 Select The Blank  Question   ________ is the platform for complex data transformation for the purpose of cleansing the data

  Correct Answer  

Separate optimal Platform

  Your Answer   Legacy platform

 

 Multiple Choice Multiple Answer  Question   SMP provides the features like :-

  Correct Answer  

Each node has access to common set of disks , Controllers which are accessible to all processors , Each processor has full access to the shared memory through common bus

  Your Answer   Controllers which are accessible to all processors , Each processor has full access to the shared memory through common bus , It is cluster of nodes  

 Multiple Choice Single Answer  Question   In immediate data extraction, data capture through transaction logs uses the logs maintained

for :-  Correct Answer  

Recovery from failure

  Your Answer   Recovery from failure

 

 Multiple Choice Multiple Answer  Question   In data storage area , DBA uses metadata for processes of :-

  Correct Answer  

Backup , Recovery , Tuning Database

  Your Answer   Backup , Recovery , Management


  Multiple Choice Multiple Answer  Question   Foundation infrastructure of warehouse includes many elements such as :-

  Correct Answer  

Basic Computing platform , Hardware and operating system , DBMS and Query

  Your Answer   Basic Computing platform , DBMS and Query , Query processing components

  Match The Following

Question | Correct Answer | Your Answer
Data producer | Responsible for data quality | Responsible for data quality
Domain values | Prevalent problem | Foreign key preserved
Update security | Prevention of unauthorized updates | Prevalent problem
Referential integrity | Foreign key preserved | Prevention of unauthorized updates

  Select The Blank  Question   ________ is a density-based clustering method which computes an augmented cluster

ordering for automatic and interactive cluster analysis  Correct Answer  

DBSCAN

  Your Answer   DBSCAN

 

 Multiple Choice Multiple Answer  Question   Data compression is to compress the given data by encoding in terms of :-

  Correct Answer  

Association rule , Decision tree , Cluster

  Your Answer   Bytes , Association rule , Decision tree

  Multiple Choice Multiple Answer  Question   Knowledge discovery process includes :-

  Correct Answer  

Data Cleaning , Data Integration , Data Selection

  Your Answer   Data Cleaning , Data Integration , Data movement

  Multiple Choice Multiple Answer  Question   Building blocks of Data Warehouse are :-

  Correct Answer  

Source Data , Data Staging , Management and Control

  Your Answer   Data Staging , Data Manager , Management and Control

 

 True/False  Question   All data extraction, transformation, integration and staging jobs run on selected hardware

under chosen operating system.


  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Real world databases are highly susceptible to noisy, missing and inconsistent data due

to :-  Correct Answer  

Huge size of data

  Your Answer   Huge size of data

 

 Match The Following

Question | Correct Answer | Your Answer
Clustering tool | To group different cases | To detect unusual attribute
Data visualization tool | Transaction activity using graph | To filter unrelated attributes
Linkage analysis tool | To identify links | To group different cases
Classification tool | To filter unrelated attributes | To identify links

  Multiple Choice Multiple Answer  Question   Generalized linear model includes :-

  Correct Answer  

Logistic regression , Poisson regression

  Your Answer   Poisson regression , Linear regression , Polynomial Regression

  True/False  Question   Metadata acts like a nerve center.

  Correct Answer  

True

  Your Answer   False

 

 Multiple Choice Single Answer  Question   OLAP is used for :-

  Correct Answer  

Online Analytical Processing

  Your Answer   Online Application Processing

  Select The Blank  Question   ________ includes Normalization and Aggregation as data preprocessing procedures.

  Correct Answer  

Data transformation


  Your Answer   Data integration

  Multiple Choice Multiple Answer  Question   The dimensions of spatial data cube are :-

  Correct Answer  

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

  Your Answer   Non- spatial dimension , Spatial to non spatial , Spatial to spatial

 

 Multiple Choice Single Answer  Question   Maintenance of cache consistency is the limitation of :-

  Correct Answer  

MPP

  Your Answer   NUMA

 

 Select The Blank  Question   In ________ duplicate sub trees exist within the tree.

  Correct Answer  

Repetition

  Your Answer   Replication

 

 Select The Blank  Question   Indexed ________ engines search the Web, index web pages and build huge keyword-based

indices which help to locate sets of web pages containing certain keywords  Correct Answer  

Web Search

  Your Answer   Web Search

 

 Multiple Choice Single Answer  Question   Redundancies can be deleted by :-

  Correct Answer  

Co-relational analysis

  Your Answer   Coherent analysis

 

 True/False  Question   To detect money laundering and other financial crimes, it is important to integrate

information for multiple databases.  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   Common areas of application for mixed effect model includes :-


  Correct Answer  

Multiple data , Repeated measures data , Block designs

  Your Answer   Multiple data , Dimensional data , Block designs

 

 Select The Blank  Question   In data ________, data encoding or transformations are applied to obtain reduced or

compressed representation.  Correct Answer  

Compression

  Your Answer   Compression

  Multiple Choice Single Answer  Question   Grouped data can be analyzed with the technique :-

  Correct Answer  

Mixed effect model

  Your Answer   Factor analysis

 

 Select The Blank  Question   ________ is the navigational map of data warehouse.

  Correct Answer  

End user Metadata

  Your Answer   Extraction Metadata

 

 Multiple Choice Multiple Answer  Question   Business metadata is useful for :-

  Correct Answer  

Providing support to end users , For external view of data , Provides technical support to search data

  Your Answer   Providing support to end users , For external view of data , Provides technical support to search data  

 True/False  Question   The elements of warehouse infrastructure are classified into operational and physical

infrastructure.  Correct Answer  

True

  Your Answer   True

  Multiple Choice Single Answer  Question   Data reduction by volume can be used for data representation using which type of

reduction?  Correct Answer  

Numerosity reduction

  Your Answer   Histograms

 

 True/False  Question   Descriptive mining tasks perform inference on current data while predictive mining tasks

characterize the general properties of data in the database  Correct Answer  

False

  Your Answer   False

 

 Multiple Choice Single Answer  Question   Classification rules are extracted from

  Correct Answer  

Decision Tree

  Your Answer   Decision Tree

  Multiple Choice Single Answer  Question   Queries run faster to find exact match using which type of indexing?

  Correct Answer  

Clustered index

  Your Answer   Sequential index

 

 Multiple Choice Single Answer  Question   Data can be smoothed by fitting the data to a function such as :-

  Correct Answer  

Regression

  Your Answer   Clustering

  True/False  Question   Data classification is a two-step process in which the first step includes classification of the model

and in second step model describes set of data.  Correct Answer  

False

  Your Answer   True

 

 Select The Blank  Question   In data warehouse architecture, the ________ component interleaves with and connects

other components.  Correct Answer  

Metadata

  Your Answer   Metadata

  True/False  Question   Legacy data resides on Hierarchical or Network database.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer


  Question   Metadata in a data warehouse falls into following categories :-

  Correct Answer  

Operational Metadata , Extraction and Transformation metadata , End-user Metadata

  Your Answer   Operational Metadata , Extraction and Transformation metadata , End-user Metadata

LIST OF ATTEMPTED QUESTIONS AND ANSWERS 

 Multiple Choice Multiple Answer  Question   Metadata is essential for IT for :-

  Correct Answer  

Source data structures , Data summarization

  Your Answer   Web enabling , Source data structures , Data summarization

 

 Multiple Choice Multiple Answer  Question   Financial data called for banking and financial industry are often relatively :-

  Correct Answer  

Complete , Reliable , High Quality

  Your Answer   Complete , Reliable , Correct

 

 Select The Blank  Question   ________ option of warehouse architecture provides incremental growth.

  Correct Answer  

Cluster

  Your Answer   Cluster

 

 Match The Following

Question | Correct Answer | Your Answer
Operating systems compatibility | Security, reliability, availability | Security, reliability, availability
Data Acquisition | Data extraction, transformation, cleansing, integration | Data extraction, transformation, cleansing, integration
Data Storage | Data loading, archiving | Data loading, archiving
Information Delivery | Report generation, query processing and complex analysis | Report generation, query processing and complex analysis

 

 True/False  Question   A cluster is a collection of data objects that are similar to one another within the same cluster and dissimilar to objects in

another cluster.  Correct Answer  

True

  Your Answer   True

 


 Multiple Choice Single Answer  Question   Which of the following method creates copies of data in distributed environment?

  Correct Answer  

Replication

  Your Answer   Replication

 

 True/False  Question   Data cube stores multidimensional aggregate information.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Capture at data source and that's why this method is quite reliable :-

  Correct Answer  

Capture by database Triggers

  Your Answer   Capture in source application

 

 True/False  Question   The Structure that brings all the components together is known as Architecture.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   For Banking and financial data which type of analysis is used?

  Correct Answer  

Multidimensional

  Your Answer   Relational

 

 Multiple Choice Single Answer  Question   Which of the following methods for regression is used on sparse data :-

  Correct Answer  

Regression and log-linear model

  Your Answer   Regression and transformation

 

 Multiple Choice Multiple Answer  Question   Following data transformation methods are used in analysis of time series data :-

  Correct Answer  

Scaling , Normalization , Window stitching

  Your Answer   Scaling , Normalization , Window stitching


 

 Select The Blank  Question   ________ function of data staging component involves many forms of combining pieces

of data from different sources.  Correct Answer  

Data Transformation

  Your Answer   Data Loading

 

 Multiple Choice Single Answer  Question   Real world databases are highly susceptible to noisy, missing and inconsistent data due

to :-  Correct Answer  

Huge size of data

  Your Answer   Relational data

 

 Select The Blank  Question   Creating ________is violation of Normalization principles.

  Correct Answer  

Array

  Your Answer   Structure

 

 Multiple Choice Multiple Answer  Question   The tools of metadata falls in following categories :-

  Correct Answer  

Development tools for IT professional , Information access tool for End user

  Your Answer   Access tool , Development tools for IT professional , Information access tool for End user

 

 True/False  Question   Architecture comes first, tools follows it.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ is an alternative agglomerative hierarchical clustering algorithm.

  Correct Answer  

ROCK

  Your Answer   ROKE

 

 Multiple Choice Single Answer  Question   Which of the following function involves data cleaning, data standardizing and

summarizing?  Correct Answer  

Transforming data


  Your Answer   Transforming data

 

 True/False  Question   In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by

rectangles  Correct Answer  

False

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   In data storage area , DBA uses metadata for processes of :-

  Correct Answer  

Backup , Recovery , Tuning Database

  Your Answer   Backup , Recovery , Tuning Database

 

 Multiple Choice Single Answer  Question   Bayes Theorem is :-

  Correct Answer  

P(H|X) = P(X|H)P(H)/P(X)

  Your Answer   P(X|H)=P(X|H)(PH)/P(X)

 

 True/False  Question   Data cleansing means removing noisy and inconsistent data.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ are responsible for running queries and reports against data warehouse tables.

  Correct Answer  

End users

  Your Answer   End users

 

 Select The Blank  Question   A web server usually registers ________ entry for every access of a web page

  Correct Answer  

Weblog

  Your Answer   Web site

 

 Multiple Choice Multiple Answer  Question   Data processing techniques are :-


  Correct Answer  

Cleansing , Integration , Transformation

  Your Answer   Cleansing , Transformation , Collection

 

 Multiple Choice Single Answer  Question   Data can be smoothed by fitting the data to a function such as :-

  Correct Answer  

Regression

  Your Answer   Clustering

 

 Multiple Choice Single Answer  Question   Deviation based outlier detection identifies outliers by :-

  Correct Answer  

Examining character of objects in groups

  Your Answer   Examining objects in group

 

 Multiple Choice Single Answer  Question   Data partitioning, data clustering are the techniques for :-

  Correct Answer  

Performance enhancement

  Your Answer   Performance enhancement

 

 Multiple Choice Multiple Answer  Question   Following are the issues to consider during data integration :-

  Correct Answer  

Schema integration , Redundancy , Detection and resolution of data values

  Your Answer   Schema integration , Redundancy , Inconsistency

 

 True/False  Question   Management architectural component manages and controls data acquisition functions.

  Correct Answer  

True

  Your Answer   True

 

 Match The Following

Question | Correct Answer | Your Answer
Data loading tool | Primary key generation | Primary key generation
Data modeling tool | Reverse Engineering capabilities | Reverse Engineering capabilities
Data Extraction tool | Bulk extraction for full refresh | Bulk extraction for full refresh
Data transformation tool | Default values | Replication

 

 Multiple Choice Multiple Answer  Question   DNA sequences are comprised of :-

  Correct Answer  

Adenine , Guanine , Thymine

  Your Answer   Adenine , Cytosine , Guanine

 

 Multiple Choice Single Answer  Question   Large number of indexes affects the loading process because :-

  Correct Answer  

Indexes are created for new records

  Your Answer   Indexes are created for old records

 

 Select The Blank  Question   The technique of ________ enables concurrent input/output operations and improves

file's access performance substantially.  Correct Answer  

File striping

  Your Answer   Data migration

 

 Multiple Choice Multiple Answer  Question   Warehouse Operational infrastructure is to support each architecture component consists

of :-  Correct Answer  

People , Procedures , Management software

  Your Answer   People , Procedures , Management software

 

 True/False  Question   In the pruning method, postpruning requires more computation than prepruning yet generally

leads to a more reliable tree.  Correct Answer  

True

  Your Answer   False

 

 Select The Blank  Question   ________ technique can be used to reduce the number of values for a given continuous

attribute by dividing range of attributes into interval.  Correct Answer  

Discretization

  Your Answer   Compression
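
A minimal Python sketch of the technique described in the question above, dividing the range of a continuous attribute into intervals. It assumes equal-width intervals, which is only one of several possible schemes; the function name `equal_width_bins` and the sample ages are hypothetical.

```python
def equal_width_bins(values, n_bins):
    """Map each value to the index of the equal-width interval it falls in."""
    low, high = min(values), max(values)
    width = (high - low) / n_bins
    labels = []
    for v in values:
        index = int((v - low) / width) if width > 0 else 0
        # The maximum value lands exactly on the upper edge; keep it in the last bin.
        labels.append(min(index, n_bins - 1))
    return labels


# Hypothetical continuous attribute, for illustration only.
ages = [5, 17, 22, 38, 41, 64, 80]
age_bins = equal_width_bins(ages, 3)
```

Seven distinct ages are reduced to three interval labels (0, 1 and 2), each covering a range 25 years wide.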

 

 True/False  Question   Data cubes created for varying levels of abstraction are referred as cuboids.


  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Which of the following approach requires more computation?

  Correct Answer  

Filter approach

  Your Answer   Filter approach

 

 Select The Blank  Question   The ________ component consists of all the different ways of making the information from the

data warehouse available to the user.  Correct Answer  

Information Delivery

  Your Answer   Metadata

 

 Multiple Choice Multiple Answer  Question   Data transformation includes :-

  Correct Answer  

Smoothing , Aggregation , Generalization

  Your Answer   Smoothing , Aggregation

 

 Select The Blank  Question   ________ databases are one of the most popularly available and rich information

repositories.  Correct Answer  

Relational

  Your Answer   Relational

 

 True/False  Question   In Linear regression data are modeled to fit a straight line.

  Correct Answer  

True

  Your Answer   True
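
To make the straight-line fit concrete, here is a minimal Python sketch of simple linear regression by ordinary least squares, for a single predictor. The function name `fit_line` and the data points are hypothetical and chosen so that the points lie exactly on a line.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x.
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying on y = 2x + 1, for illustration only.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)
```

Because the sample points lie exactly on a line, the fit recovers a slope of 2 and an intercept of 1; with noisy data the same formulas give the line that minimizes the squared vertical errors.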

 

 Select The Blank  Question   ________ is the method used to predict the value of a response variable from one or more

variables.  Correct Answer  

Regression

  Your Answer   Analysis of variance

 

 Multiple Choice Multiple Answer


  Question   Methods for outlier detection are categorised into following approaches :-

  Correct Answer  

Statistical , Distance based , Deviation based

  Your Answer   Statistical , Distance based , Deviation based
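
Of the three approaches listed, the distance-based one is the easiest to sketch in a few lines. The version below assumes one common formulation: an object is flagged when at least a given fraction of the other objects lie farther away than a given radius. The function name, the parameters `radius` and `min_fraction`, and the one-dimensional sample data are all assumptions for this example.

```python
def distance_based_outliers(points, radius, min_fraction):
    """Flag points for which at least min_fraction of the other points
    lie farther away than radius (one-dimensional data)."""
    outliers = []
    for i, p in enumerate(points):
        far = sum(1 for j, q in enumerate(points) if j != i and abs(p - q) > radius)
        if far / (len(points) - 1) >= min_fraction:
            outliers.append(p)
    return outliers


# Hypothetical measurements with one value far from the rest.
data = [10, 11, 12, 13, 12, 11, 50]
flagged = distance_based_outliers(data, radius=5, min_fraction=0.9)
print(flagged)
```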

 

 Multiple Choice Multiple Answer  Question   Data base miner provides multiple data mining algorithms including :-

  Correct Answer  

Discovery driven OLAP analysis , Association , Classification

  Your Answer   Discovery driven OLAP analysis , Association , Regression

LIST OF ATTEMPTED QUESTIONS AND ANSWERS  

 Multiple Choice Single Answer  Question   Deviation based outlier detection identifies outliers by :-

  Correct Answer   Examining character of objects in groups

  Your Answer   Examining character of objects in groups

 

 Select The Blank  Question   ________ component of warehouse is responsible for coordinating services and

activities within the data warehouse.  Correct Answer   Management and Control

  Your Answer   Management and Control

 

 True/False  Question   Sequential pattern analysis and similarity search techniques have been developed in

data mining.  Correct Answer   True

  Your Answer   True

 

 True/False  Question   A distinct feature of DB Miner is its data cube based online analytical mining.

  Correct Answer   True

  Your Answer   True

 

 Select The Blank  Question   ________ is the user who has system access privileges but no database administration privileges and no privileges on tables and views.

  Correct Answer   Network administrator

  Your Answer   Network administrator

  Select The Blank  Question   For operational system, the stored data contains ________ values.

  Correct Answer   Current data

  Your Answer   Current data

 

 True/False  Question   Intelligent miner is an IBM data mining product.

  Correct Answer   True

  Your Answer   True

  Select The Blank  Question   The technique of ________ enables concurrent input/output operations and improves

file's access performance substantially.  Correct Answer   File striping

  Your Answer   File striping

 

 Multiple Choice Multiple Answer  Question   SMP provides the features like :-

  Correct Answer   Controllers which are accessible to all processors , Each processor has full access to the shared memory through common bus , Each node has access to common set of disks

  Your Answer   Controllers which are accessible to all processors , Each processor has full access to the shared memory through common bus  

 Match The Following

Question | Correct Answer | Your Answer

Incremental data capture | Deferred data capture | Deferred data capture

Initial load of data warehouse | "as-is" data capture | "as-is" data capture

Static data | Capture of data in given point of time | Capture of data in given point of time

Data revision | Incremental data capture | Incremental data capture

  True/False  Question   In the pruning method, postpruning requires more computation than prepruning yet generally leads to a more reliable tree.

  Correct Answer   True

  Your Answer   False

  True/False  Question   Data preprocessing is an important step in knowledge discovery process.

  Correct Answer   True

  Your Answer   True

  Multiple Choice Multiple Answer  Question   The dimensions of spatial data cube are :-

  Correct Answer   Non- spatial dimension , Spatial to non spatial , Spatial to spatial

  Your Answer   Non- spatial dimension , Spatial to non spatial , Spatial to spatial

 

 True/False  Question   Data mining often requires data integration.

  Correct Answer   True

  Your Answer   True

  Multiple Choice Multiple Answer  Question   In data storage area , DBA uses metadata for processes of :-

  Correct Answer   Backup , Recovery , Tuning Database

  Your Answer   Backup , Recovery , Tuning Database

 

 Multiple Choice Single Answer  Question   The effect of one attribute value on a given class being independent of the values of the other attributes is called

  Correct Answer   Value independence

  Your Answer   Attribute conditional independence

 

 Select The Blank  Question   ________ component consists of all the different ways of making the information from the data warehouse available to the user.

  Correct Answer   Information Delivery

  Your Answer   Information Delivery

 

 Multiple Choice Multiple Answer  Question   Data processing techniques are :-

  Correct Answer   Cleansing , Integration , Transformation

  Your Answer   Integration , Transformation , Cleansing

 

 Multiple Choice Single Answer  Question   Data matrix is :-

  Correct Answer   Object by variable structure

  Your Answer   Two mode matrix

  Match The Following

Question | Correct Answer | Your Answer

Information Delivery | Report generation, query processing and complex analysis | Report generation, query processing and complex analysis

Operating systems compatibility | Security, reliability, availability | Security, reliability, availability

Data Acquisition | Data Extraction, Transformation, cleansing, integration | Data Extraction, Transformation, cleansing, integration

Data Storage | Data loading , Archiving | Data loading , Archiving

 

 Select The Blank  Question   In ________ type smoothing, minimum and maximum values in given bin are identified

as bin boundaries.  Correct Answer   Smoothing by bin boundaries

  Your Answer   Smoothing by bin boundaries
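
A minimal sketch of smoothing by bin boundaries follows. It assumes sorted data split into equal-frequency bins, with each value replaced by the closer of its bin's minimum and maximum (ties go to the minimum). The function name and the sample prices are illustrative assumptions.

```python
def smooth_by_boundaries(sorted_values, bin_size):
    """Replace each value by the nearer boundary (min or max) of its bin."""
    smoothed = []
    for start in range(0, len(sorted_values), bin_size):
        bin_values = sorted_values[start:start + bin_size]
        low, high = bin_values[0], bin_values[-1]
        # Ties between the two boundaries are resolved toward the lower one.
        smoothed.extend(low if v - low <= high - v else high for v in bin_values)
    return smoothed


# Hypothetical sorted prices, three values per bin.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
result = smooth_by_boundaries(prices, 3)
print(result)
```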

 

 Multiple Choice Single Answer  Question   Data partitioning, data clustering are the techniques for :-

  Correct Answer   Performance enhancement

  Your Answer   Data extraction

 

 Select The Blank  Question   ________ technique is the statistical technique for analyzing data.

  Correct Answer   Time series

  Your Answer   Survival analysis

 

 Multiple Choice Single Answer  Question   Association rules mining is based on :-

  Correct Answer   Clustering and Employing rules for classification

  Your Answer   Clustering and Employing rules for classification

  Select The Blank  Question   Most of the warehouses employ ________ database Management System.

  Correct Answer   Relational

  Your Answer   Relational

  True/False  Question   NUMA provides better scalability than SMP.

  Correct Answer   True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Classification rules are extracted from

  Correct Answer   Decision Tree

  Your Answer   Decision Tree
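
One way to see how classification rules come out of a decision tree is to emit one IF-THEN rule per root-to-leaf path. The sketch below assumes a tree stored as nested dictionaries; that representation, the function name `extract_rules`, and the sample tree are hypothetical and chosen only for illustration.

```python
def extract_rules(tree, conditions=()):
    """Return one IF-THEN rule per root-to-leaf path of the tree."""
    if not isinstance(tree, dict):
        # A leaf holds the class label.
        antecedent = " AND ".join(conditions) or "TRUE"
        return [f"IF {antecedent} THEN class = {tree}"]
    rules = []
    attribute = tree["attribute"]
    for value, subtree in tree["branches"].items():
        test = f"{attribute} = {value}"
        rules.extend(extract_rules(subtree, conditions + (test,)))
    return rules


# Hypothetical tree: internal nodes test an attribute, leaves are class labels.
tree = {
    "attribute": "age",
    "branches": {
        "youth": {"attribute": "student", "branches": {"no": "no", "yes": "yes"}},
        "middle_aged": "yes",
        "senior": {"attribute": "credit_rating", "branches": {"fair": "yes", "excellent": "no"}},
    },
}
rules = extract_rules(tree)
for rule in rules:
    print(rule)
```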

 

 Multiple Choice Single Answer  Question   Data migration affects performance requiring multiple blocks to be read which can be

adjusted by :-  Correct Answer   Block percent free

  Your Answer   Block percent free

  Multiple Choice Multiple Answer  Question   Source Data Component may be grouped into following categories :-

  Correct Answer   Production Data , Internal External Data

  Your Answer   Production Data , Analyzed data , Non Analyzed data

 

 Multiple Choice Single Answer  Question   Redundancies can be deleted by :-

  Correct Answer   Correlation analysis

  Your Answer   Correlation analysis

  True/False  Question   A distinguishing feature of Clementine is its object oriented extended module interface.

  Correct Answer   True

  Your Answer   True

  Multiple Choice Multiple Answer  Question   The functions of data acquisition are :-

  Correct Answer   Data Transformation , Data Extraction

  Your Answer   Data Extraction , Data Transformation , Data cleansing

  Multiple Choice Single Answer  Question   SMP stands for :-

  Correct Answer   Symmetric Multiprocessing

  Your Answer   Symmetric Multiprocessing

 

 Multiple Choice Multiple Answer  Question   Missing values can be removed by :-

  Correct Answer   Filling values manually , Use of global constant , Use of attribute mean

  Your Answer   Filling values manually , Use of attribute mean
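
Two of the strategies in the correct answer, a global constant and the attribute mean, can be sketched as follows. The function name `fill_missing`, its parameters, and the sample data are assumptions for this example; missing entries are represented here by `None`.

```python
def fill_missing(values, strategy="mean", constant=None):
    """Fill None entries with the attribute mean or with a global constant."""
    if strategy == "constant":
        fill = constant
    elif strategy == "mean":
        known = [v for v in values if v is not None]
        fill = sum(known) / len(known)
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return [fill if v is None else v for v in values]


# Hypothetical income attribute with two missing entries.
incomes = [30, None, 50, 40, None]
by_mean = fill_missing(incomes)
by_constant = fill_missing(incomes, strategy="constant", constant=-1)
print(by_mean)
print(by_constant)
```

Filling values manually, the third option, is not something a short routine can show, since it depends on a person inspecting each record.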

  Multiple Choice Single Answer  Question   Which of the following is used for classification and prediction?

  Correct Answer   Regression trees

  Your Answer   Regression

  Multiple Choice Multiple Answer  Question   Before moving data to data warehouse it has to go through :-

  Correct Answer   Transformation , Integration , Consolidation

  Your Answer   Transformation , Integration , Consolidation

  Select The Blank  Question   ________ is the navigational map of data warehouse.

  Correct Answer   End user Metadata

  Your Answer   Operational Metadata

  True/False  Question   Architecture comes first, tools follow it.

  Correct Answer   True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Which technique analyzes experimental data?

  Correct Answer   Analysis of variance

  Your Answer   Regression

  Multiple Choice Multiple Answer  Question   The need for metadata is for :-

  Correct Answer   Using data warehouse , Building data warehouse , Administration of warehouse

  Your Answer   Building data warehouse , Administration of warehouse

 

 Multiple Choice Single Answer  Question   Development and deployment of your data warehouse is joint effort between :-

  Correct Answer   IT staff and user representatives

  Your Answer   IT staff and user representatives

  Select The Blank  Question   ________ function of data staging component involves many forms of combining pieces

of data from different sources.  Correct Answer   Data Transformation

  Your Answer   Data Transformation

  Multiple Choice Single Answer  Question   Bayes Theorem is :-

  Correct Answer   P(H|X)=P(X|H)P(H)/P(X)

  Your Answer   P(X|H)=P(X|H)(PH)/P(X)
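
A small numeric check of Bayes' theorem, P(H|X) = P(X|H) P(H) / P(X), is sketched below. The probabilities are made-up illustrative values, and P(X) is obtained from the law of total probability over H and not-H.

```python
def posterior(prior_h, likelihood_x_given_h, evidence_x):
    """Bayes' theorem: P(H|X) = P(X|H) * P(H) / P(X)."""
    return likelihood_x_given_h * prior_h / evidence_x


# Hypothetical inputs.
p_h = 0.3              # prior P(H)
p_x_given_h = 0.8      # likelihood P(X|H)
p_x_given_not_h = 0.2  # likelihood P(X|not H)

# Total probability: P(X) = P(X|H)P(H) + P(X|not H)P(not H).
p_x = p_x_given_h * p_h + p_x_given_not_h * (1 - p_h)
p_h_given_x = posterior(p_h, p_x_given_h, p_x)
print(round(p_h_given_x, 4))
```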

 

 Multiple Choice Multiple Answer  Question   When you use a tool for design and development, following things take place with metadata :-

  Correct Answer   Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process

  Your Answer   Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process

 Multiple Choice Multiple Answer  Question   The main categories of Metadata in warehouse are :-

  Correct Answer   Operational , Extraction and transformation Metadata , End user Metadata

  Your Answer   Operational , Extraction and transformation Metadata , End user Metadata

 

 Select The Blank  Question   ________ is the type of pilot for early delivery with broader scope and may be

integrated.  Correct Answer   Broad business pilot

  Your Answer   Proof of concept pilot

 

 True/False  Question   A process of grouping a set of physical or abstract objects into classes of similar objects is called clustering.

  Correct Answer   True

  Your Answer   True

LIST OF ATTEMPTED QUESTIONS AND ANSWERS 

 Multiple Choice Single Answer  Question   Which type of Grid clustering depends on the granularity of lowest level of grid structure?

  Correct Answer  

STING

  Your Answer   OPTICS

 

 Multiple Choice Single Answer  Question   Which of the following option of data extraction is known as application assisted data

capture?  Correct Answer  

Capture in source application

  Your Answer   Capture by comparing files

 

 True/False  Question   Moving data into staging area and performing data transformation function is a part of

data acquisition.  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   The objectives for physical design of data warehouse are :-

  Correct Answer  

Improve performance , Ensure scalability , Manage store

  Your Answer   Improve performance , Ensure scalability , Manage database

 

 Multiple Choice Multiple Answer  Question   User must have proper access to metadata for performing responsibilities of :-

  Correct Answer  

Design , Administration

  Your Answer   Design , Administration , Management

 

 Multiple Choice Multiple Answer  Question   In Intelligent miner the data mining product provides data mining algorithm including

  Correct Answer  

Association , Classification , Regression

  Your Answer   Association , Regression , Aggregation

  Multiple Choice Single Answer  Question   The big difference between data warehouse and any operational system is its :-

  Correct Answer  

Usage

  Your Answer   Organization

  True/False  Question   Loan payment prediction and customer credit analysis are critical to business of bank.

  Correct Answer  

True

  Your Answer   False

 

 Multiple Choice Single Answer  Question   Which of the option is not considered as the major function needed to get data ready?

  Correct Answer  

Storing data

  Your Answer   Extracting data

 

 True/False  Question   In the data acquisition area, the data flow begins at the data sources and pauses at

staging area.  Correct Answer  

True

  Your Answer   True

  Select The Blank  Question   Most of the warehouses employ ________ database Management System.

  Correct Answer  

Relational

  Your Answer   Relational

  Multiple Choice Single Answer  Question   Which of the following is based on set of density distribution function clustering?

  Correct Answer  

DBSCAN

  Your Answer   DBSCAN

  True/False  Question   NUMA provides better scalability than SMP.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   Human beings have around ________ genes.

  Correct Answer  

100000

  Your Answer   100000

 

 True/False  Question   COBWEB is a method of incremental conceptual clustering.

  Correct Answer  

True

  Your Answer   True

 

 Match The Following

Question | Correct Answer | Your Answer

Interactive visual data mining | Visualization tool | Audio signal

Data visualization | Visual display | Graphical display

Data mining result visualization | Presentation of knowledge | Visualization tool

Data mining process visualization | Data mining in visual format | Data mining in visual format

 

 Multiple Choice Single Answer  Question   Deliberate splitting of a table and its index data into manageable part is known as :-

  Correct Answer  

Partitioning

  Your Answer   Decomposing

  Multiple Choice Multiple Answer

  Question   Data mining is applicable to :-

  Correct Answer  

Relational Database , Data Warehouse , Transaction Database

  Your Answer   Relational Database , Data Warehouse , Transaction Database

  True/False  Question   Data mining is not a very powerful tool for vast data such as gene sequences in DNA analysis.

  Correct Answer  

True

  Your Answer   False

 

 True/False  Question   Data cleansing means removing noisy and inconsistent data.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Which of the following is used for classification and prediction?

  Correct Answer  

Regression trees

  Your Answer   Generalized linear model

  Multiple Choice Multiple Answer  Question   Data cleansing routines work to clean the data by :-

  Correct Answer  

Filling missing values , Smoothing noisy data

  Your Answer   Filling missing values , Smoothing noisy data , Resolving inconsistency

 

 Select The Blank  Question   ________ is the method used to predict the value of a response variable from one or more variables.

  Correct Answer  

Regression

  Your Answer   Factor analysis

  Select The Blank  Question   ________ is the type of pilot for early delivery with broader scope and may be integrated.

  Correct Answer  

Broad business pilot

  Your Answer   Proof of concept pilot

 

 Multiple Choice Multiple Answer  Question   In physical design of data warehouse administration provides features like :-

  Correct Answer  

Avoiding reorganizing of tables , Support backup and recovery , Query processing

  Your Answer   Support backup and recovery , Manage store area , Query processing

  Select The Blank  Question   ________ dimension of database in which primitive level data are spatial but

generalization becomes non spatial.  Correct Answer  

Spatial to non spatial

  Your Answer   Spatial

 

 Multiple Choice Single Answer  Question   The data warehouse DBMS executes on :-

  Correct Answer  

Data server component

  Your Answer   Data server component

  True/False  Question   A process of grouping a set of physical or abstract objects into classes of similar objects is called clustering.

  Correct Answer  

True

  Your Answer   False

  Select The Blank  Question   ________ component of warehouse is responsible for coordinating services and activities

within the data warehouse.  Correct Answer  

Management and Control

  Your Answer   Management and Control

 

 Multiple Choice Single Answer  Question   Large number of indexes affects the loading process because :-

  Correct Answer  

Indexes are created for new records

  Your Answer   Records are reshuffled

 

 Match The Following

Question | Correct Answer | Your Answer

Chasm | Challenges | Method to solve problem

Early majority | Nature technology | Technology to die out

Innovators | Method to solve problem | Challenges

Early adopters | Increased interest | Increased interest

  Select The Blank  Question   ________ is an alternative agglomerative hierarchical clustering algorithm.

  Correct Answer  

ROCK

  Your Answer   ROKE

  Multiple Choice Single Answer  Question   Which technique is used to predict categorical response variable?

  Correct Answer  

Discriminant analysis

  Your Answer   Factor analysis

 

 Multiple Choice Single Answer  Question   Deviation based outlier detection identifies outliers by :-

  Correct Answer  

Examining character of objects in groups

  Your Answer   Examining character of objects in groups

  Multiple Choice Multiple Answer  Question   The information delivery methods from data warehouse are :-

  Correct Answer  

Complex queries , MD Analysis , Statistical Analysis

  Your Answer   Complex queries , MD Analysis , ETS System

  True/False  Question   Removing noise from data is called smoothing.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ does not handle categorical attributes.

  Correct Answer  

CURE

  Your Answer   Chameleon

 

 Multiple Choice Multiple Answer

  Question   Data warehouse environment is functionally divided into following areas :-

  Correct Answer  

Data acquisition , Data storage , Information delivery

  Your Answer   Data storage , Information delivery , Data transformation

 

 True/False  Question   Data mining often requires data integration.

  Correct Answer  

True

  Your Answer   True

  Select The Blank  Question   ________ method of regression is useful when errors fail to satisfy normal conditions.

  Correct Answer  

Robust

  Your Answer   Polynomial

 

 Select The Blank  Question   With the widespread option of ________ real-time connection is viable for data

warehouse.  Correct Answer  

TCP/IP

  Your Answer   TCP/IP

 

 Multiple Choice Multiple Answer  Question   The areas of classification for metadata are :-

  Correct Answer  

Development/usage , Technical/business , BackRoom/Front Room

  Your Answer   Development/usage , Technical/business , Administration

  Multiple Choice Multiple Answer  Question   Data base miner provides multiple data mining algorithms including :-

  Correct Answer  

Discovery driven OLAP analysis , Association , Classification

  Your Answer   Association , Classification , Regression

  Select The Blank  Question   The ________ record has a one-to-many relationship with the corresponding fact table records.

  Correct Answer  

Dimension tables

  Your Answer   Fact table

 

 Multiple Choice Single Answer  Question   For Incremental data loads the sequence is :-

  Correct Answer  

Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing

  Your Answer   Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing

 

 Multiple Choice Multiple Answer  Question   The platform of Data warehouse consists of :-

  Correct Answer  

Basic hardware components , Operating System , Network and Network software

  Your Answer   Basic hardware components , Network and Network software , Utility software

 

 Multiple Choice Multiple Answer  Question   The smoothing techniques are :-

  Correct Answer  

Binning , Clustering , Regression

  Your Answer   Clustering , Regression , Insertion

LIST OF ATTEMPTED QUESTIONS AND ANSWERS 

 Select The Blank  Question   ________ method of regression is useful when errors fail to satisfy normal conditions.

  Correct Answer  

Robust

  Your Answer   Robust

  True/False  Question   Data classification is a two-step process in which the first step includes classification of the model and in the second step the model describes a set of data.

  Correct Answer  

False

  Your Answer   True

 

 True/False  Question   Data cleansing means removing noisy and inconsistent data.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   Following factors play important role in financial analysis :-

  Correct Answer  

Data warehouse , Data cubes , Outlier analysis

  Your Answer   Data warehouse , Data cubes , Data accuracy

 

 Multiple Choice Single Answer  Question   Data matrix is :-

  Correct Answer  

Object by variable structure

  Your Answer   Object by object structure

 

 Multiple Choice Multiple Answer  Question   The dimensions of spatial data cube are :-

  Correct Answer  

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

  Your Answer   Non- spatial dimension , Spatial to non spatial , Spatial to spatial

 

 Multiple Choice Single Answer  Question   Query tool is meant for :-

  Correct Answer  

Data acquisition

  Your Answer   Data acquisition

 

 Multiple Choice Single Answer  Question   OLAP is used for :-

  Correct Answer  

Online Analytical Processing

  Your Answer   Online Analytical Processing

 

 True/False  Question   Metadata acts like a nerve center.

  Correct Answer  

True

  Your Answer   True

 

 Match The Following

Question | Correct Answer | Your Answer

Constructive merge | New record supersedes | Populating data warehouse table first time

Initial Load | Populating data warehouse table first time | Populating data warehouse table first time

Incremental Load | Applying ongoing changes | Applying ongoing changes

Load Image | To correspond to target files | Applying data

 

 Multiple Choice Single Answer  Question   Disparity is the significant & disturbing characteristic of which type of data?

  Correct Answer  

Production data

  Your Answer   Production data

  Multiple Choice Single Answer  Question   The effect of one attribute value on a given class being independent of the values of the other attributes is called

  Correct Answer  

Value independence

  Your Answer   Class Conditional independence

 

 True/False  Question   Audio data mining can be an interesting alternative to visual mining.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ platform is the platform on which the data warehouse DBMS runs and

database exist.  Correct Answer  

Data storage

  Your Answer   Data storage

 

 True/False  Question   Smoothing by bin means each value in bin is replaced by the mean value of the bucket.

  Correct Answer  

True

  Your Answer   True
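
Smoothing by bin means can be sketched in a few lines. The example assumes sorted data partitioned into equal-frequency bins; the function name and the sample prices are illustrative assumptions.

```python
def smooth_by_means(sorted_values, bin_size):
    """Replace every value in a bin by the mean of that bin."""
    smoothed = []
    for start in range(0, len(sorted_values), bin_size):
        bin_values = sorted_values[start:start + bin_size]
        mean = sum(bin_values) / len(bin_values)
        smoothed.extend([mean] * len(bin_values))
    return smoothed


# Hypothetical sorted prices, three values per bin.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
means = smooth_by_means(prices, 3)
print(means)
```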

 

 Multiple Choice Single Answer  Question   Following clustering method is classified as being agglomerative or divisive :-

  Correct Answer  

Grid based

  Your Answer   Hierarchical Method

  Multiple Choice Multiple Answer  Question   Data processing is done for :-

  Correct Answer   Improving the efficiency , Ease of mining

  Your Answer   Improving the efficiency , Ease of mining , Removing redundancy

  Multiple Choice Single Answer  Question   For Banking and financial data which type of analysis is used?

  Correct Answer  

Multidimensional

  Your Answer   Relational

 

 Select The Blank  Question   Semantic integration of ________ genome database is the important task of DNA

analysis.  Correct Answer  

Heterogeneous and distributed

  Your Answer   Homogenous and distributed

 

 Multiple Choice Multiple Answer  Question   Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of

classification & prediction are :-  Correct Answer  

Data Cleaning , Relevance Analysis , Data Transformation

  Your Answer   Data Cleaning , Relevance Analysis , Data Transformation

  Multiple Choice Multiple Answer  Question   The functions of data acquisition are :-

  Correct Answer  

Data Extraction , Data Transformation

  Your Answer   Data Extraction , Data Transformation , Data cleansing

 

 Multiple Choice Single Answer  Question   Data partitioning, data clustering are the techniques for :-

  Correct Answer  

Performance enhancement

  Your Answer   Performance enhancement

 

 Multiple Choice Single Answer  Question   The main advantage of which of the following methods is its fast processing?

  Correct Answer  

Grid based

  Your Answer   Partitioning based

  Multiple Choice Multiple Answer  Question   The Main areas of Data Warehouse are :-

  Correct Answer  

Data acquisition , Data Storage , Information Delivery

  Your Answer   Data acquisition , Data Storage , Information Delivery

 

 True/False  Question   From a data warehouse perspective, data mining can be viewed as an advanced stage of Online Analytical Processing.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ is an alternative agglomerative hierarchical clustering algorithm.

  Correct Answer  

ROCK

  Your Answer   ROCK

 

 Select The Blank  Question   ________ is the platform for complex data transformation for the purpose of cleansing it

  Correct Answer  

Separate optimal Platform

  Your Answer   Separate optimal Platform

 

 Multiple Choice Multiple Answer  Question   Metadata recorded in information delivery functional area is related to :-

  Correct Answer  

Predefined queries , Input parameter definition , Reports

  Your Answer   Predefined queries , Reports

 

 True/False  Question   Data cubes created for varying levels of abstraction are referred as cuboids.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Association rules mining is based on :-

  Correct Answer  

Clustering and Employing rules for classification

  Your Answer   Clustering and Employing rules for classification

 

 Select The Blank

  Question   ________ includes Normalization and Aggregation as data preprocessing procedures.

  Correct Answer  

Data transformation

  Your Answer   Data transformation
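
Normalization, one of the two procedures named in the question, can be illustrated with min-max scaling. This is only one normalization scheme among several; the function name, the default target range, and the sample incomes are assumptions for this sketch.

```python
def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Linearly rescale values so they fall in [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant attribute carries no spread to rescale.
        return [new_min for _ in values]
    scale = (new_max - new_min) / (hi - lo)
    return [new_min + (v - lo) * scale for v in values]


# Hypothetical income attribute.
incomes = [12000, 73600, 98000]
normalized = min_max_normalize(incomes)
print([round(v, 3) for v in normalized])
```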

 

 True/False  Question   Moving data into staging area and performing data transformation function is a part of

data acquisition.  Correct Answer  

True

  Your Answer   True

  Multiple Choice Multiple Answer  Question   Methods for outlier detection are categorised into following approaches :-

  Correct Answer  

Statistical , Distance based , Deviation based

  Your Answer   Statistical , Distance based , Deviation based

 

 Multiple Choice Single Answer  Question   The first step of attribute oriented induction is :-

  Correct Answer  

Data focusing

  Your Answer   Data Collection

 

 True/False  Question   Legacy data resides on Hierarchical or Network database.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ option of warehouse architecture provides incremental growth.

  Correct Answer  

Cluster

  Your Answer   Cluster

 

 Multiple Choice Single Answer  Question   Data can be smoothed by fitting the data to a function such as :-

  Correct Answer  

Regression

  Your Answer   Regression

 

 Multiple Choice Single Answer  Question   Many methods for data smoothing are also methods for data reduction involving :-

  Correct Answer  

Discretization

  Your Answer   Regression

 

 Multiple Choice Multiple Answer  Question   Data mining is applicable to :-

  Correct Answer  

Relational Database , Data Warehouse , Transaction Database

  Your Answer   Relational Database , Data Warehouse , Transaction Database

 

 Multiple Choice Multiple Answer  Question   The different definitions of metadata are :-

  Correct Answer  

Data about data , Catalog of data , Data warehouse roadmap

  Your Answer   Data about data , Catalog of data , Data warehouse roadmap

 

 Multiple Choice Single Answer  Question   The data warehouse DBMS executes on :-

  Correct Answer  

Data server component

  Your Answer   Data server component

  Multiple Choice Multiple Answer  Question   Source Data Component may be grouped into following categories :-

  Correct Answer  

Production Data , Internal External Data

  Your Answer   Production Data , Internal External Data , Analyzed data

 

 Select The Blank  Question   ________ databases are one of the most popularly available and rich information repositories.

  Correct Answer  

Relational

  Your Answer   Relational

 

 Match The Following

Question | Correct Answer | Your Answer

Metadata | Roadmap for user | Details of summary

Data storage | Data management | Data management

Data staging | Workbench for data | Workbench for data

Data Mining | Knowledge discovery | Knowledge discovery

  True/False  Question   Data Mining refers to extracting knowledge from larger amount of data.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   Most of the warehouses employ ________ database Management System.

  Correct Answer  

Relational

  Your Answer   Relational

 

 Multiple Choice Single Answer  Question   Classification rules are extracted from

  Correct Answer  

Decision Tree

  Your Answer   Decision Tree

LIST OF ATTEMPTED QUESTIONS AND ANSWERS  

 Multiple Choice Single Answer  Question   The technique of data clustering facilitates :-

  Correct Answer  

Serial access

  Your Answer   Indexed access

  Select The Blank  Question   In ________ type smoothing, minimum and maximum values in given bin are identified as

bin boundaries.  Correct Answer  

Smoothing by bin boundaries

  Your Answer   Smoothing by bin boundaries

 

 Multiple Choice Multiple Answer  Question   The ways of Intra query parallelization are :-

  Correct Answer  

Horizontal parallelization , Vertical Parallelization , Hybrid parallelization

  Your Answer   Vertical Parallelization , Homogenous parallelization

  Select The Blank  Question   ________ technique is the statistical technique for analyzing data.

  Correct Answer  

Time series

  Your Answer   Time series

  True/False  Question   One of the most important search problems in genetic analysis is similarity search and comparison among DNA sequences.

  Correct Answer  

True

  Your Answer   True

  Multiple Choice Multiple Answer  Question   User must have proper access to metadata for performing responsibilities of :-

  Correct Answer  

Design , Administration

  Your Answer   Administration , Management , Accessing

  Multiple Choice Single Answer  Question   Association rules mining is based on :-

  Correct Answer  

Clustering and Employing rules for classification

  Your Answer   Clustering and Employing rules for classification

 

 Select The Blank  Question   ________ is the platform for complex data transformation for the purpose of cleansing it

  Correct Answer  

Separate optimal Platform

  Your Answer   Legacy platform

 

 Multiple Choice Multiple Answer  Question   Classification and Prediction have following applications :-

  Correct Answer  

Credit approval , Medical Diagnosis , Performance Prediction

  Your Answer   Credit approval , Selective Marketing

 

 Multiple Choice Multiple Answer  Question   In data storage area , DBA uses metadata for processes of :-

  Correct Answer  

Tuning Database , Backup , Recovery

  Your Answer   Tuning Database , Management

 

 Multiple Choice Single Answer  Question   Data can be smoothed by fitting the data to a function such as :-

  Correct Answer  

Regression

  Your Answer   Binning

 

 True/False  Question   Tools perform major functions in data warehouse environment.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ option of warehouse architecture provides incremental growth.

  Correct Answer  

Cluster

  Your Answer   Cluster

 

 True/False  Question   Data staging and data storage may start out on same computing platform.

  Correct Answer  

True

  Your Answer   False

 

 Match The Following  Question   Correct Answer   Your Answer

Middleware & connectivity tool Transparent access to source system Assist data warehouse administration

Data Quality tool Locating data errors Locating data errors

OLAP tools Channel queries Channel queries

Alert system tool Users attention on exceptions Users attention on exceptions

 

 Multiple Choice Single Answer  Question   Attribute construction is the part of :-

  Correct Answer  

Transformation


  Your Answer   Smoothing

 

 Multiple Choice Single Answer  Question   Which from the following are special programs that are stored on database and fired when

certain predefined action occurs?  Correct Answer  

Triggers

  Your Answer   Triggers

 

 True/False  Question   Data cube stores multidimensional aggregate information.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Deliberate splitting of a table and its index data into manageable part is known as :-

  Correct Answer  

Partitioning

  Your Answer   Partitioning

 

 Multiple Choice Single Answer  Question   Effect of one attribute value on a given class is independent of values of other attributes is

called  Correct Answer  

Value independence

  Your Answer   Attribute conditional independence

 

 Multiple Choice Single Answer  Question   Simple matching approach is used for computing dissimilarity between two objects for :-

  Correct Answer  

Nominal variable

  Your Answer   Invariant variable
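For reference, the simple matching approach for nominal variables is commonly written as d(i, j) = (p - m) / p, where p is the number of variables and m is the number of variables on which the two objects agree. The sketch below is illustrative only; the function name and the sample objects are assumptions, not part of the quiz.

```python
def simple_matching_dissimilarity(a, b):
    """Dissimilarity between two objects described by nominal variables.

    Computes (p - m) / p, where p is the number of variables and
    m is the number of matching variable values.
    """
    if len(a) != len(b):
        raise ValueError("objects must have the same number of variables")
    p = len(a)
    m = sum(1 for x, y in zip(a, b) if x == y)
    return (p - m) / p


# Hypothetical objects with three nominal variables: colour, shape, size.
obj1 = ["red", "round", "small"]
obj2 = ["red", "square", "small"]
print(simple_matching_dissimilarity(obj1, obj2))  # one mismatch out of three
```

Identical objects give 0.0 and objects that differ on every variable give 1.0.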

 

 True/False  Question   Data mining is not a very powerful tool for vast data such as gene sequences in DNA

analysis.  Correct Answer  

True

  Your Answer   False

 

 Multiple Choice Single Answer  Question   Following clustering method is classified as being agglomerative or divisive :-


  Correct Answer  

Grid based

  Your Answer   Density based

 

 Select The Blank  Question   ________ is the user who has system access privileges but no database administration

privileges as well as not for table and views.  Correct Answer  

Network administrator

  Your Answer   Network administrator

 

 Select The Blank  Question   ________ clustering method follows statistical and neural network approach.

  Correct Answer  

Model based

  Your Answer   Grid based

 

 True/False  Question   COBWEB is a method of incremental conceptual clustering.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   The different analysis tools which are useful to detect unusual patterns such as large

amount of cash flow at certain period by certain group of people are :-  Correct Answer  

Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool

  Your Answer   Linkage analysis tool , Outlier analysis tool , Complexity definition tool

 

 Multiple Choice Multiple Answer  Question   DNA sequences are comprised of :-

  Correct Answer  

Adenine , Guanine , Thymine

  Your Answer   Adenine , Cytosine , Guanine , Thymine

 

 True/False  Question   Management architectural component manages and controls data acquisition functions.

  Correct Answer  

True

  Your Answer   False

 

 Multiple Choice Single Answer


  Question   If many indexes are needed, then on which table which option is more preferable?

  Correct Answer  

Splitting of tables

  Your Answer   Splitting of tables

 

 True/False  Question   To detect money laundering and other financial crimes, it is important to integrate

information for multiple databases.  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   It is good practice to drop ________ before initial load.

  Correct Answer  

Index

  Your Answer   Index

 

 Multiple Choice Single Answer  Question   Classification rules are extracted from

  Correct Answer  

Decision Tree

  Your Answer   Decision Tree

 

 True/False  Question   All data extraction, transformation, integration and staging jobs run on selected hardware

under chosen operating system.  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   Metadata in a data warehouse falls into following categories :-

  Correct Answer  

Operational Metadata , Extraction and Transformation metadata , End-user Metadata

  Your Answer   Operational Metadata , Extraction and Transformation metadata , End-user Metadata

 

 Multiple Choice Single Answer  Question   Deviation based outlier detection identifies outliers by :-

  Correct Answer  

Examining character of objects in groups

  Your Answer   Examining character of objects in groups


 

 Select The Blank  Question   ________ method of regression is useful when errors fail to satisfy normality conditions.

  Correct Answer  

Robust

  Your Answer   Polynomial

 

 Multiple Choice Multiple Answer  Question   The functional areas of metadata are :-

  Correct Answer  

Data Acquisition , Data storage , Information delivery

  Your Answer   Data Acquisition , Data storage , Information delivery

 

 Match The Following  Question   Correct Answer   Your Answer

Load Utility High performance data loading, recovery High performance data loading, recovery

Query Governor Abort runaway query Balancing extraction of query

Query Optimizer Parsing, optimizing query Parsing, optimizing query

Query Management Balancing extraction of query Execution and rescheduling queries

 

 Multiple Choice Single Answer  Question   The first step of attribute oriented induction is :-

  Correct Answer  

Data focusing

  Your Answer   Data Classification

 

 Multiple Choice Single Answer  Question   Dimensionality reduction reduces the data set size by removing :-

  Correct Answer  

Irrelevant attributes

  Your Answer   Irrelevant attributes

 

 True/False  Question   Architecture comes first, tools follow it.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer


  Question   Data cleansing routines work to clean the data by :-

  Correct Answer  

Filling missing values , Smoothing noisy data

  Your Answer   Smoothing noisy data , Resolving inconsistency

 

 Select The Blank  Question   ________ is the method used to predict the value of response variable from one or more

variables.  Correct Answer  

Regression

  Your Answer   Factor analysis

 

 Select The Blank  Question   Most of the warehouses employ ________ database Management System.

  Correct Answer  

Relational

  Your Answer   Multidimensional

 

 Multiple Choice Single Answer  Question   Which of the following method creates copies of data in distributed environment?

  Correct Answer  

Replication

  Your Answer   Replication

 

 Select The Blank  Question   Human beings have around ________ genes.

  Correct Answer  

100000

  Your Answer   100000

LIST OF ATTEMPTED QUESTIONS AND ANSWERS 

 Multiple Choice Multiple Answer  Question   DNA sequences are comprised of :-

  Correct Answer  

Guanine , Thymine , Adenine

  Your Answer   Guanine , Thymine , Adenine , Cytosine

 

 Multiple Choice Multiple Answer  Question   Source Data Component may be grouped into following categories :-

  Correct Answer  

Production Data , Internal External Data

  Your Answer   Production Data , Internal External Data

 

 True/False  Question   Loan payment prediction and customer credit analysis are critical to business of bank.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of

classification & prediction are :-  Correct Answer  

Data Cleaning , Relevance Analysis , Data Transformation

  Your Answer   Data Cleaning , Relevance Analysis , Data Transformation

 

 Multiple Choice Single Answer  Question   The big difference between data warehouse and any operational system is its :-

  Correct Answer  

Usage

  Your Answer   Usage

 

 True/False  Question   Data cleansing means removing noisy and inconsistent data.

  Correct Answer  

True

  Your Answer   True

 

 True/False  Question   Moving data into staging area and performing data transformation function is a part of

data acquisition.  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ option of warehouse architecture provides incremental growth.

  Correct Answer  

Cluster

  Your Answer   Cluster

 

 Select The Blank


  Question   For an operational system, the stored data contains ________ values.

  Correct Answer  

Current data

  Your Answer   Current data

 

 Multiple Choice Multiple Answer  Question   Splitting of data into smaller partition decision tree induction is prone to :-

  Correct Answer  

Fragmentation , Replication , Repetition

  Your Answer   Fragmentation , Generalization

 

 Select The Blank  Question   ________ includes Normalization and Aggregation as data preprocessing procedures.

  Correct Answer  

Data transformation

  Your Answer   Data transformation

 

 Multiple Choice Single Answer  Question   Bitmapped indexes are more suitable for data warehouse environment than for an OLTP

system  Correct Answer  

Bitmapped index

  Your Answer   Clustered index

 

 True/False  Question   Data updates are commonplace in an operational database.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ is the type of pilot for early delivery with broader scope and may be integrated.

  Correct Answer  

Broad business pilot

  Your Answer   User tool appreciation

 

 Match The Following  Question   Correct Answer   Your Answer

Data Mining Knowledge discovery Knowledge discovery

Metadata Roadmap for user Roadmap for user

Data storage Data management Data management

Data staging Workbench for data Workbench for data

 

 Multiple Choice Single Answer  Question   A gene is usually comprised of hundreds of individual :-

  Correct Answer  

Nucleotides

  Your Answer   Nucleotides

 

 True/False  Question   The Structure that brings all the components together is known as Architecture.

  Correct Answer  

True

  Your Answer   True

 

 True/False  Question   NUMA provides better scalability than SMP.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Deviation based outlier detection identifies outliers by :-

  Correct Answer  

Examining character of objects in groups

  Your Answer   Examining distance between objects

 

 Select The Blank  Question   ________ is density based clustering method which computes an augmented cluster

ordering for automatic and interactive cluster analysis  Correct Answer  

DBSCAN

  Your Answer   DBSCAN

 

 Multiple Choice Single Answer  Question   Enterprise miner technique provides data mining algorithms including distinguishing

feature as :-  Correct Answer  

Advanced Statistical and advanced visualization tool

  Your Answer   Advanced Statistical and classification tool

 

 Match The Following


Question Correct Answer Your Answer

Load Image To correspond to target files To correspond to target files

Constructive merge New record supersedes New record supersedes

Initial Load Populating data warehouse table first time Populating data warehouse table first time

Incremental Load Applying ongoing changes Applying ongoing changes

 

 Multiple Choice Multiple Answer  Question   Advantages of Wavelet transformation for clustering are :-

  Correct Answer  

Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast

  Your Answer   Unsupervised clustering , Clustering is fast

 

 Select The Blank  Question   With the widespread option of ________ real-time connection is viable for data

warehouse.  Correct Answer  

TCP/IP

  Your Answer   HTTP

 

 True/False  Question   A process of grouping a set of physical or abstract objects into classes of similar objects is

called clustering.  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Development and deployment of your data warehouse is joint effort between :-

  Correct Answer  

IT staff and user representatives

  Your Answer   IT staff and user representatives

 

 Multiple Choice Single Answer  Question   Attribute construction is the part of :-

  Correct Answer  

Transformation

  Your Answer   Aggregation

 

 Multiple Choice Single Answer  Question   Which of the following data warehouse component includes dependent data marts,

special multidimensional database and full range of query and reporting facilities?


  Correct Answer  

Information Delivery component

  Your Answer   Data Staging component

 

 Multiple Choice Single Answer  Question   Which technique analyzes experimental data?

  Correct Answer  

Analysis of variance

  Your Answer   Analysis of variance

 

 Select The Blank  Question   ________ function of data staging component involves many forms of combining pieces

of data from different sources.  Correct Answer  

Data Transformation

  Your Answer   Data Transformation

 

 Multiple Choice Multiple Answer  Question   Metadata is essential for IT for :-

  Correct Answer  

Source data structures , Data summarization

  Your Answer   Source data structures , Data summarization , Aggregation

 

 Multiple Choice Multiple Answer  Question   Methods for outlier detection are categorised into following approaches :-

  Correct Answer  

Statistical , Distance based , Deviation based

  Your Answer   Statistical , Distance based , Deviation based

 

 Select The Blank  Question   ________ are responsible for running queries and reports against data warehouse tables.

  Correct Answer  

End users

  Your Answer   Query tool specialist

 

 Select The Blank  Question   ________ is the user who has system access privileges but no database administration

privileges as well as not for table and views.  Correct Answer  

Network administrator

  Your Answer   End user

 

 Multiple Choice Multiple Answer


  Question   Data base miner provides multiple data mining algorithms including :-

  Correct Answer  

Discovery driven OLAP analysis , Association , Classification

  Your Answer   Discovery driven OLAP analysis , Association , Classification

 

 Multiple Choice Multiple Answer  Question   Knowledge discovery process includes :-

  Correct Answer  

Data Cleaning , Data Integration , Data Selection

  Your Answer   Data Cleaning , Data Integration , Data Selection

 

 True/False  Question   In Linear regression data are modeled to fit a straight line.

  Correct Answer  

True

  Your Answer   True
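To make the statement above concrete, a straight line y = slope * x + intercept can be fitted to data by ordinary least squares. The following is a minimal sketch under that assumption; the function name and the sample points are hypothetical and not taken from the quiz.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical points lying exactly on y = 2x + 1.
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(slope, intercept)
```

The fitted line can then be used to predict the response for a new x as `slope * x + intercept`.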

 

 True/False  Question   Data in data warehouse cuts across application.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   If many indexes are needed, then on which table which option is more preferable?

  Correct Answer  

Splitting of tables

  Your Answer   Rearranging of tables

 

 Multiple Choice Single Answer  Question   Which technique is used to predict categorical response variable?

  Correct Answer  

Discriminant analysis

  Your Answer   Discriminant analysis

 

 Multiple Choice Multiple Answer  Question   Following data transformation methods are used in analysis of time series data :-

  Correct Answer  

Scaling , Normalization , Windows Stitching

  Your Answer   Scaling , Normalization , Windows Stitching


 

 Multiple Choice Single Answer  Question   Concept Description generates description for :-

  Correct Answer  

Characterisation and Comparison

  Your Answer   Characterisation and Comparison

 

 True/False  Question   Data preprocessing is an important step in knowledge discovery process.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ databases are one of the most popularly available and rich information

repositories.  Correct Answer  

Relational

  Your Answer   Relational

 

 Multiple Choice Multiple Answer  Question   Data Mining means :-

  Correct Answer  

Knowledge mining from database , Data/Pattern analysis , Data Archaeology

  Your Answer   Knowledge mining from database , Data/Pattern analysis , Data Archaeology

 

 Multiple Choice Single Answer  Question   What improves accuracy and speed of subsequent mining process?

  Correct Answer  

Integration

  Your Answer   Integration

 

 Multiple Choice Multiple Answer  Question   Data mining is applicable to :-

  Correct Answer  

Relational Database , Data Warehouse , Transaction Database

  Your Answer   Relational Database , Data Warehouse , Transaction Database

LIST OF ATTEMPTED QUESTIONS AND ANSWERS  

 True/False


  Question   Data cube stores multidimensional aggregate information.

  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ is a summarization of general characteristics or features of a target class of data.

  Correct Answer  

Data Characterization

  Your Answer   Data Generalization

 

 Multiple Choice Single Answer  Question   The pilot which is useful for user and project team both as it touches all important functions

is :-  Correct Answer  

Expanded seed pilot

  Your Answer   User tool appreciation pilot

 

 Multiple Choice Single Answer  Question   Which of the following technique involves placing and managing related units of data in

same physical block of storage  Correct Answer  

Clustering

  Your Answer   Clustering

 

 Multiple Choice Multiple Answer  Question   History of metadata includes :-

  Correct Answer  

Changes to source system , Data extraction methods , Data transformation algorithm

  Your Answer   Changes to source system , Data extraction methods

 

 Multiple Choice Single Answer  Question   Which of the following approach requires more computation?

  Correct Answer  

Filter approach

  Your Answer   Filter approach

 

 True/False  Question   A substantial part of historical data comes from antiquated legacy systems.

  Correct Answer  

True

  Your Answer   True


 

 Multiple Choice Multiple Answer  Question   Data reduction includes :-

  Correct Answer  

Singular value decomposition , Wavelets , Regression

  Your Answer   Singular value decomposition , Wavelets , Regression

 

 Multiple Choice Single Answer  Question   Bayes Theorem is :-

  Correct Answer  

P(H|X) = P(X|H)P(H)/P(X)

  Your Answer   P(H|X) = P(X|H)P(H)/P(X)
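In its standard form, Bayes' theorem gives the posterior as P(H|X) = P(X|H) * P(H) / P(X). A small worked sketch follows; the function names and the probabilities are made up for illustration, and the evidence P(X) is expanded with the law of total probability over H and not-H.

```python
def evidence(prior_h, likelihood_x_given_h, likelihood_x_given_not_h):
    """P(X) = P(X|H) * P(H) + P(X|not H) * P(not H)."""
    return (likelihood_x_given_h * prior_h
            + likelihood_x_given_not_h * (1.0 - prior_h))


def posterior(prior_h, likelihood_x_given_h, evidence_x):
    """P(H|X) = P(X|H) * P(H) / P(X)."""
    if evidence_x <= 0:
        raise ValueError("P(X) must be positive")
    return likelihood_x_given_h * prior_h / evidence_x


# Hypothetical numbers: P(H) = 0.01, P(X|H) = 0.9, P(X|not H) = 0.05.
p_x = evidence(0.01, 0.9, 0.05)
print(posterior(0.01, 0.9, p_x))  # roughly 0.154
```

Even with a high likelihood P(X|H), a small prior P(H) keeps the posterior low, which is why the P(H) term in the numerator matters.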

 

 Multiple Choice Single Answer  Question   Establish the importance of data quality, Form data quality steering committee, Institute a

data quality framework, Assign roles and responsibilities. These are the steps of :-  Correct Answer  

Data purification

  Your Answer   Data quality control

 

 Select The Blank  Question   With the widespread option of ________ real-time connection is viable for data warehouse.

  Correct Answer  

TCP/IP

  Your Answer   TCP/IP

 

 Multiple Choice Single Answer  Question   Which is the typical example of Grid based clustering method

  Correct Answer  

STING

  Your Answer   STING

 

 Match The Following  Question   Correct Answer   Your Answer

Normalization Scattered data Constructing small units of data

Smoothing Removal of noisy data Removal of noisy data

Aggregation Summary operations Constructing new attributes

Generalization Data hierarchies Data hierarchies

 

 Multiple Choice Single Answer


  Question   Association rules mining is based on :-

  Correct Answer  

Clustering and Employing rules for classification

  Your Answer   Clustering and Employing rules for classification

 

 True/False  Question   Bitmapped indexing does not apply to fault tables.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   For processing metadata in information delivery area, data can be referred back for :-

  Correct Answer  

Source data configuration , Data structure , Data transformation

  Your Answer   Source data configuration , Data structure , Data transformation

 

 True/False  Question   The precision measure is the % of retrieved documents that are in fact relevant to query.

  Correct Answer  

True

  Your Answer   False
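The precision measure described above can be computed directly from the sets of relevant and retrieved documents. This is an illustrative sketch; the function name and the document identifiers are assumptions for the example.

```python
def precision(relevant, retrieved):
    """Fraction of retrieved documents that are relevant to the query."""
    retrieved = set(retrieved)
    if not retrieved:
        return 0.0  # convention chosen for this sketch when nothing is retrieved
    return len(set(relevant) & retrieved) / len(retrieved)


# Hypothetical document identifiers.
relevant_docs = {"d1", "d2", "d3"}
retrieved_docs = {"d2", "d3", "d4", "d5"}
print(precision(relevant_docs, retrieved_docs))  # 0.5
```

Multiplying the result by 100 gives the percentage form used in the question.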

 

 Multiple Choice Single Answer  Question   The main advantage of which of the following methods is its fast processing?

  Correct Answer  

Grid based

  Your Answer   Grid based

 

 Select The Blank  Question   Analysis of frequent sequential patterns is important in analysis of ________ in genetic

sequence.  Correct Answer  

Dissimilarity and similarity

  Your Answer   Similarity

 

 Select The Blank  Question   ________ is the clustering method which encounters difficulties regarding the selection of

merge/split points  Correct Answer  

Hierarchical

  Your Answer   Hierarchical


 

 Multiple Choice Single Answer  Question   Following clustering method is classified as being agglomerative or divisive :-

  Correct Answer  

Grid based

  Your Answer   Grid based

 

 Multiple Choice Multiple Answer  Question   Normalization improves :-

  Correct Answer  

Efficiency , Accuracy

  Your Answer   Efficiency , Accuracy

 

 Multiple Choice Single Answer  Question   A Wavelet transformation is :-

  Correct Answer  

Signal processing technique that decomposes signals into different frequency subbands

  Your Answer   Signal processing technique that decomposes signals into different frequency subbands

 

 Multiple Choice Single Answer  Question   The Clustering method DBSCAN stands for :-

  Correct Answer  

Density Based Spatial Clustering of Applications with Noise

  Your Answer   Density Based Spatial Clustering of Applications with Noise

 

 Select The Blank  Question   ________ can store aggregate and detail data at varying levels of resolution or abstraction.

  Correct Answer  

Index tree

  Your Answer   Index tree

 

 Multiple Choice Single Answer  Question   Behavioral data of objects can be derived by the application of :-

  Correct Answer  

Method

  Your Answer   Method

 

 Select The Blank  Question   ________ is the type of pilot for early delivery with broader scope and may be integrated.

  Correct Answer  

Broad business pilot


  Your Answer   Broad business pilot

 

 Multiple Choice Multiple Answer  Question   Metadata types can be classified as :-

  Correct Answer  

Business metadata , Technical metadata

  Your Answer   Business metadata , Technical metadata

 

 Multiple Choice Single Answer  Question   Simple matching approach is used for computing dissimilarity between two objects for :-

  Correct Answer  

Nominal variable

  Your Answer   Nominal variable

 

 Multiple Choice Single Answer  Question   Effect of one attribute value on a given class is independent of values of other attributes is

called  Correct Answer  

Value independence

  Your Answer   Value independence

 

 Multiple Choice Multiple Answer  Question   The different analysis tools which are useful to detect unusual patterns such as large

amount of cash flow at certain period by certain group of people are :-  Correct Answer  

Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool

  Your Answer   Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool

 

 Multiple Choice Single Answer  Question   When DDL statements are created using database software, so to create an index system

creates :-  Correct Answer  

B-Tree index

  Your Answer   B-Tree index

 

 Multiple Choice Multiple Answer  Question   Data processing techniques are :-

  Correct Answer  

Cleansing , Integration , Transformation

  Your Answer   Cleansing , Integration , Transformation

 

 Match The Following  Question   Correct Answer   Your Answer

Load Utility High performance data loading, recovery High performance data loading, recovery

Query Governor Abort runaway query Abort runaway query

Query Optimizer Parsing, optimizing query Parsing, optimizing query

Query Management Balancing extraction of query Balancing extraction of query

 

 Select The Blank  Question   Indexed ________ engines search index, web pages and build huge keyword based indices

which help to search sets of web pages containing certain keywords  Correct Answer  

Web Search Engines

  Your Answer   Web Search Engines

 

 True/False  Question   To detect money laundering and other financial crimes, it is important to integrate

information for multiple databases.  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   ________ is the time consuming and less feasible approach for filling missing values.

  Correct Answer  

Filling missing values manually

  Your Answer   Filling missing values manually

 

 Multiple Choice Single Answer  Question   Which from the following is used for classification and prediction?

  Correct Answer  

Regression trees

  Your Answer   Regression trees

 

 Multiple Choice Multiple Answer  Question   Multimedia database stores and manages large collection of database such as :-

  Correct Answer  

Audio and Video , Sequence data , Text Markup and linkage

  Your Answer   Audio and Video , Sequence data

 

 Select The Blank  Question   ________ is an alternative agglomerative hierarchical clustering algorithm.

  Correct Answer  

ROCK


  Your Answer   ROCK

 

 Multiple Choice Single Answer  Question   Association rules mining is based on :-

  Correct Answer  

Clustering and Employing rules for classification

  Your Answer   Clustering and Employing rules for classification

 

 Multiple Choice Single Answer  Question   Data matrix is :-

  Correct Answer  

Object by variable structure

  Your Answer   Object by variable structure

 

 Select The Blank  Question   ________ architecture is more concerned with data access than memory access.

  Correct Answer  

MPP

  Your Answer   MPP

 

 True/False  Question   Architecture comes first, tools follow it.

  Correct Answer  

True

  Your Answer   True

 

 True/False  Question   Task of selection in data transformation forms part of extraction function.

  Correct Answer  

True

  Your Answer   False

 

 Multiple Choice Single Answer  Question   Classification rules are extracted from

  Correct Answer  

Decision Tree

  Your Answer   Decision Tree

 

 Select The Blank  Question   ________ includes Normalization and Aggregation as data preprocessing procedures.


  Correct Answer  

Data transformation

  Your Answer   Data transformation

LIST OF ATTEMPTED QUESTIONS AND ANSWERS  

 True/False  Question   Matching the choice of DBMS with selected server hardware is not important for

warehouse.  Correct Answer  

False

  Your Answer   False

  Match The Following  Question   Correct Answer   Your Answer

Metadata Roadmap for user Roadmap for user

Data storage Data management Data management

Data staging Workbench for data Workbench for data

Data Mining Knowledge discovery Knowledge discovery

 

 True/False  Question   Database systems, data warehouse system and world wide web have become

mainstream information system.  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Single Answer  Question   Bitmapped indexes are more suitable for data warehouse environment than for an

OLTP system  Correct Answer  

Bitmapped index

  Your Answer   Bitmapped index

 

 Multiple Choice Single Answer  Question   The big difference between data warehouse and any operational system is its :-

  Correct Answer  

Usage

  Your Answer   Usage

 


 Multiple Choice Single Answer  Question   One major effort within data transformation is :-

  Correct Answer  

Improvement of data quality

  Your Answer   Analysis of data quality

 

 Multiple Choice Multiple Answer  Question   Advantages of Wavelet transformation for clustering are :-

  Correct Answer  

Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast

  Your Answer   Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast

  Multiple Choice Single Answer  Question   Which of the following technique is used to display group summary statistics?

  Correct Answer  

Quality control

  Your Answer   Survival analysis

 

 Select The Blank  Question   ________ platform is the platform on which the data warehouse DBMS runs and

database exist.  Correct Answer  

Data storage

  Your Answer   Data storage

 

 Multiple Choice Multiple Answer  Question   Class Comparison is performed through following steps :-

  Correct Answer  

Data Collection , Dimension relevance analysis , Presentation of derived comparison

  Your Answer   Data Collection , Dimension relevance analysis , Presentation of derived comparison  

 Select The Blank  Question   It is good practice to drop ________ before initial load.

  Correct Answer  

Index

  Your Answer   Index

 

 Select The Blank  Question   ________ is the time consuming and less feasible approach for filling missing

values.  Correct Answer  

Filling missing values manually


  Your Answer   Filling missing values manually

  Multiple Choice Multiple Answer  Question   Basic Heuristic method of attribute subset selection includes following techniques :-

  Correct Answer  

Stepwise forward selection , Stepwise backward elimination

  Your Answer   Stepwise forward selection , Stepwise backward elimination , Combination of forward selection and backward elimination  
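Stepwise forward selection, named in the question above, can be sketched as a greedy loop: start with no attributes and keep adding the one that improves a score the most. This is an illustration only; the `score` function, the attribute names and the usefulness numbers are all assumptions, and stepwise backward elimination would run the same idea in reverse, starting from the full set.

```python
def forward_selection(attributes, score, min_gain=1e-9):
    """Greedily add the attribute that most improves score(subset); stop when nothing helps."""
    selected = []
    best = score(selected)
    remaining = list(attributes)
    while remaining:
        # Evaluate every candidate extension of the current subset
        cand_score, cand = max((score(selected + [a]), a) for a in remaining)
        if cand_score - best <= min_gain:
            break  # no remaining attribute improves the score
        selected.append(cand)
        remaining.remove(cand)
        best = cand_score
    return selected

# Hypothetical per-attribute usefulness, with a small penalty per attribute kept
usefulness = {"age": 0.30, "income": 0.25, "zip": 0.01, "id": 0.0}

def toy_score(subset):
    return sum(usefulness[a] for a in subset) - 0.02 * len(subset)

chosen = forward_selection(usefulness, toy_score)
print(chosen)
```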

 True/False  Question   For maintaining the quality of data proper naming conventions help to make data

elements well understood by users.  Correct Answer  

True

  Your Answer   True

 

 Select The Blank  Question   In ________ duplicate sub trees exist within the tree.

  Correct Answer  

Repetition

  Your Answer   Repetition

 

 Select The Blank  Question   The technique of ________ enables concurrent input/output operations and

improves file's access performance substantially.  Correct Answer  

File striping

  Your Answer   File striping

 

 Select The Blank  Question   ________ does not handle categorical attributes.

  Correct Answer  

CURE

  Your Answer   CURE

 

 Select The Blank  Question   Creating ________is violation of Normalization principles.

  Correct Answer  

Array

  Your Answer   Array

  True/False  Question   Data in warehouse is primarily for query.

  Correct Answer  

True

  Your Answer   True

 

 Multiple Choice Multiple Answer  Question   Preprocessing steps of data in order to help improve accuracy, efficiency and

scalability of classification & prediction are :-  Correct Answer  

Data Cleaning , Relevance Analysis , Data Transformation

  Your Answer   Data Cleaning , Relevance Analysis , Data Transformation

 

 Multiple Choice Single Answer  Question   Which task in data transformation includes types of data manipulation on selected

parts of source data?  Correct Answer  

Splitting/Joining

  Your Answer   Splitting/Joining

 

 True/False  Question   Business metadata is like a roadmap or easy to use information directory showing

contents and how to get there.  Correct Answer  

True

  Your Answer   True

 

 True/False  Question   Data error discovery and data correction are two parts of data cleansing process.

  Correct Answer  

True

  Your Answer   False

  Multiple Choice Multiple Answer  Question   The dimensions of spatial data cube are :-

  Correct Answer  

Non-spatial dimension , Spatial to non spatial , Spatial to spatial

  Your Answer   Non-spatial dimension , Spatial to non spatial , Spatial to spatial

 

 Select The Blank  Question   ________ technique is known as snapshot differential technique.

  Correct Answer  

Capture based on comparing files


  Your Answer   Capture based on comparing files
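The snapshot differential idea in the question above (capture based on comparing files) can be sketched as a comparison of two snapshots keyed by primary key. The function name `snapshot_diff` and the sample records are hypothetical; real extraction tools typically work on sorted files rather than in-memory dictionaries.

```python
def snapshot_diff(old, new):
    """Compare two snapshots keyed by primary key; return (inserts, updates, deletes)."""
    inserts = {k: new[k] for k in new.keys() - old.keys()}
    deletes = {k: old[k] for k in old.keys() - new.keys()}
    updates = {k: new[k] for k in new.keys() & old.keys() if new[k] != old[k]}
    return inserts, updates, deletes

# Hypothetical customer snapshots taken on two consecutive days
yesterday = {1: ("Ann", "Pune"), 2: ("Raj", "Delhi"), 3: ("Lee", "Goa")}
today = {1: ("Ann", "Mumbai"), 3: ("Lee", "Goa"), 4: ("Sam", "Agra")}

inserts, updates, deletes = snapshot_diff(yesterday, today)
print(inserts, updates, deletes)
```

Only the three changed records would then be passed on to the staging area, at the cost of keeping the previous snapshot around for the comparison.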

  Multiple Choice Multiple Answer  Question   The benefits of improved data quality are :-

  Correct Answer  

Better customer service , Improved productivity , Reliable strategic decision making

  Your Answer   Better customer service , Improved productivity , Reliable strategic decision making

  Multiple Choice Single Answer  Question   Which technique of data extraction is available to non relational databases?

  Correct Answer  

Capture through transaction log

  Your Answer   Capture of static data

  Select The Blank  Question   ________ technique is the statistical technique for analyzing data.

  Correct Answer  

Time series

  Your Answer   Time series

  True/False  Question   Noise in data means error or variance in measured variable.

  Correct Answer  

True

  Your Answer   True

  Multiple Choice Multiple Answer  Question   Data mining at home can help to mine data related to :-

  Correct Answer  

Medical History , Cancer , Chromosome abnormalities

  Your Answer   Medical History , Chromosome abnormalities , Physiological conditions

 

 True/False  Question   Data Mining refers to extracting knowledge from larger amount of data.

  Correct Answer  

True

  Your Answer   True

  Multiple Choice Single Answer  Question   Simple matching approach is used for computing dissimilarity between two objects for :-

  Correct Answer  

Nominal variable

  Your Answer   Nominal variable
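As a small illustration of the simple matching approach for nominal variables, the dissimilarity between two objects is commonly written d(i, j) = (p - m) / p, where p is the number of variables and m the number of variables on which the two objects agree. The sketch below assumes equal-length tuples of nominal values; the sample objects are made up.

```python
def simple_matching_dissimilarity(a, b):
    """d(i, j) = (p - m) / p for two objects described by p nominal variables."""
    if len(a) != len(b):
        raise ValueError("objects must have the same number of variables")
    p = len(a)
    m = sum(x == y for x, y in zip(a, b))  # number of matching variables
    return (p - m) / p

# Hypothetical objects described by colour, size and shape
obj1 = ("red", "small", "round")
obj2 = ("red", "large", "round")
d = simple_matching_dissimilarity(obj1, obj2)
print(d)
```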

 

 Multiple Choice Multiple Answer  Question   Following are the reasons for getting data polluted :-

  Correct Answer  

Data aging , Input errors , Fraud

  Your Answer   Data aging , Input errors , Processing errors

 

 Select The Blank  Question   ________ is the type of pilot for early delivery with broader scope and may be

integrated.  Correct Answer  

Broad business pilot

  Your Answer   Broad business pilot

  Multiple Choice Multiple Answer  Question   Following are the issues to consider during data integration :-

  Correct Answer  

Schema integration , Redundancy , Detection and resolution of data values

  Your Answer   Schema integration , Redundancy , Detection and resolution of data values

  Match The Following
Question | Correct Answer | Your Answer
Rough set Approach | Noisy Data | Previously unseen data
k-Nearest Neighbour Classifiers | Learning Analogy | Noisy Data
Class based Testing | Instance Based | Learning Analogy
Genetic Algorithms | Natural Evolution | Natural Evolution

  Multiple Choice Single Answer  Question   When DDL statements are created using database software, so to create an index

system creates :-  Correct Answer  

B-Tree index

  Your Answer   B-Tree index

 

 True/False  Question   The difficulties encountered in data transformation function relate to heterogeneity

of the source system.  Correct Answer  

True


  Your Answer   False

  True/False  Question   Data mining is not that much powerful tool for vast data such as gene sequences in

DNA analysis.  Correct Answer  

True

  Your Answer   True

  Multiple Choice Single Answer  Question   When current extent on disk storage for a file is full, DBMS finds new extent and

allows an insertion of new record is known as :-  Correct Answer  

Dynamic extension

  Your Answer   Dynamic extension

  Multiple Choice Multiple Answer  Question   Following are the types of normalization :-

  Correct Answer  

Min-Max Normalization , Z-score normalization , Normalization by scaling

  Your Answer   Min-Max Normalization , Z-score normalization , Normalization by scaling
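The three normalization types listed above can be sketched in a few lines of Python. This is a sketch under assumptions: "normalization by scaling" is read here as decimal scaling, the z-score uses the population standard deviation, and the sample values are invented. A constant column would divide by zero in the first two functions and is not handled.

```python
import statistics

def min_max(values, new_min=0.0, new_max=1.0):
    """Linearly map values from [min, max] onto [new_min, new_max]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]

def z_score(values):
    """Centre on the mean and divide by the (population) standard deviation."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]

def decimal_scaling(values):
    """Divide by the smallest power of ten that brings every |value| below 1."""
    max_abs = max(abs(v) for v in values)
    j = 0
    while max_abs / 10 ** j >= 1:
        j += 1
    return [v / 10 ** j for v in values]

incomes = [200, 300, 400, 600, 1000]  # hypothetical attribute values
print(min_max(incomes))
print(z_score(incomes))
print(decimal_scaling(incomes))
```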

 

 Multiple Choice Multiple Answer  Question   In generation of numerical hierarchies for cluster analysis following techniques are

useful :-  Correct Answer  

Binning , Histogram analysis , Clustering

  Your Answer   Binning , Histogram analysis , Segmentation

  Select The Blank  Question   ________ is an alternative agglomerative hierarchical clustering algorithm.

  Correct Answer  

ROCK

  Your Answer   ROCK

 

 Multiple Choice Multiple Answer  Question   Generalized linear model includes :-

  Correct Answer  

Logistic regression , Poisson regression

  Your Answer   Logistic regression , Poisson regression

 

 Multiple Choice Single Answer  Question   Inherently Architected, Single, central storage of data about content, Centralized rules and control, Seek quick result, these are the advantages of which type of data extraction?

  Correct Answer  

Top down approach

  Your Answer   Top down approach

 

 Multiple Choice Single Answer  Question   Data matrix is :-

  Correct Answer  

Object by variable structure

  Your Answer   Object by variable structure

 

 Multiple Choice Single Answer  Question   Queries run faster to find exact match using which type of indexing?

  Correct Answer  

Clustered index

  Your Answer   Clustered index

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Select The Blank
Question: ________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer: Data Transformation
Your Answer: Data Transformation

Multiple Choice Multiple Answer
Question: The Main areas of Data Warehouse are :-
Correct Answer: Data acquisition , Data Storage , Information Delivery
Your Answer: Data acquisition , Data Storage , Information Delivery

Select The Blank
Question: Data cleansing and ________ methods of data mining help in integration of genetic data and construction of warehouse for genetic data analysis.
Correct Answer: Integration
Your Answer: Integration

Multiple Choice Multiple Answer
Question: The dimensions of spatial data cube are :-
Correct Answer: Non-spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer: Non-spatial dimension , Spatial to non spatial , Spatial to spatial

Multiple Choice Single Answer
Question: In data reduction, the cluster representations of data are used to :-
Correct Answer: Replace data
Your Answer: Represent actual data

Multiple Choice Multiple Answer
Question: Distinguishing characteristics of data warehouse architecture are :-
Correct Answer: Different Objective Scope , Data Content , Flexible and Dynamic
Your Answer: Different Objective Scope , Complete Analysis and Quick Response , Flexible and Dynamic

Select The Blank
Question: In data warehouse architecture, the ________ component interleaves with and connects other components.
Correct Answer: Metadata
Your Answer: Metadata

Multiple Choice Multiple Answer
Question: Methods for outlier detection are categorised into following approaches :-
Correct Answer: Statistical , Distance based , Deviation based
Your Answer: Statistical , Distance based , Deviation based

True/False
Question: Metadata describes all the pertinent aspects of the data in data warehouse.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Financial data collected for banking and financial industry are often relatively :-
Correct Answer: Complete , Reliable , High Quality
Your Answer: Complete , Reliable , High Quality

Multiple Choice Multiple Answer
Question: Classification and Prediction have following applications :-
Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction
Your Answer: Credit approval , Medical Diagnosis , Performance Prediction

True/False
Question: Data Integration means multiple resources may be combined.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer: Index tree
Your Answer: Multidimensional index tree

True/False
Question: Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer: True
Your Answer: True

True/False
Question: Lower the level of detail, finer the data granularity.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer: ROCK
Your Answer: ROCK


Multiple Choice Single Answer
Question: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-
Correct Answer: Huge size of data
Your Answer: Huge size of data

Multiple Choice Single Answer
Question: Bayes Theorem is :-
Correct Answer: P(H|X) = P(X|H)P(H)/P(X)
Your Answer: P(H|X) = P(X|H)P(H)/P(X)

Multiple Choice Multiple Answer
Question: Data mining Functionalities are :-
Correct Answer: Characterization and Discrimination , Association Analysis , Cluster Analysis
Your Answer: Characterization and Discrimination , Association Analysis , Cluster Analysis

Select The Blank
Question: ________ does not handle categorical attributes.
Correct Answer: CURE
Your Answer: ROCK

Multiple Choice Single Answer
Question: Classification rules are extracted from
Correct Answer: Decision Tree
Your Answer: Decision Tree

Select The Blank
Question: The ________ record has a one-to-many relationship with corresponding fact table record.
Correct Answer: Dimension tables
Your Answer: Dimension tables

True/False
Question: In Database system multidimensional index trees are primarily used for providing fast data access.
Correct Answer: True
Your Answer: True

Match The Following
Question | Correct Answer | Your Answer
Data Mining | Knowledge discovery | Knowledge discovery
Metadata | Roadmap for user | Roadmap for user
Data storage | Data management | Data management
Data staging | Workbench for data | Workbench for data

True/False
Question: COBWEB is a method of incremental conceptual clustering.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :-
Correct Answer: Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Your Answer: Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
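For the Bayes Theorem question in this attempt, the standard form is P(H|X) = P(X|H) P(H) / P(X). A short worked sketch in Python follows; the hypothesis, the evidence and every probability are made-up numbers chosen only to show the arithmetic, with P(X) obtained from the law of total probability.

```python
def posterior(p_x_given_h, p_h, p_x):
    """Bayes' theorem: P(H|X) = P(X|H) * P(H) / P(X)."""
    return p_x_given_h * p_h / p_x

# Hypothetical example: H = "customer buys a computer", X = observed customer profile
p_h = 0.2                # prior P(H)
p_x_given_h = 0.5        # likelihood P(X|H)
p_x_given_not_h = 0.125  # P(X | not H)

# Law of total probability gives the evidence P(X)
p_x = p_x_given_h * p_h + p_x_given_not_h * (1 - p_h)

p_h_given_x = posterior(p_x_given_h, p_h, p_x)
print(p_h_given_x)
```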


Match The Following
Question | Correct Answer | Your Answer
Disparate data | Production data | Production data
Non volatile data | Query and analysis | Query and analysis
Data granularity | Level of detail | Level of detail
Data from external source | External data | External data

Multiple Choice Multiple Answer
Question: Advantages of Wavelet transformation for clustering are :-
Correct Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast
Your Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast

Multiple Choice Single Answer
Question: Association rules mining is based on :-
Correct Answer: Clustering and Employing rules for classification
Your Answer: Clustering and Employing rules for classification

Multiple Choice Single Answer
Question: Data can be smoothed by fitting the data to a function such as :-
Correct Answer: Regression
Your Answer: Regression

Multiple Choice Multiple Answer
Question: In physical design of data warehouse administration provides features like :-
Correct Answer: Support backup and recovery , Query processing , Avoiding reorganizing of tables
Your Answer: Avoiding reorganizing of tables , Support backup and recovery , Query processing

True/False
Question: MDDBMS stands for - Multilevel Database Management System.
Correct Answer: False
Your Answer: False

Multiple Choice Single Answer
Question: Effect of one attribute value on a given class is independent of values of other attribute is called
Correct Answer: Value independence
Your Answer: Value independence

Multiple Choice Multiple Answer
Question: When you use tool for design and development, following things take place with metadata :-
Correct Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process
Your Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process

Multiple Choice Single Answer
Question: Data partitioning, data clustering are the techniques for :-
Correct Answer: Performance enhancement
Your Answer: Performance enhancement

Multiple Choice Multiple Answer
Question: Knowledge discovery process includes :-
Correct Answer: Data Cleaning , Data Integration , Data Selection
Your Answer: Data Cleaning , Data Integration , Data Selection

Multiple Choice Single Answer
Question: Query tool is meant for :-
Correct Answer: Data acquisition
Your Answer: Data acquisition

Multiple Choice Multiple Answer
Question: The functions of data acquisition are :-
Correct Answer: Data Extraction , Data Transformation
Your Answer: Data Extraction , Data Transformation

Select The Blank
Question: ________ databases are one of the most popularly available and rich information repositories.
Correct Answer: Relational
Your Answer: Relational

True/False
Question: From a data warehouse perspective data mining can be viewed as an advanced stage of Online Analytical Processing.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Which of the following clustering analysis method uses multiresolution approach?
Correct Answer: STING , Wave Cluster
Your Answer: STING , Wave Cluster

True/False
Question: The Structure that brings all the components together is known as Architecture.
Correct Answer: True
Your Answer: True

Select The Blank
Question: Human beings have around ________ genes.
Correct Answer: 100000
Your Answer: 100000

Select The Blank
Question: ________ is the method used to predict the value of response variable from one or more variables.
Correct Answer: Regression
Your Answer: Regression

Multiple Choice Single Answer
Question: Which of the following type executes query operations in pipeline manner?
Correct Answer: Vertical parallelism
Your Answer: Vertical parallelism

True/False
Question: Data cleansing means removing noisy and inconsistent data.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: When DDL statements are created using database software, so to create an index system creates :-
Correct Answer: B-Tree index
Your Answer: B-Tree index

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

True/False
Question: Architecture comes first, tools follow it.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Following are the theories for the basis of data mining :-
Correct Answer: Pattern discovery , Probability theory , Microeconomic view
Your Answer: Pattern discovery , Probability theory , Microeconomic view

True/False
Question: Data preprocessing is an important step in knowledge discovery process.
Correct Answer: True
Your Answer: True

True/False
Question: A distinguishing feature of Clementine is its object oriented extended module interface.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The Architecture defines :-
Correct Answer: Measurements , Standard , General Design
Your Answer: Measurements , Standard , Standard Techniques

Select The Blank
Question: ________ technique contributes to machine learning, neural network, association mining, sequential pattern mining.
Correct Answer: Pattern discovery
Your Answer: Pattern discovery

Match The Following
Question | Correct Answer | Your Answer
Classification tool | To filter unrelated attributes | To characterize unusual access sequence
Clustering tool | To group different cases | Transaction activity using graph
Data visualization Tool | Transaction activity using graph | To group different cases
Linkage analysis tool | To identify links | To identify links

Multiple Choice Multiple Answer
Question: Data processing techniques are :-
Correct Answer: Cleansing , Integration , Transformation
Your Answer: Cleansing , Integration , Transformation

Select The Blank
Question: Creating ________ is violation of Normalization principles.
Correct Answer: Array
Your Answer: Cluster

Multiple Choice Multiple Answer
Question: Building blocks of Data Warehouse are :-
Correct Answer: Source Data , Data Staging , Management and Control
Your Answer: Source Data , Data Staging , Data Manager

Multiple Choice Single Answer
Question: OPTICS regarding clustering stands for :-
Correct Answer: Ordering Points to identify the clustering Structure
Your Answer: Ordering Points to identify the clustering Structure

Select The Blank
Question: ________ that enable massive quantities of data to be transported from one platform to another.
Correct Answer: Data ports
Your Answer: Data ports

True/False
Question: Sequential pattern analysis and similarity search techniques have been developed in data mining.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: The stored values of an attribute represents the value of attribute at this moment of time is :-
Correct Answer: Current value
Your Answer: Value of attribute

Match The Following
Question | Correct Answer | Your Answer
Data loading tool | Primary key generation | Bulk extraction for full refresh
Data modeling tool | Reverse Engineering capabilities | Reverse Engineering Capabilities
Data Extraction tool | Bulk extraction for full refresh | Default values
Data transformation tool | Default values | Primary key generation

True/False
Question: Audio data mining can be an interesting alternative to visual mining.
Correct Answer: True
Your Answer: True

Select The Blank
Question: Most of the warehouses employ ________ database Management System.
Correct Answer: Relational
Your Answer: Hierarchical

Multiple Choice Single Answer
Question: Which from the following are special programs that are stored on database and fired when certain predefined action occurs?
Correct Answer: Triggers
Your Answer: Events

Multiple Choice Multiple Answer
Question: For processing metadata in information delivery area, data can be referred back for :-
Correct Answer: Data structure , Data transformation , Source data configuration
Your Answer: Source data configuration , Data structure , Data transformation

Multiple Choice Multiple Answer
Question: Following are the types of normalization :-
Correct Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling
Your Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling

Multiple Choice Single Answer
Question: Following clustering method is classified as being agglomerative or divisive :-
Correct Answer: Grid based
Your Answer: Partitioning based

Multiple Choice Single Answer
Question: The big difference between data warehouse and any operational system is its :-
Correct Answer: Usage
Your Answer: Structure

Multiple Choice Multiple Answer
Question: Following are the data movement options in data warehouse :-
Correct Answer: Shared disk , Mass data transmission , Real time connection
Your Answer: Shared disk , Mass data transmission , Real time connection

True/False
Question: Data Mining refers to extracting knowledge from larger amount of data.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: The main advantage of which of the following methods is its fast processing?
Correct Answer: Grid based
Your Answer: Density based

Select The Blank
Question: Indexed ________ engines search the web, index web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords.
Correct Answer: Web Search
Your Answer: Web Search

Multiple Choice Multiple Answer
Question: Data base miner provides multiple data mining algorithms including :-
Correct Answer: Discovery driven OLAP analysis , Association , Classification
Your Answer: Discovery driven OLAP analysis , Association , Regression

Select The Blank
Question: ________ method of regression is useful when errors fail to satisfy normal conditions.
Correct Answer: Robust
Your Answer: Non parametric

True/False
Question: All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Deviation based outlier detection identifies outliers by :-
Correct Answer: Examining character of objects in groups
Your Answer: Examining character of objects in groups

Select The Blank
Question: It is good practice to drop ________ before initial load.
Correct Answer: Index
Your Answer: Index

Select The Blank
Question: Most of DBMS have ________ index techniques as default index techniques.
Correct Answer: B-Tree
Your Answer: B-Tree

Select The Blank
Question: In ________ duplicate sub trees exist within the tree.
Correct Answer: Repetition
Your Answer: Fragmentation

Multiple Choice Single Answer
Question: Which is the typical example of Grid based clustering method?
Correct Answer: STING
Your Answer: DBSCAN

True/False
Question: In the data acquisition area, the data flow begins at the data sources and pauses at staging area.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: In data storage area , DBA uses metadata for processes of :-
Correct Answer: Backup , Recovery , Tuning Database
Your Answer: Backup , Recovery

True/False
Question: Descriptive mining performs inference on current data, while predictive mining characterizes the general properties of data in the database.
Correct Answer: False
Your Answer: True

Select The Blank
Question: When data block contains excessive amount of free space, performance ________
Correct Answer: Degenerates
Your Answer: Degenerates

Select The Blank
Question: ________ platform is the platform on which the data warehouse DBMS runs and database exist.
Correct Answer: Data storage
Your Answer: Legacy

Multiple Choice Multiple Answer
Question: Data integration means :-
Correct Answer: Integrating database , Integrating cubes , Integrating files
Your Answer: Integrating database , Integrating cubes , Integrating files

Multiple Choice Single Answer
Question: Which technique analyzes experimental data?
Correct Answer: Analysis of variance
Your Answer: Analysis of variance

True/False
Question: In smoothing by bin means, each value in a bin is replaced by the mean value of the bin.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Maintenance of cache consistency is the limitation of :-
Correct Answer: MPP
Your Answer: SMP

Multiple Choice Multiple Answer
Question: Substantial portion of Business metadata originates from :-
Correct Answer: Textual documents , Spreadsheets , Business rules
Your Answer: Textual documents , Spreadsheets , Business rules

Multiple Choice Single Answer
Question: Redundancies can be deleted by :-
Correct Answer: Co-relational analysis
Your Answer: Relational analysis

Multiple Choice Single Answer
Question: Data reduction obtains a reduced representation of data set that is :-
Correct Answer: Much smaller
Your Answer: Much smaller
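The smoothing-by-bin-means question in this attempt can be illustrated with a short Python sketch. It assumes equal-depth (equal-frequency) bins over the sorted values; the function name `smooth_by_bin_means` and the price list are illustrative only.

```python
def smooth_by_bin_means(values, bin_size):
    """Sort the values, split them into equal-depth bins, replace each value by its bin mean."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        mean = sum(bin_values) / len(bin_values)
        smoothed.extend([mean] * len(bin_values))
    return smoothed

prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]  # hypothetical sorted attribute values
smoothed = smooth_by_bin_means(prices, 3)
print(smoothed)
```

With three values per bin, the bins (4, 8, 15), (21, 21, 24) and (25, 28, 34) are replaced by their means 9, 22 and 29.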

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Select The Blank
Question: Data cleansing and ________ methods of data mining help in integration of genetic data and construction of warehouse for genetic data analysis.
Correct Answer: Integration
Your Answer: Integration

Select The Blank
Question: ________ method of regression is useful when errors fail to satisfy normal conditions.
Correct Answer: Robust
Your Answer: Robust

Multiple Choice Single Answer
Question: Bitmap index takes significantly less space than which type of index?
Correct Answer: B-Tree index
Your Answer: B-Tree index

Select The Blank
Question: ________ component consists of all the different ways of making the information from the data warehouse available to the user.
Correct Answer: Information Delivery
Your Answer: Information Delivery

True/False
Question: Architecture comes first, tools follow it.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The Main areas of Data Warehouse are :-
Correct Answer: Data acquisition , Data Storage , Information Delivery
Your Answer: Data acquisition , Data Storage , Information Delivery

Select The Blank
Question: ________ is density based clustering method which computes an augmented cluster ordering for automatic and interactive cluster analysis.
Correct Answer: DBSCAN
Your Answer: DBSCAN

Match The Following
Question | Correct Answer | Your Answer
Load Utility | High performance data loading, recovery | High performance data loading, recovery
Query Governor | Abort runaway query | Active data catalog/directory
Query Optimizer | Parsing, optimizing query | Parsing, optimizing query
Query Management | Balancing extraction of query | Execution and rescheduling queries

Multiple Choice Multiple Answer
Question: Source Data Component may be grouped into following categories :-
Correct Answer: Production Data , Internal External Data
Your Answer: Production Data , Internal External Data

Select The Blank
Question: ________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer: Broad business pilot
Your Answer: Broad business pilot

Multiple Choice Multiple Answer
Question: The smoothing techniques are :-
Correct Answer: Binning , Clustering , Regression
Your Answer: Clustering , Regression

Multiple Choice Single Answer
Question: Which of the following data warehouse component includes dependent data marts, special multidimensional database and full range of query and reporting facilities?
Correct Answer: Information Delivery component
Your Answer: Metadata Component

True/False
Question: The Structure that brings all the components together is known as Architecture.
Correct Answer: True
Your Answer: True

Select The Blank
Question: The technique of ________ enables concurrent input/output operations and improves file's access performance substantially.
Correct Answer: File striping
Your Answer: File striping

True/False
Question: Management architectural component manages and controls data acquisition functions.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: If many indexes are needed on a table, which option is preferable?
Correct Answer: Splitting of tables
Your Answer: Rearranging of tables

Multiple Choice Single Answer
Question: Which of the following Grid based clustering methods explores statistical information?
Correct Answer: STING
Your Answer: STING

Multiple Choice Single Answer
Question: Attribute construction is the part of :-
Correct Answer: Transformation
Your Answer: Aggregation

Multiple Choice Multiple Answer
Question: DNA sequences are comprised of :-
Correct Answer: Adenine , Guanine , Thymine
Your Answer: Guanine , Thymine , Adenine

True/False
Question: In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by rectangles.
Correct Answer: False
Your Answer: False

Multiple Choice Single Answer
Question: Effect of one attribute value on a given class is independent of values of other attribute is called
Correct Answer: Value independence
Your Answer: Value independence

Multiple Choice Single Answer
Question: Association rules mining is based on :-
Correct Answer: Clustering and Employing rules for classification
Your Answer: Clustering and Employing rules for classification

Select The Blank
Question: Most of DBMS have ________ index techniques as default index techniques.
Correct Answer: B-Tree
Your Answer: B-Tree

Match The Following
Question | Correct Answer | Your Answer
Disparate data | Production data | Production data
Non volatile data | Query and analysis | Query and analysis
Data granularity | Level of detail | Level of detail
Data from external source | External data | External data

Multiple Choice Single Answer
Question: Dimensionality reduction reduces the data set size by removing :-
Correct Answer: Irrelevant attributes
Your Answer: Irrelevant attributes

Multiple Choice Multiple Answer
Question: Data reduction reduces data size by :-
Correct Answer: Aggregation , Eliminating redundant features
Your Answer: Aggregation , Eliminating redundant features , Restructuring

True/False
Question: Data integration merges data from multiple sources into coherent sources.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: The option "capture in source application" technique of data extraction degrades performance of source application because :-
Correct Answer: Additional processing needs
Your Answer: Additional processing needed to capture changes on separate files

Multiple Choice Single Answer
Question: Which of the following type executes query operations in pipeline manner?
Correct Answer: Vertical parallelism
Your Answer: Vertical parallelism

Multiple Choice Single Answer
Question: Data partitioning, data clustering are the techniques for :-
Correct Answer: Performance enhancement
Your Answer: Performance enhancement

True/False
Question: COBWEB is an extension of CLASSIT for incremental clustering of continuous data.
Correct Answer: False
Your Answer: True

Multiple Choice Multiple Answer
Question: Following are the issues to consider during data integration :-
Correct Answer: Detection and resolution of data values , Schema integration , Redundancy
Your Answer: Schema integration , Redundancy , Detection and resolution of data values

Multiple Choice Single Answer
Question: Classification rules are extracted from
Correct Answer: Decision Tree
Your Answer: Decision Tree

Multiple Choice Multiple Answer
Question: Which of the following clustering analysis methods use a multiresolution approach?
Correct Answer: STING , Wave Cluster
Your Answer: STING , Wave Cluster

True/False
Question: The lower the level of detail, the finer the data granularity.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ does not handle categorical attributes.
Correct Answer: CURE
Your Answer: CURE

Multiple Choice Multiple Answer
Question: When you use a tool for design and development, the following things take place with metadata :-
Correct Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process
Your Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process

Multiple Choice Single Answer
Question: Bayes Theorem is :-
Correct Answer: P(H|X) = P(X|H) P(H) / P(X)
Your Answer: P(H|X) = P(X|H) P(H) / P(X)

True/False
Question: Data mining is not a very powerful tool for vast data such as gene sequences in DNA analysis.
Correct Answer: True
Your Answer: False

Multiple Choice Multiple Answer
Question: The dimensions of spatial data cube are :-
Correct Answer: Non-spatial dimension , Spatial to non-spatial , Spatial to spatial
Your Answer: Non-spatial dimension , Spatial to non-spatial , Spatial to spatial

True/False
Question: Easily accessible metadata is crucial for end users.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Classification and Prediction have following applications :-
Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction
Your Answer: Credit approval , Medical Diagnosis , Performance Prediction

True/False
Question: All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system.
Correct Answer: True
Your Answer: False

Select The Blank

Question: ________ databases are one of the most popularly available and rich information repositories.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Multiple Answer
Question: Advantages of Wavelet transformation for clustering are :-
Correct Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast
Your Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast

Select The Blank
Question: ________ is the platform for complex data transformation for the purpose of cleansing it.
Correct Answer: Separate optimal Platform
Your Answer: Separate optimal Platform

Select The Blank
Question: ________ technique contributes to machine learning, neural network, association mining, sequential pattern mining.
Correct Answer: Pattern discovery
Your Answer: Pattern discovery
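
As a worked illustration of the Bayes Theorem item in this attempt, the sketch below evaluates P(H|X) = P(X|H) P(H) / P(X). The function name `posterior` and every probability value are hypothetical, chosen only to make the arithmetic easy to follow; none of them come from the course material.

```python
def posterior(prior_h, likelihood_x_given_h, evidence_x):
    """Return P(H|X) using Bayes' theorem: P(X|H) * P(H) / P(X)."""
    if evidence_x <= 0:
        raise ValueError("P(X) must be positive")
    return likelihood_x_given_h * prior_h / evidence_x


# Hypothetical inputs (assumptions for illustration only).
p_h = 0.3              # prior P(H)
p_x_given_h = 0.8      # likelihood P(X|H)
p_x_given_not_h = 0.2  # likelihood P(X|not H)

# Total probability of the evidence: P(X) = P(X|H)P(H) + P(X|not H)P(not H).
p_x = p_x_given_h * p_h + p_x_given_not_h * (1 - p_h)

p_h_given_x = posterior(p_h, p_x_given_h, p_x)
print(round(p_h_given_x, 4))
```

Computing P(X) through the law of total probability keeps the three inputs consistent with one another, which is usually safer than supplying P(X) by hand.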

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Single Answer
Question: Data matrix is :-
Correct Answer: Object by variable structure
Your Answer: Object by variable structure

Multiple Choice Multiple Answer
Question: Following are the data movement options in data warehouse :-
Correct Answer: Shared disk , Mass data transmission , Real time connection
Your Answer: Shared disk , Mass data transmission , Real time connection

Multiple Choice Single Answer
Question: In data reduction, the cluster representations of data are used to :-
Correct Answer: Replace data
Your Answer: Replace data

True/False
Question: Descriptive mining performs inference on current data, while predictive mining characterizes the general properties of data in the database.
Correct Answer: False
Your Answer: False

Multiple Choice Single Answer
Question: For Incremental data loads the sequence is :-
Correct Answer: Triggering -> Filtering -> Data extraction -> Transformation -> Integration -> Cleansing
Your Answer: Triggering -> Filtering -> Data extraction -> Transformation -> Integration -> Cleansing

True/False
Question: COBWEB incrementally incorporates objects into classification tree.
Correct Answer: True
Your Answer: True

True/False
Question: Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer: True
Your Answer: True

Select The Blank
Question: Creating ________ is violation of Normalization principles.
Correct Answer: Array
Your Answer: Cluster

Multiple Choice Single Answer
Question: Which of the following methods is built on the influence function?
Correct Answer: DENCLUE
Your Answer: STING

Multiple Choice Single Answer
Question: Which of the following methods for regression is used on sparse data :-
Correct Answer: Regression and log-linear model
Your Answer: Regression and log-linear model

Multiple Choice Multiple Answer
Question: Building blocks of Data Warehouse are :-
Correct Answer: Source Data , Data Staging , Management and Control
Your Answer: Source Data , Data Staging , Management and Control

Multiple Choice Multiple Answer
Question: Metadata in a data warehouse falls into following categories :-
Correct Answer: Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Your Answer: Operational Metadata , Extraction and Transformation metadata , End-user Metadata

Multiple Choice Single Answer
Question: SMP stands for :-
Correct Answer: Symmetric Multiprocessing
Your Answer: Symmetric Multiprocessing

Multiple Choice Multiple Answer
Question: Partitioning in physical design of data warehouse consists of :-
Correct Answer: Fact tables and dimension tables , Number of partitions for each table , Criteria for dividing table
Your Answer: Fact tables and dimension tables , Number of partitions for each table , Criteria for dividing table

True/False
Question: Data updates are commonplace in an operational database.
Correct Answer: True
Your Answer: True

True/False
Question: A cluster is a collection of data objects that are similar to one another within the same cluster and dissimilar to objects in other clusters.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The functional areas of metadata are :-
Correct Answer: Data Acquisition , Data storage , Information delivery
Your Answer: Data transformation , Data Acquisition , Information delivery

Select The Blank
Question: ________ regression involves finding the best line to fit two variables.
Correct Answer: Linear
Your Answer: Linear

Match The Following
Question | Correct Answer | Your Answer
Administration | Providing support for all DBA functions | Support for System administration
Extensibility | Hybrid Extension to OLAP database | Hybrid Extension to OLTP database
Portability | Across platform | Across platform
Query tool APIs | For tools from loading vendors | Providing support for all DBA functions

Multiple Choice Single Answer
Question: Which of the following types of processing provides high concurrency?
Correct Answer: SMP
Your Answer: ccNUMA

Select The Blank
Question: Semantic integration of ________ genome database is the important task of DNA analysis.
Correct Answer: Heterogeneous and distributed
Your Answer: Heterogeneous and distributed

True/False
Question: Removing noise from data is called smoothing.
Correct Answer: True
Your Answer: True

Match The Following
Question | Correct Answer | Your Answer
Data Mining | Knowledge discovery | Knowledge discovery
Metadata | Roadmap for user | Roadmap for user
Data storage | Data management | Data management
Data staging | Workbench for data | Workbench for data

Multiple Choice Multiple Answer
Question: Knowledge discovery process includes :-
Correct Answer: Data Cleaning , Data Integration , Data Selection
Your Answer: Data Cleaning , Data Integration , Data Selection

Multiple Choice Multiple Answer
Question: Methods for outlier detection are categorised into following approaches :-
Correct Answer: Statistical , Distance based , Deviation based
Your Answer: Statistical , Distance based , Deviation based

Multiple Choice Single Answer
Question: Following clustering method is classified as being agglomerative or divisive :-

Correct Answer: Grid based
Your Answer: Grid based

Select The Blank
Question: In data warehouse architecture, the ________ component interleaves with and connects other components.
Correct Answer: Metadata
Your Answer: Metadata

Multiple Choice Multiple Answer
Question: The ways of Intra query parallelization are :-
Correct Answer: Horizontal parallelization , Vertical Parallelization , Hybrid parallelization
Your Answer: Horizontal parallelization , Vertical Parallelization , Hybrid parallelization

Multiple Choice Multiple Answer
Question: The objectives for physical design of data warehouse are :-
Correct Answer: Improve performance , Ensure scalability , Manage store
Your Answer: Improve performance , Ensure scalability , Manage database

True/False
Question: Metadata is building block of data warehouse.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: What improves accuracy and speed of subsequent mining process?
Correct Answer: Integration
Your Answer: Regression

Select The Blank
Question: ________ are responsible for running queries and reports against data warehouse tables.
Correct Answer: End users
Your Answer: End users

Select The Blank
Question: For operational system, the stored data contains ________ values.
Correct Answer: Current data
Your Answer: Current data

Multiple Choice Single Answer
Question: Enterprise miner technique provides data mining algorithms including distinguishing feature as :-
Correct Answer: Advanced Statistical and advanced visualization tool
Your Answer: Robust Graphics tools

Multiple Choice Multiple Answer
Question: Splitting of query by DBMS in intra query parallelization is for :-
Correct Answer: Index read , Data read , Data joint
Your Answer: Index read , Data read , Data joint

Multiple Choice Single Answer
Question: Which of the following approaches requires more computation?
Correct Answer: Filter approach
Your Answer: Filter approach

True/False
Question: Data in warehouse is primarily for query.
Correct Answer: True
Your Answer: False

Multiple Choice Single Answer
Question: Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer: Nominal variable
Your Answer: Invariant variable

Select The Blank
Question: ________ are the inter platform devices that enable massive quantities of data to be transported from one platform to another.
Correct Answer: Data ports
Your Answer: Data ports

Multiple Choice Multiple Answer
Question: Following are the types of normalization :-
Correct Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling
Your Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling

Multiple Choice Multiple Answer
Question: The different definitions of metadata are :-
Correct Answer: Data about data , Catalog of data , Data warehouse roadmap
Your Answer: Data about data , Catalog of data , Data warehouse roadmap

Select The Blank
Question: ________ technique can be used to reduce the number of values for a given continuous attribute by dividing the range of the attribute into intervals.
Correct Answer: Discretization
Your Answer: Discretization

True/False
Question: MDDBMS stands for - Multilevel Database Management System.
Correct Answer: False
Your Answer: False

Multiple Choice Single Answer
Question: Fast processing is the main advantage of which of the following methods?
Correct Answer: Grid based
Your Answer: Grid based

Select The Blank
Question: ________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer: Index tree
Your Answer: Index tree

Select The Blank
Question: ________ architecture is more concerned with data access than memory access.
Correct Answer: MPP
Your Answer: SMP
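
The three normalization types named in this attempt can be sketched as follows. This is a minimal illustration, assuming that "Normalization by scaling" refers to decimal scaling (dividing by a power of ten); the function names and the sample values are hypothetical.

```python
from statistics import mean, pstdev


def min_max(values, new_min=0.0, new_max=1.0):
    """Min-max normalization: map [min, max] linearly onto [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # all values equal, so no spread to rescale
        return [new_min for _ in values]
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]


def z_score(values):
    """Z-score normalization: subtract the mean, divide by the standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def decimal_scaling(values):
    """Decimal scaling: divide by 10**j, with j the smallest integer
    that brings every absolute value below 1."""
    max_abs = max(abs(v) for v in values)
    j = 0
    while max_abs / (10 ** j) >= 1:
        j += 1
    return [v / (10 ** j) for v in values]


# Hypothetical sample attribute values.
incomes = [200, 400, 800, 1000]
scaled_min_max = min_max(incomes)
scaled_z = z_score(incomes)
scaled_decimal = decimal_scaling(incomes)
print(scaled_min_max, scaled_decimal)
```

Min-max preserves the relative spacing of the values, z-score is the usual choice when the minimum and maximum are unknown or dominated by outliers, and decimal scaling only shifts the decimal point.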

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

True/False
Question: Metadata is building block of data warehouse.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The Main areas of Data Warehouse are :-
Correct Answer: Data acquisition , Data Storage , Information Delivery
Your Answer: Data Storage , Information Delivery , Data acquisition

Select The Blank
Question: ________ is the navigational map of data warehouse.
Correct Answer: End user Metadata
Your Answer: End user Metadata

Multiple Choice Multiple Answer
Question: Data mining Functionalities are :-
Correct Answer: Characterization and Discrimination , Association Analysis , Cluster Analysis
Your Answer: Characterization and Discrimination , Association Analysis , Cluster Analysis

Multiple Choice Single Answer
Question: Which of the following options is to share data by placing data at common place :-
Correct Answer: Shared disk
Your Answer: Shared disk

Multiple Choice Multiple Answer
Question: Data mining is applicable to :-
Correct Answer: Relational Database , Data Warehouse , Transaction Database
Your Answer: Relational Database , Data Warehouse , Transaction Database

Multiple Choice Single Answer
Question: Which of the following approaches requires more computation?
Correct Answer: Filter approach
Your Answer: Filter approach

Match The Following
Question | Correct Answer | Your Answer
Clustering | Data tuples as objects | Data tuples as objects
Dimension reduction | Removal of irrelevant data | Removal of irrelevant data
Data compression | More computations | More computations
Wrapper approach | Great accuracy | Great accuracy

Select The Blank
Question: According to ________ theory database schema consist of data and patterns that are stored in database.
Correct Answer: Inductive databases
Your Answer: Inductive databases

True/False
Question: Data cubes created for varying levels of abstraction are referred as cuboids.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The Architecture defines :-
Correct Answer: Measurements , Standard , General Design

Your Answer: Measurements , Standard , General Design

Multiple Choice Multiple Answer
Question: Source Data Component may be grouped into following categories :-
Correct Answer: Production Data , Internal External Data
Your Answer: Production Data , Internal External Data

Multiple Choice Multiple Answer
Question: When you use a tool for design and development, the following things take place with metadata :-
Correct Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process
Your Answer: Metadata aids in automation of data warehouse process , Metadata is no longer passive document , Metadata takes part in process

True/False
Question: Metadata describes all the pertinent aspects of the data in data warehouse.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Before moving data to data warehouse it has to go through :-
Correct Answer: Transformation , Integration , Consolidation
Your Answer: Transformation , Integration , Consolidation

Match The Following
Question | Correct Answer | Your Answer
Disparate data | Production data | Production data
Non volatile data | Query and analysis | Query and analysis
Data granularity | Level of detail | Level of detail
Data from external source | External data | External data

Select The Blank
Question: ________ is the time consuming and less feasible approach for filling missing values.
Correct Answer: Filling missing values manually
Your Answer: Use of row mean

Select The Blank
Question: ________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer: ROCK
Your Answer: ROCK

Multiple Choice Single Answer
Question: Which of the following is based on set of density distribution function clustering?
Correct Answer: DBSCAN
Your Answer: DBSCAN

True/False
Question: All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system.
Correct Answer: True
Your Answer: True

Select The Blank

Question: ________ component of warehouse is responsible for coordinating services and activities within the data warehouse.
Correct Answer: Management and Control
Your Answer: Management and Control

Select The Blank
Question: ________ technique can be used to reduce the number of values for a given continuous attribute by dividing the range of the attribute into intervals.
Correct Answer: Discretization
Your Answer: Discretization

Multiple Choice Single Answer
Question: Which technique analyzes experimental data?
Correct Answer: Analysis of variance
Your Answer: Analysis of variance

Multiple Choice Single Answer
Question: Classification rules are extracted from
Correct Answer: Decision Tree
Your Answer: Decision Tree

Select The Blank
Question: ________ component consists of all the different ways of making the information from the data warehouse available to the user.
Correct Answer: Information Delivery
Your Answer: Information Delivery

True/False
Question: In Linear regression data are modeled to fit a straight line.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ platform is the platform on which the data warehouse DBMS runs and database exist.
Correct Answer: Data storage
Your Answer: Data storage

Multiple Choice Single Answer
Question: In data reduction, the cluster representations of data are used to :-
Correct Answer: Replace data
Your Answer: Replace data

Multiple Choice Single Answer
Question: The DWT (Discrete Wavelet Transform) is a :-
Correct Answer: Linear single processing technique
Your Answer: Linear single processing technique

Multiple Choice Multiple Answer
Question: Substantial portion of Business metadata originates from :-
Correct Answer: Textual documents , Spreadsheets , Business rules
Your Answer: Textual documents , Spreadsheets , Business rules

True/False
Question: A distinct feature of DB Miner is its data cube based online analytical mining.
Correct Answer: True

Your Answer: True

Multiple Choice Multiple Answer
Question: Financial data collected for banking and financial industry are often relatively :-
Correct Answer: Complete , Reliable , High Quality
Your Answer: Complete , Reliable , High Quality

True/False
Question: Smoothing by bin means each value in bin is replaced by the mean value of the bucket.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: SMP stands for :-
Correct Answer: Symmetric Multiprocessing
Your Answer: Symmetric Multiprocessing

Select The Blank
Question: In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.
Correct Answer: Smoothing by bin boundaries
Your Answer: Smoothing by bin boundaries

Select The Blank
Question: ________ is the method used to predict the value of response variable from one or more variables.
Correct Answer: Regression
Your Answer: Regression

Multiple Choice Multiple Answer
Question: Data reduction reduces data size by :-
Correct Answer: Aggregation , Eliminating redundant features
Your Answer: Aggregation , Eliminating redundant features

True/False
Question: Sequential pattern analysis and similarity search techniques have been developed in data mining.
Correct Answer: True
Your Answer: True

True/False
Question: The lower the level of detail, the finer the data granularity.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ is the user who has all access privileges like system, database administrator, for table and views.
Correct Answer: Security administrator
Your Answer: Power user

Multiple Choice Multiple Answer
Question: Generalized linear model includes :-
Correct Answer: Logistic regression , Poisson regression
Your Answer: Logistic regression , Poisson regression
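
The two smoothing techniques asked about in this attempt, smoothing by bin means and smoothing by bin boundaries, can be sketched as below. This is only an illustration under stated assumptions: equal-depth (equal-frequency) bins are used, ties between the two boundaries go to the lower one, and the helper names and the price list are hypothetical.

```python
def equal_depth_bins(values, depth):
    """Sort the values and cut them into consecutive bins of `depth` items."""
    ordered = sorted(values)
    return [ordered[i:i + depth] for i in range(0, len(ordered), depth)]


def smooth_by_means(bins):
    """Replace every value in a bin by the mean of that bin."""
    return [[sum(b) / len(b)] * len(b) for b in bins]


def smooth_by_boundaries(bins):
    """Replace every value by the closer of its bin's minimum and maximum."""
    smoothed = []
    for b in bins:
        lo, hi = min(b), max(b)
        # Assumption: a value exactly halfway between goes to the lower boundary.
        smoothed.append([lo if v - lo <= hi - v else hi for v in b])
    return smoothed


# Hypothetical sorted prices.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
bins = equal_depth_bins(prices, depth=3)
by_means = smooth_by_means(bins)
by_boundaries = smooth_by_boundaries(bins)
print(by_means)
print(by_boundaries)
```

Both variants consult only the neighbourhood of each value (its bin), which is why binning is described as a local smoothing method.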

Multiple Choice Multiple Answer
Question: The main categories of Metadata in warehouse are :-
Correct Answer: Operational , Extraction and transformation Metadata , End user Metadata
Your Answer: Operational , Extraction and transformation Metadata , End user Metadata

Multiple Choice Single Answer
Question: Data migration affects performance requiring multiple blocks to be read which can be adjusted by :-
Correct Answer: Block percent free
Your Answer: Block percent free

True/False
Question: Data Integration means multiple resources may be combined.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Data reduction by volume can be used for data representation using which type of reduction?
Correct Answer: Numerosity reduction
Your Answer: Numerosity reduction

Multiple Choice Single Answer
Question: The effect of one attribute value on a given class being independent of the values of other attributes is called
Correct Answer: Value independence
Your Answer: Attribute conditional independence

Multiple Choice Single Answer
Question: Which of the following techniques involves placing and managing related units of data in same physical block of storage
Correct Answer: Clustering
Your Answer: Clustering

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Multiple Answer
Question: Data mining is applicable to :-
Correct Answer: Transaction Database , Relational Database , Data Warehouse
Your Answer: Transaction Database , Relational Database , Data Warehouse

Select The Blank
Question: ________ does not handle categorical attributes.
Correct Answer: CURE
Your Answer: Chameleon

Multiple Choice Single Answer
Question: Fast processing is the main advantage of which of the following methods?
Correct Answer: Grid based
Your Answer: Density based

Select The Blank

Question: When data block contains excessive amount of free space, performance ________
Correct Answer: Degenerates
Your Answer: Degenerates

Select The Blank
Question: ________ component consists of all the different ways of making the information from the data warehouse available to the user.
Correct Answer: Information Delivery
Your Answer: Information Delivery

Multiple Choice Single Answer
Question: SMP stands for :-
Correct Answer: Symmetric Multiprocessing
Your Answer: Symmetric Multiprocessing

Multiple Choice Multiple Answer
Question: The need for metadata is for :-
Correct Answer: Using data warehouse , Building data warehouse , Administration of warehouse
Your Answer: Building data warehouse , Administration of warehouse , Accessing data in warehouse

Select The Blank
Question: ________ are responsible for running queries and reports against data warehouse tables.
Correct Answer: End users
Your Answer: Query tool specialist

Multiple Choice Multiple Answer
Question: Distinguishing characteristics of data warehouse architecture are :-
Correct Answer: Different Objective Scope , Data Content , Flexible and Dynamic
Your Answer: Data Content , Complete Analysis and Quick Response , Flexible and Dynamic

Multiple Choice Single Answer
Question: Redundancies can be deleted by :-
Correct Answer: Co-relational analysis
Your Answer: Relational analysis

True/False
Question: Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer: True
Your Answer: True

Match The Following
Question | Correct Answer | Your Answer
Load Image | To correspond to target files | Offline data warehouse
Constructive merge | New record supersedes | Populating data warehouse table first time
Initial Load | Populating data warehouse table first time | Applying data
Incremental Load | Applying ongoing changes | Applying ongoing changes

True/False
Question: COBWEB incrementally incorporates objects into classification tree.
Correct Answer: True

Your Answer: True

Multiple Choice Multiple Answer
Question: Building blocks of Data Warehouse are :-
Correct Answer: Source Data , Data Staging , Management and Control
Your Answer: Source Data , Data Staging , Management and Control

True/False
Question: A process of grouping a set of physical or abstract objects into classes of similar objects is called clustering.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Application server serves following purposes :-
Correct Answer: To run middleware and establish connectivity , To execute management and control software , To manage metadata
Your Answer: To run middleware and establish connectivity , To execute management and control software , To run OLTP application

True/False
Question: Data mining often requires data integration.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: The "capture in source application" technique of data extraction degrades performance of the source application because :-
Correct Answer: Additional processing needs
Your Answer: Additional processing needs

Multiple Choice Multiple Answer
Question: The main categories of Metadata in warehouse are :-
Correct Answer: Operational , Extraction and transformation Metadata , End user Metadata
Your Answer: Operational , Execution and Transformation Metadata , End user Metadata

Multiple Choice Single Answer
Question: Which of the following methods creates copies of data in distributed environment?
Correct Answer: Replication
Your Answer: Replication

Multiple Choice Multiple Answer
Question: Common areas of application for mixed effect model includes :-
Correct Answer: Multiple data , Repeated measures data , Block designs
Your Answer: Multiple data , Repeated measures data , Block designs

Multiple Choice Multiple Answer
Question: Following are the issues to consider during data integration :-
Correct Answer: Detection and resolution of data values , Schema integration , Redundancy
Your Answer: Schema integration , Redundancy , Inconsistency

True/False
Question: Smoothing by bin means each value in bin is replaced by the mean value of the bucket.
Correct Answer: True
Your Answer: True

Select The Blank
Question: In ________ duplicate sub trees exist within the tree.
Correct Answer: Repetition
Your Answer: Replication

Multiple Choice Multiple Answer
Question: The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :-
Correct Answer: Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Your Answer: Linkage analysis tool , Complexity definition tool , Sequential pattern analysis tool

Select The Blank
Question: According to ________ theory database schema consist of data and patterns that are stored in database.
Correct Answer: Inductive databases
Your Answer: Data compression

Multiple Choice Single Answer
Question: Which of the following methods for regression is used on sparse data :-
Correct Answer: Regression and log-linear model
Your Answer: Regression and log-linear model

Multiple Choice Single Answer
Question: The big difference between data warehouse and any operational system is its :-
Correct Answer: Usage
Your Answer: Structure

Multiple Choice Single Answer
Question: In intermediate data extraction data capture through transaction log uses transaction from :-
Correct Answer: Recovery from failure
Your Answer: Logs of successful transaction

Multiple Choice Multiple Answer
Question: SMP provides the features like :-
Correct Answer: Controllers which are accessible to all processors , Each processor has full access to the shared memory through common bus , Each node has access to common set of disks
Your Answer: Controllers which are accessible to all processors , Each node has access to common set of disks , It is cluster of nodes

Match The Following
Question | Correct Answer | Your Answer
Data producer | Responsible for data quality | Foreign key preserved
Domain values | Prevalent problem | Primary key introduced
Update security | Prevention of unauthorized updates | Prevention of unauthorized updates
Referential integrity | Foreign key preserved | Responsible for data quality

True/False
Question: Management architectural component manages and controls data acquisition functions.
Correct Answer: True
Your Answer: False

Multiple Choice Single Answer
Question: EIS stands for :-
Correct Answer: Executive Information System
Your Answer: Extracted Integrated System

True/False
Question: NUMA provides better scalability than SMP.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ architecture is more concerned with data access than memory access.
Correct Answer: MPP
Your Answer: MPP

Select The Blank
Question: Human beings have around ________ genes.
Correct Answer: 100000
Your Answer: 1000000

Select The Blank
Question: With the widespread option of ________ real-time connection is viable for data warehouse.
Correct Answer: TCP/IP
Your Answer: TCP/IP

True/False
Question: In Linear regression data are modeled to fit a straight line.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Development and deployment of your data warehouse is joint effort between :-
Correct Answer: IT staff and user representatives
Your Answer: IT staff and developer

True/False
Question: The lower the level of detail, the finer the data granularity.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Which of the following techniques involves placing and managing related units of data in same physical block of storage
Correct Answer: Clustering
Your Answer: Indexing

True/False
Question: Loan payment prediction and customer credit analysis are critical to business of bank.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ is the platform for complex data transformation for the purpose of cleansing it.
Correct Answer: Separate optimal Platform
Your Answer: Legacy platform
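
The linear regression item in this attempt states that data are modeled to fit a straight line. A minimal sketch of that idea, assuming ordinary least squares on a single predictor, is shown below; the function name `fit_line` and the sample points are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one predictor."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sums of squared deviations and cross-deviations around the means.
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    if s_xx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical points that lie exactly on y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)
predicted = slope * 5 + intercept  # predicted response for x = 5
print(slope, intercept, predicted)
```

The fitted line can then be used to predict the response variable for a new value of the predictor, as in the last step.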

Select The Blank
Question: ________ clustering method follows statistical and neural network approach.
Correct Answer: Model based
Your Answer: Hierarchical Method

Multiple Choice Multiple Answer
Question: DNA sequences are comprised of :-
Correct Answer: Adenine , Guanine , Thymine
Your Answer: Cytosine , Guanine , Thymine

Multiple Choice Multiple Answer
Question: Business metadata is useful for :-
Correct Answer: Providing support to end users , For external view of data , Provides technical support to search data
Your Answer: Providing support to end users , For external view of data , Provides technical support to search data , Helps in searching data

Multiple Choice Single Answer
Question: Following clustering method is classified as being agglomerative or divisive :-
Correct Answer: Grid based
Your Answer: Grid based

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Multiple Answer
Question: Metadata in a data warehouse falls into following categories :-
Correct Answer: End-user Metadata , Operational Metadata , Extraction and Transformation metadata
Your Answer: End-user Metadata , Operational Metadata , Extraction and Transformation metadata

Multiple Choice Multiple Answer
Question: Classification and Prediction have following applications :-
Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction
Your Answer: Credit approval , Performance Prediction , Selective Marketing

Multiple Choice Single Answer
Question: Data matrix is :-
Correct Answer: Object by variable structure
Your Answer: Two mode matrix

Match The Following
Question | Correct Answer | Your Answer
Disparate data | Production data | Internal data
Non volatile data | Query and analysis | Production data
Data granularity | Level of detail | Archive data
Data from external source | External data | Query and analysis

Multiple Choice Single Answer
Question: Bitmapped indexes are more suitable for data warehouse environment than for an OLTP system
Correct Answer: Bitmapped index
Your Answer: B-Tree index

Multiple Choice Multiple Answer
Question: Building blocks of Data Warehouse are :-
Correct Answer: Source Data, Data Staging, Management and Control
Your Answer: Source Data, Data Staging, Management and Control

Multiple Choice Single Answer
Question: Queries run faster to find exact match using which type of indexing?
Correct Answer: Clustered index
Your Answer: Clustered index

True/False
Question: In the pruning method, postpruning requires more computation than prepruning yet generally leads to a more reliable tree.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Which of the following options is to share data by placing data at a common place :-
Correct Answer: Shared disk
Your Answer: Mass data transmission

Multiple Choice Single Answer
Question: The category in which the value of each attribute is preserved as status every time a change occurs is :-
Correct Answer: Periodic status
Your Answer: Periodic status

True/False
Question: In a decision tree, internal nodes are denoted by ovals and leaf nodes are denoted by rectangles.
Correct Answer: False
Your Answer: False

True/False
Question: Intelligent miner is an IBM data mining product.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Which from the following are special programs that are stored on database and fired when certain predefined action occurs?
Correct Answer: Triggers
Your Answer: Triggers

Multiple Choice Single Answer
Question: Attribute construction is the part of :-
Correct Answer: Transformation
Your Answer: Transformation

True/False
Question: Metadata acts like a nerve center.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer

Question: Data reduction includes :-
Correct Answer: Singular value decomposition, Wavelets, Regression
Your Answer: Wavelets, Regression

True/False
Question: Data cleansing means removing noisy and inconsistent data.
Correct Answer: True
Your Answer: True

True/False
Question: Data in warehouse is primarily for query.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-
Correct Answer: Data Cleaning, Relevance Analysis, Data Transformation
Your Answer: Data Cleaning, Data Transformation

Multiple Choice Multiple Answer
Question: Financial data collected for banking and financial industry are often relatively :-
Correct Answer: Complete, Reliable, High Quality
Your Answer: Complete, Reliable, Correct

Multiple Choice Single Answer
Question: Which of the options is not considered as a major function needed to get data ready?
Correct Answer: Storing data
Your Answer: Extracting data

Select The Blank
Question: ________ technique can be used to reduce the number of values for a given continuous attribute by dividing the range of the attribute into intervals.
Correct Answer: Discretization
Your Answer: Reduction

Multiple Choice Single Answer
Question: Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer: Nominal variable
Your Answer: Invariant variable

Multiple Choice Multiple Answer
Question: The ways of Intra query parallelization are :-
Correct Answer: Horizontal parallelization, Vertical Parallelization, Hybrid parallelization
Your Answer: Horizontal parallelization, Hybrid parallelization, Homogenous parallelization

True/False
Question: Legacy data resides on Hierarchical or Network database.
Correct Answer: True
Your Answer: True

Select The Blank
Question: Data cleansing and ________ methods of data mining help in integration of genetic data and construction of warehouse for genetic data analysis.
Correct Answer: Integration
Your Answer: Integration
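
One of the questions above names the simple matching approach for nominal variables. As a minimal sketch, assuming the usual textbook definition d(i, j) = (p - m) / p, where p is the number of variables and m the number on which the two objects agree, it could be computed as follows. The function name and the sample objects are illustrative only and are not part of the quiz.

```python
def simple_matching_dissimilarity(a, b):
    """Dissimilarity between two objects described by nominal variables.

    Assumes d = (p - m) / p, with p the number of variables and
    m the number of variables on which both objects take the same value.
    """
    if len(a) != len(b):
        raise ValueError("objects must have the same number of variables")
    p = len(a)
    m = sum(1 for x, y in zip(a, b) if x == y)
    return (p - m) / p


# Hypothetical objects described by colour, shape and size.
obj1 = ("red", "round", "small")
obj2 = ("red", "square", "small")
print(simple_matching_dissimilarity(obj1, obj2))
```

Two identical objects give 0.0 and two objects that disagree on every variable give 1.0, which is why the measure suits unordered categories where only equality is meaningful.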

Select The Blank
Question: ________ dimension of database in which primitive level data are spatial but generalization becomes non spatial.
Correct Answer: Spatial to non spatial
Your Answer: Spatial to non spatial

Select The Blank
Question: ________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer: Index tree
Your Answer: Index tree

Multiple Choice Multiple Answer
Question: Following factors play important role in financial analysis :-
Correct Answer: Data warehouse, Data cubes, Outlier analysis
Your Answer: Data warehouse, Data cubes, Outlier analysis

Multiple Choice Multiple Answer
Question: Following are the types of normalization :-
Correct Answer: Min-Max Normalization, Z-score normalization, Normalization by scaling
Your Answer: Min-Max Normalization, Normalization by scaling

Select The Blank
Question: ________ are responsible for running queries and reports against data warehouse tables.
Correct Answer: End users
Your Answer: End users

Multiple Choice Single Answer
Question: Which of the following approaches requires more computation?
Correct Answer: Filter approach
Your Answer: Wrapper approach

Select The Blank
Question: When a data block contains an excessive amount of free space, performance ________
Correct Answer: Degenerates
Your Answer: Degenerates

Multiple Choice Single Answer
Question: Which of the following types of processing provides high concurrency?
Correct Answer: SMP
Your Answer: MPP

Select The Blank
Question: ________ option of warehouse architecture provides incremental growth.
Correct Answer: Cluster
Your Answer: Cluster

Match The Following
Question | Correct Answer | Your Answer
Constructive merge | New record supersedes | New record supersedes
Initial Load | Populating data warehouse table first time | Populating data warehouse table first time
Incremental Load | Applying ongoing changes | Applying ongoing changes
Load Image | To correspond to target files | To correspond to target files
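
The normalization question above lists min-max, z-score and normalization by scaling. The sketch below shows one common reading of each, assuming that "normalization by scaling" means decimal scaling (dividing by a power of ten) and that the z-score uses the population standard deviation. The function names and the sample values are illustrative, not taken from the quiz.

```python
from statistics import mean, pstdev


def min_max(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so they span [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("min-max normalization needs at least two distinct values")
    return [new_min + (v - lo) / (hi - lo) * (new_max - new_min) for v in values]


def z_score(values):
    """Centre values on the mean and divide by the population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        raise ValueError("z-score normalization needs non-constant values")
    return [(v - mu) / sigma for v in values]


def decimal_scaling(values):
    """Divide by the smallest power of ten that brings every |value| below 1."""
    biggest = max(abs(v) for v in values)
    j = 0
    while biggest / 10 ** j >= 1:
        j += 1
    return [v / 10 ** j for v in values]


incomes = [200, 300, 400, 600, 1000]  # hypothetical attribute values
print(min_max(incomes))
print(z_score(incomes))
print(decimal_scaling(incomes))
```

Min-max preserves the relative spacing of the original values, z-score is useful when the minimum and maximum are unknown or distorted by outliers, and decimal scaling only moves the decimal point.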

Multiple Choice Multiple Answer
Question: Data cleansing routines work to clean the data by :-
Correct Answer: Filling missing values, Smoothing noisy data
Your Answer: Filling missing values, Smoothing noisy data, Resolving inconsistency

True/False
Question: From a data warehouse perspective, data mining can be viewed as an advanced stage of Online Analytical Processing.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ platform is the platform on which the data warehouse DBMS runs and the database exists.
Correct Answer: Data storage
Your Answer: Data storage

Multiple Choice Multiple Answer
Question: The smoothing techniques are :-
Correct Answer: Binning, Clustering, Regression
Your Answer: Clustering, Regression, Insertion

True/False
Question: The elements of warehouse infrastructure are classified into operational and physical infrastructure.
Correct Answer: True
Your Answer: True

Select The Blank
Question: It is good practice to drop ________ before initial load.
Correct Answer: Index
Your Answer: Splitting

Select The Blank
Question: ________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer: ROCK
Your Answer: CURE

Select The Blank
Question: Most DBMSs have ________ index techniques as default index techniques.
Correct Answer: B-Tree
Your Answer: B-Tree

True/False
Question: A distinguishing feature of Clementine is its object oriented extended module interface.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Effect of one attribute value on a given class being independent of the values of other attributes is called
Correct Answer: Value independence
Your Answer: Value independence

Multiple Choice Multiple Answer
Question: The information delivery methods from data warehouse are :-
Correct Answer: Complex queries, MD Analysis, Statistical Analysis

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Single Answer
Question: Capture at data source and that's why this method is quite reliable :-
Correct Answer: Capture by database Triggers
Your Answer: Capture in source application

Multiple Choice Single Answer
Question: Association rules mining is based on :-
Correct Answer: Clustering and Employing rules for classification
Your Answer: Rules for classification

Select The Blank
Question: A web server usually registers ________ entry for every access of a web page
Correct Answer: Weblog
Your Answer: Weblog

Select The Blank
Question: In data warehouse architecture, the ________ component interleaves with and connects other components.
Correct Answer: Metadata
Your Answer: Metadata

True/False
Question: To remove noise from data is called as Smoothing.
Correct Answer: True
Your Answer: True

Select The Blank
Question: Semantic integration of ________ genome database is the important task of DNA analysis.
Correct Answer: Heterogeneous and distributed
Your Answer: Homogenous and stagnant

True/False
Question: Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ does not handle categorical attributes.
Correct Answer: CURE
Your Answer: CURE

True/False
Question: Tools perform major functions in data warehouse environment.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Common areas of application for mixed effect model include :-
Correct Answer: Multiple data, Repeated measures data, Block designs
Your Answer: Multiple data, Dimensional data, Block designs

Multiple Choice Single Answer
Question: Bitmap index takes significantly less space than which type of index?
Correct Answer: B-Tree index
Your Answer: Clustered index

Multiple Choice Multiple Answer
Question: Data processing is done for :-
Correct Answer: Improving the efficiency, Ease of mining
Your Answer: Improving the efficiency, Removing redundancy, Removing complexity

Select The Blank
Question: ________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer: Data Transformation
Your Answer: Data Transformation

Multiple Choice Multiple Answer
Question: Missing values can be removed by :-
Correct Answer: Filling values manually, Use of global constant, Use of attribute mean
Your Answer: Filling values manually, Use of global constant, Use of row mean

Multiple Choice Multiple Answer
Question: The dimensions of spatial data cube are :-
Correct Answer: Non-spatial dimension, Spatial to non spatial, Spatial to spatial
Your Answer: Non-spatial dimension, Spatial to non spatial, Spatial to spatial

Select The Blank
Question: In ________ duplicate sub trees exist within the tree.
Correct Answer: Repetition
Your Answer: Replication

Select The Blank
Question: ________ are the inter platform devices that enable massive quantities of data to be transported from one platform to another.
Correct Answer: Data ports
Your Answer: Data cubes

Match The Following
Question | Correct Answer | Your Answer
Data loading tool | Primary key generation | Formulating and running queries
Data modeling tool | Reverse Engineering capabilities | Primary key generation
Data Extraction tool | Bulk extraction for full refresh | Bulk extraction for full refresh
Data transformation tool | Default values | Formulating and running queries

Select The Blank
Question: Most of the warehouses employ ________ database Management System.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Multiple Answer

Question: Metadata types can be classified as :-
Correct Answer: Business metadata, Technical metadata
Your Answer: Business metadata, Technical metadata, Logical metadata

True/False
Question: COBWEB is an extension of CLASSIT for incremental clustering of continuous data.
Correct Answer: False
Your Answer: True

Multiple Choice Single Answer
Question: Which type of analysis of DNA facilitates discovery of groups of genes and study of interactions and relationships between them?
Correct Answer: Association analysis
Your Answer: Generic data analysis

Multiple Choice Multiple Answer
Question: Following are the issues to consider during data integration :-
Correct Answer: Schema integration, Redundancy, Detection and resolution of data values
Your Answer: Schema integration, Redundancy, Detection and resolution of data values

Multiple Choice Single Answer
Question: Data migration affects performance requiring multiple blocks to be read which can be adjusted by :-
Correct Answer: Block percent free
Your Answer: Block percent vacant

Multiple Choice Multiple Answer
Question: Normalization improves :-
Correct Answer: Efficiency, Accuracy
Your Answer: Efficiency, Accuracy

True/False
Question: In smoothing by bin means, each value in a bin is replaced by the mean value of the bucket.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: In intermediate data extraction data capture through transaction log uses transaction from :-
Correct Answer: Recovery from failure
Your Answer: All Transaction

Select The Blank
Question: Indexed ________ engines search, index web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords
Correct Answer: Web Search
Your Answer: Web Search

Multiple Choice Single Answer
Question: The first step of attribute oriented induction is :-
Correct Answer: Data focusing
Your Answer: Data Collection

Multiple Choice Single Answer

Question: Enterprise miner technique provides data mining algorithms including distinguishing feature as :-
Correct Answer: Advanced Statistical and advanced visualization tool
Your Answer: Robust Graphics tools

Select The Blank
Question: ________ is a density based clustering method which computes an augmented cluster ordering for automatic and interactive cluster analysis
Correct Answer: DBSCAN
Your Answer: Hierarchical

True/False
Question: A process of grouping a set of physical or abstract objects into classes of similar objects is called clustering
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Grouped data can be analyzed with the technique :-
Correct Answer: Mixed effect model
Your Answer: Regression

Multiple Choice Multiple Answer
Question: Which of the following clustering analysis methods use a multiresolution approach?
Correct Answer: STING, Wave Cluster
Your Answer: STING, Only Wave Cluster

True/False
Question: COBWEB is a method of incremental conceptual clustering.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Source Data Component may be grouped into following categories :-
Correct Answer: Production Data, Internal External Data
Your Answer: Production Data, Internal External Data, Non Analyzed data

Multiple Choice Single Answer
Question: Which type of indexing does not work with data whose selectivity is low :-
Correct Answer: B-Tree index
Your Answer: B-Tree index

True/False
Question: Easily accessible metadata is crucial for end users.
Correct Answer: True
Your Answer: False

Match The Following
Question | Correct Answer | Your Answer
Clementine | Integral solutions | SAS
Intelligent miner | IBM | IBM
Enterprise miner | SAS | DB miner technology
Mineset | Silicon Graphics | Integral solutions

Multiple Choice Single Answer
Question: Data can be smoothed by fitting the data to a function such as :-

Correct Answer: Regression
Your Answer: Clustering

True/False
Question: Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The need for metadata is for :-
Correct Answer: Using data warehouse, Building data warehouse, Administration of warehouse
Your Answer: Using data warehouse, Building data warehouse, Administration of warehouse

Multiple Choice Multiple Answer
Question: The Architecture defines :-
Correct Answer: Measurements, Standard, General Design
Your Answer: Measurements, General Design, Standard Techniques

Multiple Choice Multiple Answer
Question: Following are the theories for the basis of data mining :-
Correct Answer: Pattern discovery, Probability theory, Microeconomic view
Your Answer: Pattern discovery, Probability theory, Macroeconomic view

Select The Blank
Question: In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.
Correct Answer: Smoothing by bin boundaries
Your Answer: Smoothing by bin boundaries

True/False
Question: Data Integration means multiple resources may be combined.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Which of the following functions involves data cleaning, data standardizing and summarizing?
Correct Answer: Transforming data
Your Answer: Transforming data
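
The binning questions above describe smoothing by bin means and smoothing by bin boundaries. The following is a minimal sketch of both, assuming equal-depth (equal-frequency) bins over sorted data and assuming that a value equally distant from both boundaries goes to the lower one. The function names and the price list are illustrative only.

```python
def equal_depth_bins(values, depth):
    """Sort the values and cut them into consecutive bins of `depth` items."""
    ordered = sorted(values)
    return [ordered[i:i + depth] for i in range(0, len(ordered), depth)]


def smooth_by_means(bins):
    """Replace every value in a bin with that bin's mean."""
    return [[sum(b) / len(b)] * len(b) for b in bins]


def smooth_by_boundaries(bins):
    """Replace every value with the closer of its bin's minimum and maximum.

    Ties go to the lower boundary (an assumption made for this sketch).
    """
    smoothed = []
    for b in bins:
        lo, hi = b[0], b[-1]  # bins are sorted, so these are min and max
        smoothed.append([lo if v - lo <= hi - v else hi for v in b])
    return smoothed


prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]  # hypothetical sorted data
bins = equal_depth_bins(prices, 3)
print(smooth_by_means(bins))       # [[9.0, 9.0, 9.0], [22.0, 22.0, 22.0], [29.0, 29.0, 29.0]]
print(smooth_by_boundaries(bins))  # [[4, 4, 15], [21, 21, 24], [25, 25, 34]]
```

Both variants smooth locally, because each value is adjusted using only the other values that fall into the same bin.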

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Select The Blank
Question: For operational system, the stored data contains ________ values.
Correct Answer: Current data
Your Answer: Current data

Select The Blank
Question: ________ is the user who has system access privileges but no database administration privileges as well as not for table and views.
Correct Answer: Network administrator
Your Answer: Security administrator

Multiple Choice Single Answer
Question: Selection of which part of data warehouse hardware is 'Bet your bottom dollar'?
Correct Answer: Server hardware
Your Answer: Workstation hardware

Multiple Choice Single Answer
Question: The Clustering method DBSCAN stands for :-
Correct Answer: Density Based Spatial Clustering of Applications with Noise
Your Answer: Density Based Spatial Clustering of Applications with Noise

Multiple Choice Single Answer
Question: Which of the options is not considered as a major function needed to get data ready?
Correct Answer: Storing data
Your Answer: Storing data

Multiple Choice Single Answer
Question: Which from the following are special programs that are stored on database and fired when certain predefined action occurs?
Correct Answer: Triggers
Your Answer: Triggers

Multiple Choice Multiple Answer
Question: User must have proper access to metadata for performing responsibilities of :-
Correct Answer: Design, Administration
Your Answer: Administration, Management

True/False
Question: Architecture comes first, tools follow it.
Correct Answer: True
Your Answer: True

True/False
Question: In the data acquisition area, the data flow begins at the data sources and pauses at staging area.
Correct Answer: True
Your Answer: False

Multiple Choice Single Answer
Question: OPTICS regarding clustering stands for :-
Correct Answer: Ordering Points To Identify the Clustering Structure
Your Answer: Ordering Points To Identify the Clustering Structure

Multiple Choice Multiple Answer
Question: In data storage area metadata recorded by processes is used for :-
Correct Answer: Users, Development, Administration
Your Answer: Development, Administration

Multiple Choice Multiple Answer
Question: Data reduction reduces data size by :-
Correct Answer: Aggregation, Eliminating redundant features
Your Answer: Aggregation, Eliminating redundant features

Multiple Choice Single Answer
Question: Which of the following is based on set of density distribution function clustering?
Correct Answer: DBSCAN

Your Answer: DBSCAN

True/False
Question: A distinct feature of DB Miner is its data cube based online analytical mining.
Correct Answer: True
Your Answer: True

True/False
Question: Metadata describes all the pertinent aspects of the data in data warehouse.
Correct Answer: True
Your Answer: True

Match The Following
Question | Correct Answer | Your Answer
Extraction is manual/Tool based | Method of extraction | Method of extraction
Identify source application | Source identification | Source identification
Denote time window | Time window | Time window
Handling unextractable input records | Exception handling | Exception handling

Multiple Choice Single Answer
Question: The stored value of an attribute that represents the value of the attribute at this moment of time is :-
Correct Answer: Current value
Your Answer: Current attribute

True/False
Question: The Structure that brings all the components together is known as Architecture.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ is the navigational map of data warehouse.
Correct Answer: End user Metadata
Your Answer: End user Metadata

Multiple Choice Single Answer
Question: Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer: Nominal variable
Your Answer: Nominal variable

Multiple Choice Multiple Answer
Question: Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-
Correct Answer: Data Cleaning, Relevance Analysis, Data Transformation
Your Answer: Data Cleaning, Relevance Analysis

Multiple Choice Single Answer
Question: Which of the following clustering algorithms integrates density based and grid based clustering?
Correct Answer: CLIQUE
Your Answer: STING

True/False
Question: Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.
Correct Answer: True

Your Answer: True

Select The Blank
Question: ________ is the time consuming and less feasible approach for filling missing values.
Correct Answer: Filling missing values manually
Your Answer: Filling missing values manually

Match The Following
Question | Correct Answer | Your Answer
Disparate data | Production data | Production data
Non volatile data | Query and analysis | Query and analysis
Data granularity | Level of detail | Level of detail
Data from external source | External data | External data

True/False
Question: Sequential pattern analysis and similarity search techniques have been developed in data mining.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Data processing is done for :-
Correct Answer: Improving the efficiency, Ease of mining
Your Answer: Improving the efficiency, Ease of mining

Multiple Choice Multiple Answer
Question: The smoothing techniques are :-
Correct Answer: Binning, Clustering, Regression
Your Answer: Binning, Clustering, Regression

Multiple Choice Single Answer
Question: Many methods for data smoothing are also methods for data reduction involving :-
Correct Answer: Discretization
Your Answer: Discretization

Multiple Choice Single Answer
Question: In data reduction, the cluster representations of data are used to :-
Correct Answer: Replace data
Your Answer: Represent actual data

True/False
Question: In the pruning method, postpruning requires more computation than prepruning yet generally leads to a more reliable tree.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ component of warehouse is responsible for coordinating services and activities within the data warehouse.
Correct Answer: Management and Control
Your Answer: Management and Control

Select The Blank
Question: ________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer: Data Transformation

Your Answer: Data Transformation

Multiple Choice Single Answer
Question: Which of the following types of clustering computes an augmented cluster ordering?
Correct Answer: OPTICS
Your Answer: CLIQUE

True/False
Question: Data cleansing means removing noisy and inconsistent data.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Classification and Prediction have following applications :-
Correct Answer: Credit approval, Medical Diagnosis, Performance Prediction
Your Answer: Credit approval, Medical Diagnosis, Performance Prediction

Select The Blank
Question: Creating ________ is violation of Normalization principles.
Correct Answer: Array
Your Answer: Structure

Multiple Choice Multiple Answer
Question: The areas of classification for metadata are :-
Correct Answer: Development/usage, Technical/business, BackRoom/Front Room
Your Answer: Development/usage, BackRoom/Front Room, Administration

Select The Blank
Question: ________ databases are one of the most popularly available and rich information repositories.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Multiple Answer
Question: The ways of Intra query parallelization are :-
Correct Answer: Horizontal parallelization, Vertical Parallelization, Hybrid parallelization
Your Answer: Horizontal parallelization, Vertical Parallelization, Hybrid parallelization

True/False
Question: Data Mining refers to extracting knowledge from large amounts of data.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Data base miner provides multiple data mining algorithms including :-
Correct Answer: Discovery driven OLAP analysis, Association, Classification
Your Answer: Association, Classification, Regression

Multiple Choice Multiple Answer
Question: Data transformation includes :-
Correct Answer: Smoothing, Aggregation, Generalization
Your Answer: Smoothing, Aggregation, Generalization

Select The Blank
Question: ________ includes Normalization and Aggregation as data preprocessing procedures.
Correct Answer: Data transformation

Your Answer: Data transformation

Multiple Choice Single Answer
Question: Association rules mining is based on :-
Correct Answer: Clustering and Employing rules for classification
Your Answer: Clustering and Employing rules for classification

Select The Blank
Question: Semantic integration of ________ genome database is the important task of DNA analysis.
Correct Answer: Heterogeneous and distributed
Your Answer: Heterogeneous and distributed

Select The Blank
Question: ________ regression involves finding the best line to fit two variables.
Correct Answer: Linear
Your Answer: Linear
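
The linear regression question above refers to finding the best line through two variables. As a rough sketch, assuming "best" means the ordinary least squares line y = slope * x + intercept, the coefficients can be computed directly from the data. The function name and the sample points are illustrative and not part of the quiz.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least squares line through the points."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least two points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical, so the slope is undefined")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical points that lie exactly on y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```

With real data the points will not lie exactly on a line, and the fitted line then minimizes the sum of squared vertical distances to the points.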

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

True/False
Question: Data cubes created for varying levels of abstraction are referred to as cuboids.
Correct Answer: True
Your Answer: True

True/False
Question: Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ pilot proves validity of data warehousing concept to users and top management.
Correct Answer: Proof of concept
Your Answer: User tool appreciation

Multiple Choice Multiple Answer
Question: Missing values can be removed by :-
Correct Answer: Filling values manually, Use of global constant, Use of attribute mean
Your Answer: Filling values manually, Use of global constant, Use of attribute mean

Multiple Choice Single Answer
Question: Which of the following types of processing provides high concurrency?
Correct Answer: SMP
Your Answer: SMP

True/False
Question: Lower the level of detail, finer the data granularity.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer

Question: Effect of one attribute value on a given class being independent of the values of other attributes is called
Correct Answer: Value independence
Your Answer: Value independence

Select The Blank
Question: According to ________ theory database schema consist of data and patterns that are stored in database.
Correct Answer: Inductive databases
Your Answer: Inductive databases

True/False
Question: A cluster is a collection of data objects that are similar to one another within the same cluster and dissimilar to objects in another cluster.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Warehouse Operational infrastructure is to support each architecture component consists of :-
Correct Answer: People, Procedures, Management software
Your Answer: People, Procedures, Management software

Multiple Choice Multiple Answer
Question: Time variant nature of the data in data warehouse :-
Correct Answer: Allows for analysis of the past, Relate information to the present, Enables forecasts for the future
Your Answer: Allows for analysis of the past, Relate information to the present, Enables forecasts for the future

Multiple Choice Multiple Answer
Question: Methods for outlier detection are categorised into following approaches :-
Correct Answer: Statistical, Distance based, Deviation based
Your Answer: Distance based, Deviation based, Diversion based

Select The Blank
Question: ________ regression involves finding the best line to fit two variables.
Correct Answer: Linear
Your Answer: Linear

Multiple Choice Single Answer
Question: Association rules mining is based on :-
Correct Answer: Clustering and Employing rules for classification
Your Answer: Clustering and Employing rules for classification

True/False
Question: In smoothing by bin means, each value in a bin is replaced by the mean value of the bucket.
Correct Answer: True
Your Answer: True

True/False
Question: Metadata describes all the pertinent aspects of the data in data warehouse.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Following are the theories for the basis of data mining :-
Correct Answer: Pattern discovery, Probability theory, Microeconomic view
Your Answer: Microeconomic view, Pattern discovery, Probability theory

Multiple Choice Single Answer
Question: Which technique is used to predict categorical response variable?
Correct Answer: Discriminant analysis
Your Answer: Analysis of variance

Multiple Choice Single Answer
Question: EIS stands for :-
Correct Answer: Executive Information System
Your Answer: Executive Information System

Match The Following
Question | Correct Answer | Your Answer
Integration | Data merging from multiple sources | Data merging from multiple sources
Binning | Sorted, neighbourhood data | Sorted, neighbourhood data
Clustering | Similar values | Similar values
Regression | Filtering of data | Filtering of data

Multiple Choice Single Answer
Question: The DWT (Discrete Wavelet Transform) is a :-
Correct Answer: Linear signal processing technique
Your Answer: Linear signal processing technique

True/False
Question: Data mining often requires data integration.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Which is the typical example of Grid based clustering method
Correct Answer: STING
Your Answer: DBSCAN

Multiple Choice Single Answer
Question: Classification rules are extracted from
Correct Answer: Decision Tree
Your Answer: Decision Tree

Multiple Choice Multiple Answer
Question: For processing metadata in information delivery area, data can be referred back for :-
Correct Answer: Source data configuration, Data structure, Data transformation
Your Answer: Source data configuration, Data structure, Data transformation

Match The Following
Question | Correct Answer | Your Answer
Constructive merge | New record supersedes | New record supersedes
Initial Load | Populating data warehouse table first time | Populating data warehouse table first time
Incremental Load | Applying ongoing changes | Applying ongoing changes
Load Image | To correspond to target files | To correspond to target files

Select The Blank

Question: ________ is the clustering method which encounters difficulties regarding the selection of merge/split points
Correct Answer: Hierarchical
Your Answer: Hierarchical

Multiple Choice Multiple Answer
Question: Substantial portion of Business metadata originates from :-
Correct Answer: Textual documents, Spreadsheets, Business rules
Your Answer: Textual documents, Spreadsheets, Business rules

True/False
Question: In the pruning method, postpruning requires more computation than prepruning yet generally leads to a more reliable tree.
Correct Answer: True
Your Answer: True

Select The Blank
Question: Human beings have around ________ genes.
Correct Answer: 100000
Your Answer: 100000

Multiple Choice Single Answer
Question: Which of the following types executes query operations in pipeline manner?
Correct Answer: Vertical parallelism
Your Answer: Vertical parallelism

Select The Blank
Question: In ________ duplicate sub trees exist within the tree.
Correct Answer: Repetition
Your Answer: Repetition

Multiple Choice Single Answer
Question: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-
Correct Answer: Huge size of data
Your Answer: Complexity in data

Multiple Choice Single Answer
Question: The technique of data clustering facilitates :-
Correct Answer: Serial access
Your Answer: Random access

Multiple Choice Multiple Answer
Question: Before moving data to data warehouse it has to go through :-
Correct Answer: Transformation, Integration, Consolidation
Your Answer: Integration, Summarization, Consolidation

Multiple Choice Single Answer
Question: Bayes Theorem is :-
Correct Answer: P(H|X) = P(X|H) P(H) / P(X)
Your Answer: P(H|X) = P(X) P(H) / P(X|H)

True/False
Question: MDDBMS stands for - Multilevel Database Management System.
Correct Answer: False
Your Answer: False
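
The Bayes theorem question above relates the posterior P(H|X) to the likelihood P(X|H), the prior P(H) and the evidence P(X). The sketch below works one small example with exact fractions, assuming the standard form P(H|X) = P(X|H) P(H) / P(X) and obtaining P(X) from the law of total probability. The probabilities used are made up for illustration.

```python
from fractions import Fraction


def evidence(p_x_given_h, p_h, p_x_given_not_h):
    """P(X) by total probability over H and not-H."""
    return p_x_given_h * p_h + p_x_given_not_h * (1 - p_h)


def posterior(p_x_given_h, p_h, p_x):
    """P(H|X) = P(X|H) * P(H) / P(X)."""
    return p_x_given_h * p_h / p_x


# Hypothetical numbers: H = "customer buys", X = "customer is under 30".
p_h = Fraction(1, 5)               # prior P(H)
p_x_given_h = Fraction(9, 10)      # likelihood P(X|H)
p_x_given_not_h = Fraction(3, 10)  # P(X|not H)

p_x = evidence(p_x_given_h, p_h, p_x_given_not_h)
print(p_x)                                # 21/50
print(posterior(p_x_given_h, p_h, p_x))   # 3/7
```

Seeing X raises the probability of H from 1/5 to 3/7 here because X is three times as likely under H as under its complement.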


Multiple Choice Multiple Answer
Question: DNA sequences are comprised of :-
Correct Answer: Adenine, Guanine, Thymine
Your Answer: Adenine, Cytosine, Guanine, Thymine

Multiple Choice Multiple Answer
Question: Financial data collected for banking and financial industry are often relatively :-
Correct Answer: Complete, Reliable, High Quality
Your Answer: Complete, Reliable, High Quality

Multiple Choice Single Answer
Question: Deviation based outlier detection identifies outliers by :-
Correct Answer: Examining character of objects in groups
Your Answer: Examining distance between objects

Multiple Choice Multiple Answer
Question: The functions of data acquisition are :-
Correct Answer: Data Extraction, Data Transformation
Your Answer: Data Extraction, Data Transformation, Data cleansing, Data storing

Select The Blank
Question: ________ databases are one of the most popularly available and rich information repositories.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Single Answer
Question: A Wavelet transformation is :-
Correct Answer: Signal processing technique that decomposes signals into different frequency subbands
Your Answer: Signal processing technique that composes signals into different frequency subbands

Select The Blank
Question: Creating ________ is violation of Normalization principles.
Correct Answer: Array
Your Answer: Array

Select The Blank
Question: ________ method of regression is useful when errors fail to satisfy normal conditions.
Correct Answer: Robust
Your Answer: Robust

True/False
Question: Sequential pattern analysis and similarity search techniques have been developed in data mining.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: SMP stands for :-
Correct Answer: Symmetric Multiprocessing
Your Answer: Symmetric Multiprocessing


LIST OF ATTEMPTED QUESTIONS AND ANSWERS sheetu 2

Multiple Choice Multiple Answer
Question: Data Mining means :-
Correct Answer: Knowledge mining from database, Data/Pattern analysis, Data Archaeology
Your Answer: Data Archaeology, Knowledge mining from database, Data/Pattern analysis

Select The Blank
Question: ________ technique contributes to machine learning, neural network, association mining, sequential pattern mining.
Correct Answer: Pattern discovery
Your Answer: Pattern discovery

Match The Following
Question | Correct Answer | Your Answer
Operating systems | Security, reliability, availability | Security, reliability, availability Compatibility
Data Acquisition | Data Extraction, Transformation, cleansing, integration | Data Extraction, Transformation, cleansing, integration
Data Storage | Data loading, Archiving | Data loading, Archiving
Information Delivery | Report generation, query processing and complex analysis | Report generation, query processing and complex analysis

True/False
Question: The Structure that brings all the components together is known as Architecture.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Advantages of Wavelet transformation for clustering are :-
Correct Answer: Unsupervised clustering, Detection of cluster for accuracy, Clustering is fast
Your Answer: Unsupervised clustering, Detection of cluster for accuracy, Decomposition of cluster for accuracy

Multiple Choice Multiple Answer
Question: The Main areas of Data Warehouse are :-
Correct Answer: Data acquisition, Data Storage, Information Delivery
Your Answer: Data Stage, Data Storage, Information Delivery

True/False
Question: In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by rectangles.
Correct Answer: False
Your Answer: False

True/False
Question: In Database system multidimensional index trees are primarily used for providing fast data access.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ is the platform for complex data transformation for the purpose of cleansing it.
Correct Answer: Separate optimal Platform
Your Answer: Separate optimal Platform

Multiple Choice Single Answer
Question: Bitmapped indexes are more suitable for data warehouse environment than for an OLTP system
Correct Answer: Bitmapped index
Your Answer: Bitmapped index

Multiple Choice Single Answer
Question: The Clustering method DBSCAN stands for :-
Correct Answer: Density Based Spatial Clustering of Applications with Noise
Your Answer: Density Based Spatial Clustering of Applications with Noise

Select The Blank
Question: ________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer: ROCK
Your Answer: ROCK

Multiple Choice Single Answer
Question: Query tool is meant for :-
Correct Answer: Data acquisition
Your Answer: Information delivery

Select The Blank
Question: ________ are responsible for running queries and reports against data warehouse tables.
Correct Answer: End users
Your Answer: End users

Multiple Choice Multiple Answer
Question: Classification and Prediction have following applications :-
Correct Answer: Credit approval, Medical Diagnosis, Performance Prediction
Your Answer: Credit approval, Medical Diagnosis, Performance Prediction

Select The Blank
Question: ________ architecture is more concerned with data access than memory access.
Correct Answer: MPP
Your Answer: MPP

Select The Blank
Question: ________ are the inter platform devices that enable massive quantities of data to be transported from one platform to another.
Correct Answer: Data ports
Your Answer: Data ports

Multiple Choice Single Answer
Question: Which technique analyzes experimental data?
Correct Answer: Analysis of variance
Your Answer: Regression

True/False
Question: Data classification is a two step process in which the first step includes classification of model and in the second step the model describes a set of data.
Correct Answer: False
Your Answer: True


Select The Blank
Question: ________ clustering method follows statistical and neural network approach.
Correct Answer: Model based
Your Answer: Model based

Multiple Choice Single Answer
Question: Which of the following methods for regression is used on sparse data :-
Correct Answer: Regression and log-linear model
Your Answer: Regression and log-linear model

True/False
Question: Audio data mining can be an interesting alternative to visual mining.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: If many indexes are needed, then on which table which option is more preferable?
Correct Answer: Splitting of tables
Your Answer: Collecting of tables

Select The Blank
Question: Indexed ________ engines search and index web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords.
Correct Answer: Web Search
Your Answer: Web Search

Multiple Choice Multiple Answer
Question: Distinguishing characteristics of data warehouse architecture are :-
Correct Answer: Different Objective Scope, Data Content, Flexible and Dynamic
Your Answer: Different Objective Scope, Data Content, Flexible and Dynamic

Multiple Choice Single Answer
Question: Which type of analysis of DNA facilitates discovery of group of genes and study of interaction and relationship between them?
Correct Answer: Association analysis
Your Answer: Association analysis

True/False
Question: Noise in data means error or variance in measured variable.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ is the user who has all access privileges like system, database administrator, for table and views.
Correct Answer: Security administrator
Your Answer: Security administrator

Multiple Choice Multiple Answer
Question: The main categories of Metadata in warehouse are :-
Correct Answer: Operational, Extraction and transformation Metadata, End user Metadata
Your Answer: Operational, Extraction and transformation Metadata, End user Metadata

Multiple Choice Single Answer
Question: Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer: Nominal variable
Your Answer: Nominal variable

True/False
Question: One of the most important search problems in genetic analysis is similarity search and comparison among DNA sequences.
Correct Answer: True
Your Answer: True

True/False
Question: Data cube stores multidimensional aggregate information.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Large number of indexes affects the loading process because :-
Correct Answer: Indexes are created for new records
Your Answer: Searching record becomes difficult

Select The Blank
Question: Most of the warehouses employ ________ database Management System.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Single Answer
Question: In immediate data extraction, data capture through transaction log uses transactions from :-
Correct Answer: Recovery from failure
Your Answer: Recovery from failure

Multiple Choice Single Answer
Question: Redundancies can be deleted by :-
Correct Answer: Correlation analysis
Your Answer: Correlation analysis

True/False
Question: Descriptive mining performs inference on current data while predictive mining characterizes the general properties of data in the database.
Correct Answer: False
Your Answer: True

Select The Blank
Question: When data block contains excessive amount of free space, performance ________
Correct Answer: Degenerates
Your Answer: Degenerates

Multiple Choice Multiple Answer
Question: The smoothing techniques are :-
Correct Answer: Binning, Clustering, Regression
Your Answer: Binning, Clustering, Regression

True/False
Question: A process of grouping a set of physical or abstract objects into classes of similar objects is called clustering.
Correct Answer: True
Your Answer: True
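The smoothing item above names binning as one of the techniques. As a rough illustration only, the sketch below smooths by bin means using equal-depth bins: values are sorted, cut into bins of a fixed size, and each value is replaced by the mean of its bin. The function name, the bin size, and the sample values are assumptions made for this example and do not come from the question bank.

```python
def smooth_by_bin_means(values, bin_size):
    """Equal-depth binning: sort, split into bins of bin_size, replace by bin mean."""
    if bin_size <= 0:
        raise ValueError("bin_size must be positive")
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        mean = sum(bin_values) / len(bin_values)
        # Every member of the bin takes the bin mean, which damps local noise.
        smoothed.extend([mean] * len(bin_values))
    return smoothed


# Illustrative values only (hypothetical price data).
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
print(smooth_by_bin_means(prices, 3))
```

Smoothing by bin medians or bin boundaries follows the same pattern, with only the replacement value changing.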


Multiple Choice Single Answer
Question: For Banking and financial data which type of analysis is used?
Correct Answer: Multidimensional
Your Answer: Relational

Multiple Choice Multiple Answer
Question: The dimensions of spatial data cube are :-
Correct Answer: Non-spatial dimension, Spatial to non spatial, Spatial to spatial
Your Answer: Non-spatial dimension, Spatial to non spatial, Spatial to spatial

Multiple Choice Single Answer
Question: Which of the following technique involves placing and managing related units of data in same physical block of storage?
Correct Answer: Clustering
Your Answer: Clustering

Multiple Choice Multiple Answer
Question: Data processing techniques are :-
Correct Answer: Cleansing, Integration, Transformation
Your Answer: Cleansing, Integration, Transformation

Match The Following
Question | Correct Answer | Your Answer
Clustering | Data tuples as objects | Great accuracy
Dimension reduction | Removal of irrelevant data | Removal of irrelevant data
Data compression | More computations | Encoding mechanism
Wrapper approach | Great accuracy | Data reduction

Select The Blank
Question: ________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer: Index tree
Your Answer: Index tree

Multiple Choice Multiple Answer
Question: Following are the issues to consider during data integration :-
Correct Answer: Schema integration, Redundancy, Detection and resolution of data values
Your Answer: Schema integration, Redundancy, Detection and resolution of data values

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Single Answer
Question: Histograms, the methods to store reduced representation of data, use :-
Correct Answer: Binning
Your Answer: Aggregation

Multiple Choice Single Answer
Question: Which of the following is based on set of density distribution function clustering?
Correct Answer: DBSCAN
Your Answer: DBSCAN

Multiple Choice Multiple Answer
Question: Source Data Component may be grouped into following categories :-
Correct Answer: Production Data, Internal External Data
Your Answer: Production Data, Internal External Data

Select The Blank
Question: ________ does not handle categorical attributes.
Correct Answer: CURE
Your Answer: CURE

Select The Blank
Question: Semantic integration of ________ genome database is the important task of DNA analysis.
Correct Answer: Heterogeneous and distributed
Your Answer: Heterogeneous and distributed

True/False
Question: Data staging and data storage may start out on same computing platform.
Correct Answer: True
Your Answer: True

True/False
Question: Data in data warehouse cuts across application.
Correct Answer: True
Your Answer: False

True/False
Question: Loan payment prediction and customer credit analysis are critical to business of bank.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: Data integration means :-
Correct Answer: Integrating database, Integrating cubes, Integrating files
Your Answer: Integrating cubes, Integrating files, Integrating attributes

Multiple Choice Multiple Answer
Question: Data mining is applicable to :-
Correct Answer: Relational Database, Data Warehouse, Transaction Database
Your Answer: Relational Database, Data Warehouse, Transaction Database

Multiple Choice Multiple Answer
Question: The information delivery methods from data warehouse are :-
Correct Answer: Complex queries, MD Analysis, Statistical Analysis
Your Answer: Complex queries, MD Analysis, Statistical Analysis

Multiple Choice Multiple Answer
Question: SMP provides the features like :-
Correct Answer: Controllers which are accessible to all processors, Each processor has full access to the shared memory through common bus, Each node has access to common set of disks
Your Answer: Controllers which are accessible to all processors, Each processor has full access to the shared memory through common bus, Each node has access to common set of disks

Multiple Choice Multiple Answer
Question: Splitting of query by DBMS in intra query parallelization is for :-
Correct Answer: Index read, Data read, Data join
Your Answer: Index read, Data read, Data join


Multiple Choice Single Answer
Question: For Incremental data loads the sequence is :-
Correct Answer: Triggering -> Filtering -> Data extraction -> Transformation -> Integration -> Cleansing
Your Answer: Triggering -> Data extraction -> Filtering -> Transformation -> Integration -> Cleansing

Multiple Choice Multiple Answer
Question: The platform of Data warehouse consists of :-
Correct Answer: Basic hardware components, Operating System, Network and Network software
Your Answer: Operating System, Network and Network software, Utility software

Multiple Choice Multiple Answer
Question: Following factors play important role in financial analysis :-
Correct Answer: Data warehouse, Data cubes, Outlier analysis
Your Answer: Data warehouse, Data cubes, Outlier analysis

Multiple Choice Single Answer
Question: Which of the following data capture method of data extraction is time consuming?
Correct Answer: Capture by comparing files
Your Answer: Capture by comparing files

Multiple Choice Single Answer
Question: Capture occurs at the data source, which is why this method is quite reliable :-
Correct Answer: Capture by database Triggers
Your Answer: Capture by database Triggers

True/False
Question: To remove noise from data is called as Smoothing.
Correct Answer: True
Your Answer: True

True/False
Question: NUMA provides better scalability than SMP.
Correct Answer: True
Your Answer: True

Multiple Choice Multiple Answer
Question: The Architecture defines :-
Correct Answer: Measurements, Standard, General Design
Your Answer: Measurements, Standard, General Design

Multiple Choice Multiple Answer
Question: Data reduction includes :-
Correct Answer: Singular value decomposition, Wavelets, Regression
Your Answer: Singular value decomposition, Wavelets, Regression

Multiple Choice Single Answer
Question: Which of the following component includes database Management System?
Correct Answer: Data Storage
Your Answer: Management and control

Match The Following
Question | Correct Answer | Your Answer
Data loading tool | Primary key generation | Primary key generation
Data modeling tool | Reverse Engineering capabilities | Reverse Engineering capabilities
Data Extraction tool | Bulk extraction for full refresh | Bulk extraction for full refresh
Data transformation tool | Default values | Default values

Multiple Choice Single Answer
Question: Which type of following clustering computes augmented cluster ordering?
Correct Answer: OPTICS
Your Answer: OPTICS

Multiple Choice Single Answer
Question: Which from the following are special programs that are stored on database and fired when certain predefined action occurs?
Correct Answer: Triggers
Your Answer: Triggers

Multiple Choice Single Answer
Question: Attribute construction is the part of :-
Correct Answer: Transformation
Your Answer: Transformation

Multiple Choice Single Answer
Question: The stored value of an attribute that represents the value of the attribute at this moment in time is :-
Correct Answer: Current value
Your Answer: Current value

Multiple Choice Single Answer
Question: The option "capture in source application" technique of data extraction degrades performance of source application because :-
Correct Answer: Additional processing needs
Your Answer: Additional processing needs

Select The Blank
Question: ________ technique is the statistical technique for analyzing data.
Correct Answer: Time series
Your Answer: Analysis of variance

Select The Blank
Question: ________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer: Data Transformation
Your Answer: Data Transformation

True/False
Question: To detect money laundering and other financial crimes, it is important to integrate information from multiple databases.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Which of the following option of data extraction is known as application assisted data capture?
Correct Answer: Capture in source application
Your Answer: Capture in source application

Multiple Choice Single Answer
Question: Dimensionality reduction reduces the data set size by removing :-
Correct Answer: Irrelevant attributes
Your Answer: Irrelevant attributes

Multiple Choice Single Answer
Question: Maintenance of cache consistency is the limitation of :-
Correct Answer: MPP
Your Answer: NUMA

Select The Blank
Question: ________ is the method used to predict the value of response variable from one or more variables.
Correct Answer: Regression
Your Answer: Analysis of variance

True/False
Question: Metadata is building block of data warehouse.
Correct Answer: True
Your Answer: True

Select The Blank
Question: ________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer: Broad business pilot
Your Answer: Broad business pilot

Select The Blank
Question: In data ________, data encoding or transformations are applied to obtain reduced or compressed representation.
Correct Answer: Compression
Your Answer: Compression

Multiple Choice Multiple Answer
Question: Metadata in a data warehouse falls into following categories :-
Correct Answer: Operational Metadata, Extraction and Transformation metadata, End-user Metadata
Your Answer: Operational Metadata, Extraction and Transformation metadata, End-user Metadata

True/False
Question: Data integration merges data from multiple sources into coherent sources.
Correct Answer: True
Your Answer: True

Match The Following
Question | Correct Answer | Your Answer
Administration | Providing support for all DBA functions | Support for System administration
Extensibility | Hybrid Extension to OLAP database | Providing support for all DBA functions
Portability | Across platform | APIs For tools from loading vendors
Query tool | APIs For tools from loading vendors | Hybrid Extension to OLTP database


Multiple Choice Multiple Answer
Question: Data transformation includes :-
Correct Answer: Smoothing, Aggregation, Generalization
Your Answer: Smoothing, Aggregation, Generalization

Multiple Choice Multiple Answer
Question: Knowledge discovery process includes :-
Correct Answer: Data Cleaning, Data Integration, Data Selection
Your Answer: Data Cleaning, Data Integration, Data Selection

Multiple Choice Single Answer
Question: Queries run faster to find exact match using which type of indexing?
Correct Answer: Clustered index
Your Answer: Clustered index

True/False
Question: Intelligent miner is an IBM data mining product.
Correct Answer: True
Your Answer: True

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Multiple Answer
Question: Building blocks of Data Warehouse are :-
Correct Answer: Management and Control, Source Data, Data Staging
Your Answer: Management and Control, Source Data, Data Staging

Multiple Choice Single Answer
Question: Substantial portion of available information is stored in :-
Correct Answer: Text data
Your Answer: Object oriented database

True/False
Question: The data Warehouse is query-centric.
Correct Answer: True
Your Answer: True

True/False
Question: Data mining is a piece of integrated solutions.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Which of the following data capture method of data extraction is time consuming?
Correct Answer: Capture by comparing files
Your Answer: Capture by comparing files

Select The Blank
Question: ________ does not handle categorical attributes.
Correct Answer: CURE
Your Answer: CURE

True/False
Question: In the data acquisition area, the data flow begins at the data sources and pauses at staging area.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Association rules mining is based on :-
Correct Answer: Clustering and Employing rules for classification
Your Answer: Clustering and Employing rules for classification

True/False
Question: In physical design of warehouse, developing standard ensures consistency across the various areas.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Bayes Theorem is :-
Correct Answer: P(H|X) = P(X|H) P(H) / P(X)
Your Answer: P(H|X) = P(X|H) P(H) / P(X)
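To make the Bayes theorem item above concrete, here is a minimal sketch that computes the posterior P(H|X) from the prior P(H), the likelihood P(X|H), and the evidence P(X). The function name and the probabilities are hypothetical and chosen only for illustration. P(X) is obtained from the law of total probability over H and not-H.

```python
def posterior(prior_h, likelihood_x_given_h, evidence_x):
    """Return P(H|X) = P(X|H) * P(H) / P(X)."""
    if evidence_x <= 0:
        raise ValueError("P(X) must be positive")
    return likelihood_x_given_h * prior_h / evidence_x


# Hypothetical numbers, for illustration only.
p_h = 0.3              # prior P(H)
p_x_given_h = 0.8      # likelihood P(X|H)
p_x_given_not_h = 0.2  # likelihood P(X|not H)

# Law of total probability: P(X) = P(X|H)P(H) + P(X|not H)P(not H)
p_x = p_x_given_h * p_h + p_x_given_not_h * (1 - p_h)

print(round(posterior(p_h, p_x_given_h, p_x), 4))
```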

Select The Blank
Question: Indexed ________ engines search and index web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords.
Correct Answer: Web Search
Your Answer: Web Search

Multiple Choice Single Answer
Question: Data matrix is :-
Correct Answer: Object by variable structure
Your Answer: Object by variable structure

Multiple Choice Single Answer
Question: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-
Correct Answer: Huge size of data
Your Answer: Huge size of data

Multiple Choice Single Answer
Question: Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer: Nominal variable
Your Answer: Nominal variable
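For the simple matching item above, a small sketch may help. It assumes the usual form d(i, j) = (p - m) / p, where p is the number of nominal variables and m is the number of variables on which the two objects agree. The function name and the sample objects are made up for this example.

```python
def simple_matching_dissimilarity(obj_a, obj_b):
    """Dissimilarity for nominal variables: share of variables that do not match."""
    if len(obj_a) != len(obj_b) or len(obj_a) == 0:
        raise ValueError("objects must have the same, non-zero number of variables")
    total = len(obj_a)                                        # p
    matches = sum(1 for a, b in zip(obj_a, obj_b) if a == b)  # m
    return (total - matches) / total


# Two hypothetical objects described by three nominal variables.
first = ("red", "small", "round")
second = ("red", "large", "round")
print(simple_matching_dissimilarity(first, second))
```

A value of 0 means the objects agree on every variable, and 1 means they agree on none.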

Multiple Choice Multiple Answer
Question: Clustering Techniques organised into following categories :-
Correct Answer: Partitioning, Density Based, Grid Based
Your Answer: Partitioning, Density Based, Grid Based

Select The Blank
Question: Most of the warehouses employ ________ database Management System.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Single Answer
Question: Data cleansing effort can begin with :-
Correct Answer: High priority data
Your Answer: High priority data

True/False
Question: Sequential pattern analysis and similarity search techniques have been developed in data mining.
Correct Answer: True
Your Answer: True

Match The Following
Question | Correct Answer | Your Answer
Load Utility | High performance data loading, recovery | High performance data loading, recovery
Query Governor | Abort runaway query | Abort runaway query
Query Optimizer | Parsing, optimizing query | Parsing, optimizing query
Query Management | Balancing extraction of query | Balancing extraction of query

Multiple Choice Multiple Answer
Question: Distinguishing characteristics of data warehouse architecture are :-
Correct Answer: Different Objective Scope, Data Content, Flexible and Dynamic
Your Answer: Different Objective Scope, Data Content, Flexible and Dynamic

Multiple Choice Single Answer
Question: Which type of integrity constraint forces the establishment of parent-child relationship?
Correct Answer: Referential integrity
Your Answer: Referential integrity
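The referential integrity item above can be illustrated with a small sketch using Python's built-in sqlite3 module. The table and column names (customer, orders, customer_id) are hypothetical. SQLite enforces foreign keys only after `PRAGMA foreign_keys = ON`, so the sketch switches that on first. A child row is then accepted only if its parent row exists.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a parent table and a child table."""
    conn = sqlite3.connect(":memory:")
    # SQLite enforces foreign keys only when this pragma is enabled.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
    conn.execute(
        "CREATE TABLE orders ("
        " id INTEGER PRIMARY KEY,"
        " customer_id INTEGER NOT NULL REFERENCES customer(id))"
    )
    conn.execute("INSERT INTO customer (id, name) VALUES (1, 'Asha')")
    conn.commit()
    return conn


def try_insert_order(conn, order_id, customer_id):
    """Return True if the child row is accepted, False if the constraint rejects it."""
    try:
        conn.execute(
            "INSERT INTO orders (id, customer_id) VALUES (?, ?)",
            (order_id, customer_id),
        )
        return True
    except sqlite3.IntegrityError:
        return False


conn = build_demo_db()
print(try_insert_order(conn, 10, 1))   # parent row 1 exists
print(try_insert_order(conn, 11, 99))  # no parent row 99
```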

Select The Blank
Question: An information measure called ________ can be used to recursively partition the values of numeric attribute.
Correct Answer: Entropy
Your Answer: Entropy
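As a rough sketch of the entropy item above, the code below finds one split point for a numeric attribute. It tries each midpoint between adjacent distinct values and keeps the one with the lowest weighted class entropy. The function names and the data are assumptions for illustration. A full discretization would repeat the same step on each resulting partition until some stopping rule is met.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def best_split(values, labels):
    """Return (threshold, weighted_entropy) for the best binary split, or (None, inf)."""
    pairs = sorted(zip(values, labels))
    best_threshold, best_score = None, float("inf")
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no boundary between equal values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [label for _, label in pairs[:i]]
        right = [label for _, label in pairs[i:]]
        score = (len(left) * entropy(left) + len(right) * entropy(right)) / len(pairs)
        if score < best_score:
            best_threshold, best_score = threshold, score
    return best_threshold, best_score


# Hypothetical attribute values with two classes.
ages = [1, 2, 3, 10, 11, 12]
classes = ["a", "a", "a", "b", "b", "b"]
print(best_split(ages, classes))
```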

True/False
Question: Metadata is building block of data warehouse.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: In which of the following type of mining frequently occurring patterns related to time and sequence are mined?
Correct Answer: Sequential pattern mining
Your Answer: Time series data mining

Select The Blank
Question: ________ is the time consuming and less feasible approach for filling missing values.
Correct Answer: Filling missing values manually
Your Answer: Filling missing values manually

Multiple Choice Multiple Answer
Question: Classification and Prediction have following applications :-
Correct Answer: Credit approval, Medical Diagnosis, Performance Prediction
Your Answer: Credit approval, Medical Diagnosis, Performance Prediction

Multiple Choice Multiple Answer
Question: Data processing techniques are :-
Correct Answer: Cleansing, Integration, Transformation
Your Answer: Cleansing, Integration, Transformation


True/False
Question: Data in warehouse is primarily for query.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Data reduction obtains a reduced representation of data set that is :-
Correct Answer: Much smaller
Your Answer: Much smaller

Multiple Choice Single Answer
Question: Which of the following type executes query operations in pipeline manner?
Correct Answer: Vertical parallelism
Your Answer: Vertical parallelism

Multiple Choice Single Answer
Question: User gets an enterprise wide view of information from the data warehouse due to :-
Correct Answer: Improved productivity
Your Answer: Newer opportunity

Select The Blank
Question: ________ databases are one of the most popularly available and rich information repositories.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Single Answer
Question: Which database type stores a large amount of space-related data?
Correct Answer: Spatial
Your Answer: Spatial

Multiple Choice Multiple Answer
Question: DNA sequences are comprised of :-
Correct Answer: Adenine, Guanine, Thymine
Your Answer: Adenine, Guanine, Thymine

Multiple Choice Multiple Answer
Question: The strategies for data reduction are :-
Correct Answer: Data aggregation, Dimension reduction, Numerosity reduction
Your Answer: Data aggregation, Dimension reduction, Numerosity reduction

Select The Blank
Question: ________ is an effective way to discover knowledge from huge amount of data.
Correct Answer: Visual data mining
Your Answer: Web mining

Select The Blank
Question: ________ is the process of grouping data into classes.
Correct Answer: Clustering
Your Answer: Classification

Multiple Choice Multiple Answer
Question: Data mining Functionalities are :-
Correct Answer: Characterization and Discrimination, Association Analysis, Cluster Analysis
Your Answer: Characterization and Discrimination, Association Analysis, Cluster Analysis

Select The Blank
Question: ________ is a summarization of general characteristics or features of a target class of data.
Correct Answer: Data Characterization
Your Answer: Data Characterization

Multiple Choice Single Answer
Question: Classification rules are extracted from
Correct Answer: Decision Tree
Your Answer: Decision Tree

Multiple Choice Single Answer
Question: Which of the following inheritance is supported by Object oriented databases?
Correct Answer: Multiple Inheritance
Your Answer: Single Inheritance

Select The Blank
Question: For decision making process ________ process which considers finding only interesting patterns is used.
Correct Answer: Microeconomic view
Your Answer: Pattern discovery

Match The Following
Question | Correct Answer | Your Answer
Initial load of data warehouse | 'as-is' data capture | 'as-is' data capture
Static data | Capture of data in given point of time | Capture of data in given point of time
Data revision | Incremental data capture | Incremental data capture
Incremental data | Deferred data capture | Deferred data capture

True/False
Question: Business metadata is like a roadmap or easy to use information directory showing contents and how to get there.
Correct Answer: True
Your Answer: True

True/False
Question: Data in data warehouse cuts across application.
Correct Answer: True
Your Answer: True

True/False
Question: Remote deployment of desktop tools is usually faster.
Correct Answer: True
Your Answer: False

Multiple Choice Single Answer
Question: Effect of one attribute value on a given class is independent of values of other attributes is called
Correct Answer: Value independence
Your Answer: Value independence


LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Multiple AnswerQuestion: Building blocks of Data Warehouse are :-Correct Answer: Management and Control , Source Data , Data StagingYour Answer: Management and Control , Source Data , Data Staging

Multiple Choice Single AnswerQuestion: Substantial portion of available information is stored in :-Correct Answer: Text dataYour Answer: Object oriented database

True/FalseQuestion: The data Warehouse is query-centric.Correct Answer: TrueYour Answer: True

True/FalseQuestion: Data mining is a piece of integrated solutions.Correct Answer: TrueYour Answer: True

Multiple Choice Single AnswerQuestion: Which of the following data capture method of data abstraction is time consuming?Correct Answer: Capture by comparing filesYour Answer: Capture by comparing files

Select The BlankQuestion: ________ does not handle categorical attributes.Correct Answer: CUREYour Answer: CURE

True/FalseQuestion: In the data acquisition area, the data flow begins at the data sources and pauses at staging area.Correct Answer: TrueYour Answer: True

Multiple Choice Single AnswerQuestion: Association rules mining is based on :-Correct Answer: Clustering and Employing rules for classificationYour Answer: Clustering and Employing rules for classification

True/FalseQuestion: In physical design of warehouse, developing standard ensures consistency across the various areas.Correct Answer: TrueYour Answer: True

Multiple Choice Single AnswerQuestion: Bayes Theorem is :-Correct Answer: P(H|X)=P(X|H)(P)/P(X)Your Answer: P(H|X)=P(X|H)(P)/P(X)

Page 133 of 141

Page 134: SCDL - Data Mining

SCDL – 4th Semester – Data Mining

Select The Blank
Question: Indexed ________ engines search, index web pages, and build huge keyword-based indices which help to search sets of web pages containing certain keywords.
Correct Answer: Web Search
Your Answer: Web Search

Multiple Choice Single Answer
Question: Data matrix is :-
Correct Answer: Object by variable structure
Your Answer: Object by variable structure

Multiple Choice Single Answer
Question: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-
Correct Answer: Huge size of data
Your Answer: Huge size of data

Multiple Choice Single Answer
Question: Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer: Nominal variable
Your Answer: Nominal variable
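
To make the simple matching approach concrete, here is a small Python sketch. It assumes the usual textbook definition d(i, j) = (p - m) / p, where p is the number of nominal variables and m is the number of variables on which the two objects agree. The function name and the sample objects are hypothetical.

```python
def simple_matching_dissimilarity(obj_i, obj_j):
    """Dissimilarity between two objects described by nominal variables.

    d(i, j) = (p - m) / p, with p = number of variables and
    m = number of variables on which both objects take the same value.
    """
    if len(obj_i) != len(obj_j):
        raise ValueError("objects must have the same number of variables")
    p = len(obj_i)
    if p == 0:
        raise ValueError("objects must have at least one variable")
    m = sum(1 for a, b in zip(obj_i, obj_j) if a == b)
    return (p - m) / p


# Hypothetical objects with three nominal variables: colour, size, shape.
first = ("red", "small", "round")
second = ("red", "large", "round")

# The objects differ on one of three variables.
print(simple_matching_dissimilarity(first, second))
```

Identical objects get a dissimilarity of 0 and objects that disagree on every variable get 1.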

Multiple Choice Multiple Answer
Question: Clustering techniques are organised into the following categories :-
Correct Answer: Partitioning, Density Based, Grid Based
Your Answer: Partitioning, Density Based, Grid Based

Select The Blank
Question: Most of the warehouses employ ________ database Management System.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Single Answer
Question: Data cleansing effort can begin with :-
Correct Answer: High priority data
Your Answer: High priority data

True/False
Question: Sequential pattern analysis and similarity search techniques have been developed in data mining.
Correct Answer: True
Your Answer: True

Match The Following
Question | Correct Answer | Your Answer
Load Utility | High performance data loading, recovery | High performance data loading, recovery
Query Governor | Abort runaway query | Abort runaway query
Query Optimizer | Parsing, optimizing query | Parsing, optimizing query
Query Management | Balancing extraction of query | Balancing extraction of query

Multiple Choice Multiple Answer
Question: Distinguishing characteristics of data warehouse architecture are :-
Correct Answer: Different Objective Scope, Data Content, Flexible and Dynamic
Your Answer: Different Objective Scope, Data Content, Flexible and Dynamic


Multiple Choice Single Answer
Question: Which type of integrity constraint forces the establishment of parent-child relationship?
Correct Answer: Referential integrity
Your Answer: Referential integrity

Select The Blank
Question: An information measure called ________ can be used to recursively partition the values of a numeric attribute.
Correct Answer: Entropy
Your Answer: Entropy
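
The idea behind the entropy question above can be sketched in Python as follows. This is a simplified illustration, not the course's algorithm: it picks the cut point that minimises the weighted class entropy of the two resulting intervals and then recurses on each interval. The fixed `max_depth` stopping rule, the function names, and the sample data are all assumptions made for the example.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def best_split(values, labels):
    """Return (threshold, weighted_entropy) of the best binary cut, or (None, inf)."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best = (None, float("inf"))
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no cut point between equal values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        weighted = len(left) / n * entropy(left) + len(right) / n * entropy(right)
        if weighted < best[1]:
            best = (threshold, weighted)
    return best


def discretize(values, labels, max_depth=2):
    """Recursively partition a numeric attribute; return the sorted cut points."""
    if max_depth == 0 or len(set(labels)) <= 1:
        return []  # stop when the interval is pure or the depth budget is spent
    threshold, _ = best_split(values, labels)
    if threshold is None:
        return []
    cuts = [threshold]
    for keep in (lambda v: v <= threshold, lambda v: v > threshold):
        part = [(v, lab) for v, lab in zip(values, labels) if keep(v)]
        cuts += discretize([v for v, _ in part], [lab for _, lab in part], max_depth - 1)
    return sorted(cuts)


# Hypothetical numeric attribute with two classes, "a" and "b".
ages = [1, 2, 3, 10, 11, 12, 20, 21]
classes = ["a", "a", "a", "b", "b", "b", "a", "a"]
print(discretize(ages, classes))
```

On this sample the first cut separates the low "a" values from the rest, and the recursion then separates the "b" interval from the high "a" values.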

True/False
Question: Metadata is a building block of data warehouse.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: In which of the following types of mining are frequently occurring patterns related to time and sequence mined?
Correct Answer: Sequential pattern mining
Your Answer: Time series data mining

Select The Blank
Question: ________ is the time consuming and less feasible approach for filling missing values.
Correct Answer: Filling missing values manually
Your Answer: Filling missing values manually

Multiple Choice Multiple Answer
Question: Classification and Prediction have following applications :-
Correct Answer: Credit approval, Medical Diagnosis, Performance Prediction
Your Answer: Credit approval, Medical Diagnosis, Performance Prediction

Multiple Choice Multiple Answer
Question: Data processing techniques are :-
Correct Answer: Cleansing, Integration, Transformation
Your Answer: Cleansing, Integration, Transformation

True/False
Question: Data in warehouse is primarily for query.
Correct Answer: True
Your Answer: True

Multiple Choice Single Answer
Question: Data reduction obtains a reduced representation of data set that is :-
Correct Answer: Much smaller
Your Answer: Much smaller

Multiple Choice Single Answer
Question: Which of the following type executes query operations in pipeline manner?
Correct Answer: Vertical parallelism
Your Answer: Vertical parallelism

Multiple Choice Single Answer
Question: User gets an enterprise wide view of information from the data warehouse due to :-
Correct Answer: Improved productivity
Your Answer: Newer opportunity

Select The Blank
Question: ________ databases are one of the most popularly available and rich information repositories.
Correct Answer: Relational
Your Answer: Relational

Multiple Choice Single Answer
Question: Which database type stores a large amount of space-related data?
Correct Answer: Spatial
Your Answer: Spatial

Multiple Choice Multiple Answer
Question: DNA sequences are comprised of :-
Correct Answer: Adenine, Guanine, Thymine
Your Answer: Adenine, Guanine, Thymine

Multiple Choice Multiple Answer
Question: The strategies for data reduction are :-
Correct Answer: Data aggregation, Dimension reduction, Numerosity reduction
Your Answer: Data aggregation, Dimension reduction, Numerosity reduction

Select The Blank
Question: ________ is an effective way to discover knowledge from huge amount of data.
Correct Answer: Visual data mining
Your Answer: Web mining

Select The Blank
Question: ________ is the process of grouping data into classes.
Correct Answer: Clustering
Your Answer: Classification

Multiple Choice Multiple Answer
Question: Data mining functionalities are :-
Correct Answer: Characterization and Discrimination, Association Analysis, Cluster Analysis
Your Answer: Characterization and Discrimination, Association Analysis, Cluster Analysis

Select The Blank
Question: ________ is a summarization of general characteristics or features of a target class of data.
Correct Answer: Data Characterization
Your Answer: Data Characterization

Multiple Choice Single Answer
Question: Classification rules are extracted from
Correct Answer: Decision Tree
Your Answer: Decision Tree
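
The way classification rules fall out of a decision tree can be sketched as below: each root-to-leaf path becomes one IF-THEN rule. The nested-dictionary tree representation, the attribute names, and the function `extract_rules` are hypothetical choices for this illustration.

```python
def extract_rules(tree, conditions=()):
    """Turn every root-to-leaf path of a decision tree into an IF-THEN rule.

    An internal node is a dict {attribute: {value: subtree, ...}};
    a leaf is any non-dict value holding the class label.
    """
    if not isinstance(tree, dict):
        antecedent = " AND ".join(conditions) or "TRUE"
        return [f"IF {antecedent} THEN class = {tree}"]
    rules = []
    attribute, branches = next(iter(tree.items()))
    for value, subtree in branches.items():
        rules += extract_rules(subtree, conditions + (f"{attribute} = {value}",))
    return rules


# Hypothetical tree predicting whether a customer buys a product.
tree = {
    "age": {
        "youth": {"student": {"no": "no", "yes": "yes"}},
        "middle_aged": "yes",
        "senior": {"credit": {"fair": "yes", "excellent": "no"}},
    }
}

for rule in extract_rules(tree):
    print(rule)
```

The tree above has five leaves, so the sketch produces five rules, one per leaf.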

Multiple Choice Single Answer
Question: Which of the following types of inheritance is supported by object oriented databases?
Correct Answer: Multiple Inheritance
Your Answer: Single Inheritance

Select The Blank
Question: For decision making process ________ process which considers finding only interesting patterns is used.
Correct Answer: Microeconomic view
Your Answer: Pattern discovery

Match The Following
Question | Correct Answer | Your Answer
Initial load of data warehouse | "as-is" data capture | "as-is" data capture
Static data | Capture of data in given point of time | Capture of data in given point of time
Data revision | Incremental data capture | Incremental data capture
Incremental data capture | Deferred data capture | Deferred data capture

True/False
Question: Business metadata is like a roadmap or easy to use information directory showing contents and how to get there.
Correct Answer: True
Your Answer: True

True/False
Question: Data in data warehouse cuts across applications.
Correct Answer: True
Your Answer: True

True/False
Question: Remote deployment of desktop tools is usually faster.
Correct Answer: True
Your Answer: False

Multiple Choice Single Answer
Question: The effect of one attribute value on a given class being independent of the values of the other attributes is called
Correct Answer: Value independence
Your Answer: Value independence

Unattended Questions

Match The Following
Item | Match | Option
1. Data Quality tool | 2 | 1. Assist data warehouse administration
2. OLAP tools | 6 | 2. Locating data errors
3. Alert system tool | 5 | 3. Transparent access to source system
4. Middleware & connectivity tool | 3 | 4. Track on number of queries
 | | 5. Users attention on exceptions
 | | 6. Channel queries

Select The Blank
________ clustering method follows statistical and neural network approach.

True/False
Data cleansing means removing noisy and inconsistent data. TRUE

Match The Following
Item | Match | Option
1. Non volatile data | 2 | 1. External data
2. Data granularity | 4 | 2. Query and analysis
3. Data from external source | 1 | 3. Production data
4. Disparate data | 3 | 4. Level of detail
 | | 5. Archive data
 | | 6. Internal data

Match The Following
Item | Match | Option
1. Data storage | 1 | 1. Data management
2. Data staging | 2 | 2. Workbench for data
3. Data Mining | 5 | 3. Details of summary
4. Metadata | 6 | 4. Private spreadsheet data
 | | 5. Knowledge discovery
 | | 6. Roadmap for user

True/False
The structure that brings all the components together is known as Architecture. TRUE/FALSE

Match The Following
Item | Match | Option
1. Data modeling tool | 1 | 1. Reverse Engineering capabilities
2. Data Extraction tool | 4 | 2. Default values
3. Data transformation tool | 2 | 3. Formulating and running queries
4. Data loading tool | 5 | 4. Bulk extraction for full refresh
 | | 5. Primary key generation
 | | 6. Replication

Match The Following
Item | Match | Option
1. Static data | | 1. Immediate data capture
2. Data revision | | 2. Capture of data in given point of time
3. Incremental data capture | | 3. Incremental data capture
4. Initial load of data warehouse | | 4. Value of attribute at specific time
 | | 5. "as-is" data capture
 | | 6. Deferred data capture

Match The Following
Item | Match | Option
1. Initial Load | 4 | 1. New record supersedes
2. Incremental Load | 6 | 2. Offline data warehouse
3. Load Image | 5 | 3. Applying data
4. Constructive merge | 1 | 4. Populating data warehouse table first time
 | | 5. To correspond to target files
 | | 6. Applying ongoing changes

Match The Following
Item | Match | Option
1. Identify source application | 2 | 1. Method of extraction
2. Denote time window | 5 | 2. Source identification
3. Handling unextractable input records | 6 | 3. Extraction
4. Extraction is manual/Tool based | 1 | 4. Job sequencing
 | | 5. Time window
 | | 6. Exception handling

Multiple Choice Multiple Answer
7. The main categories of Metadata in warehouse are :-
a) Operational
b) Execution and Transformation Metadata
c) Extraction and transformation Metadata
d) End user Metadata

Multiple Choice Multiple Answer
20. The ways of Intra query parallelization are :-
a) Horizontal parallelization
b) Vertical Parallelization
c) Hybrid parallelization
d) Homogeneous parallelization

Multiple Choice Single Answer
30. Sequence of physical design of data warehouse is :-
a) Develop standards--Create aggregate plans--determine data partitioning scheme--establish clustering option--prepare indexing strategy--complete physical model
b) Develop standards--determine data partitioning scheme--Create aggregate plans--establish clustering option--prepare indexing strategy--complete physical model
c) Develop standards--prepare indexing strategy--Create aggregate plans--determine data partitioning scheme--establish clustering option--complete physical model
d) Develop standards--Create aggregate plans--establish clustering option--determine data partitioning scheme--prepare indexing strategy--complete physical model

Multiple Choice Single Answer
44. Data migration affects performance requiring multiple blocks to be read which can be adjusted by :-
a) Block percent free
b) Block percent used
c) Block percent occupied
d) Block percent vacant

True/False
48. In Linear regression data are modeled to fit a straight line.
True
False
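
To illustrate what fitting a straight line means in the linear regression question above, here is a minimal least-squares sketch in plain Python. It assumes the model y = slope * x + intercept and ordinary least squares as the fitting criterion; the function name `fit_line` and the data points are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on the line y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]

slope, intercept = fit_line(xs, ys)
print(slope, intercept)  # prints 2.0 1.0
```

With noisy data the same function returns the line that minimises the sum of squared vertical distances to the points.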

Select The Blank
16. The technique of _____________ enables concurrent input/output operations and improves file's access performance substantially.
a) Data migration
b) File striping
c) Block utilization
d) Dynamic extension

Match The Following
Item | Match | Option
1. Data visualization | | 1. Visual display
2. Data mining result visualization | | 2. Presentation of knowledge
3. Data mining process visualization | | 3. Data mining in visual format
4. Interactive visual data mining | | 4. Visualization tool
 | | 5. Graphical display
 | | 6. Audio signal
