Date post: | 13-Nov-2014 |
Category: |
Documents |
Upload: | api-3733148 |
View: | 139 times |
Download: | 2 times |
SCDL – 4th Semester – Data Mining
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Select The Blank Question Semantic integration of ________ genome database is the important task of DNA
analysis. Correct Answer
Heterogeneous and distributed
Your Answer Heterogeneous and distributed
Multiple Choice Single Answer Question Main advantage of following which method is it's fast processing?
Correct Answer
Grid based
Your Answer Partioning based
Select The Blank Question With the widespread option of ________ real-time connection is viable for data
warehouse. Correct Answer
TCP/IP
Your Answer HTTP
Select The Blank Question ________ are responsible for running queries and reports against data warehouse tables.
Correct Answer
End users
Your Answer End users
Multiple Choice Multiple Answer Question Advantages of Wavelet transformation for clustering are :-
Correct Answer
Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast
Your Answer Unsupervised clustering , Clustering is fast , Decomposition of cluster for accuracy
Multiple Choice Single Answer Question Query tool is meant for :-
Correct Answer
Data acquisition
Your Answer Information delivery
Multiple Choice Single Answer Question Which of the following function involves data cleaning, data standardizing and
summarizing? Correct Transforming data
Page 1 of 141
SCDL – 4th Semester – Data Mining
Answer Your Answer Storing data
Multiple Choice Multiple Answer Question Which of the following clustering analysis method uses multiresolution approach?
Correct Answer
STING , Wave Cluster
Your Answer STING , Wave Cluster
Multiple Choice Single Answer Question Which type of following clustering computes augumented cluster ordering?
Correct Answer
OPTICS
Your Answer CLQUE
Multiple Choice Multiple Answer Question Time variant nature of the data in data warehouse :-
Correct Answer
Allows for analysis of the past , Relate information to the present , Enables forecasts for the future
Your Answer Allows for analysis of the past , Relate information to the present , Enables forecasts for the future
True/False Question The Structure that brings all the components together is known as Architecture.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Data compression is to compress the given data by encoding in terms of :-
Correct Answer
Association rule , Decision tree , Cluster
Your Answer Bytes , Cluster
Multiple Choice Multiple Answer Question The different definitions of metadata are :-
Correct Answer
Data about data , Catalog of data , Data warehouse roadmap
Your Answer Data about data , Catalog of data , Data warehouse roadmap
True/False Question A distinct feature of DB Miner is its data cube based online analytical mining.
Page 2 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
True
Your Answer False
Multiple Choice Single Answer Question Association rules mining is based on :-
Correct Answer
Clustering and Employing rules for classification
Your Answer Clustering and Employing rules for classification
True/False Question A distinguishing feature of Clementine is its object oriented extended module interface.
Correct Answer
True
Your Answer True
Select The Blank Question ________ includes Normalization and Aggregation as data preprocessing procedures.
Correct Answer
Data transformation
Your Answer Data transformation
True/False Question To remove noise from data is called as Smoothing.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Data matrix is :-
Correct Answer
Object by variable structure
Your Answer Object by variable structure
True/False Question Data updates are common place in an operational database.
Correct Answer
True
Your Answer True
True/False Question In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by
Page 3 of 141
SCDL – 4th Semester – Data Mining
rectangles Correct Answer
False
Your Answer True
True/False Question From a Dataware house perspective data mining canbe viewed as an advanced stage of
Online Analytical Programming. Correct Answer
True
Your Answer True
Match The FollowingQuestion Correct Answer Your Answer
Disparate data Production data Query and analysis
Non volatile data Query and analysis Archive data
Data granularity Level of detail Level of detail
Data from external source External data External data
Multiple Choice Multiple Answer Question In physical design of data warehouse administration provides features like :-
Correct Answer
Avoiding reorganizing of tables , Support backup and recovery , Query processing
Your Answer Support backup and recovery , Manage store area , Query processing
Select The Blank Question ________ is the user who has system access privileges but no database administration
privileges as well as not for table and views. Correct Answer
Network administrator
Your Answer End user
Multiple Choice Multiple Answer Question Data mining Functionalities are :-
Correct Answer
Charactrization and Discrimination , Association Analysis , Cluster Analysis
Your Answer Association Analysis , Cluster Analysis , Time series Data Analysis
Select The Blank Question ________ dimension of database in which primitive level data are spatial but
generalization becomes non spatial. Correct Answer
Spatial to non spatial
Page 4 of 141
SCDL – 4th Semester – Data Mining
Your Answer Spatial to non spatial
Multiple Choice Multiple Answer Question Source Data Component may be grouped into following categories :-
Correct Answer
Production Data , Internal External Data
Your Answer Internal External Data , Analyzed data , Non Analyzed data
Select The Blank Question ________ technique is the statistical technique for analyzing data.
Correct Answer
Time series
Your Answer Time series
Multiple Choice Multiple Answer Question The strategies for data reduction are :-
Correct Answer
Data aggregation , Dimension reduction , Numerocity reduction
Your Answer Data aggregation , Dimension reduction , Numerocity reduction
Multiple Choice Single Answer Question Classification rules are extracted from
Correct Answer
Decision Tree
Your Answer Root-Node
Match The FollowingQuestion Correct Answer Your Answer
Data Mining Knowledge discovery Knowledge discovery
Metadata Roadmap for user Details of summary
Data storage Data management Data management
Data staging Workbench for data Workbench for data
True/False Question Data cube stores multidimensional aggregate information.
Correct Answer
True
Your Answer True
Page 5 of 141
SCDL – 4th Semester – Data Mining
Select The Blank Question ________ is the method used to predict the value of response variable from one to more
variables. Correct Answer
Regression
Your Answer Regression
Select The Blank Question ________ databases are one of the most poplularly available and rich information
repositories. Correct Answer
Relational
Your Answer Object oriented
True/False Question COBWEB is a method of incremental conceptual clustering.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Many methods for data smoothing are also methods for data reduction involving :-
Correct Answer
Discretization
Your Answer Clustering
Multiple Choice Single Answer Question Dimensionality reduction reduces the data set size by removing :-
Correct Answer
Irrelevant attributes
Your Answer Irrelevant attributes
Multiple Choice Single Answer Question Effect of one attibute value on a given class is independent of values of other attibute is
called Correct Answer
Value independence
Your Answer Class Conditional independence
Multiple Choice Single Answer Question Which from the following are special programs that are stored on database and fired when
certain predefined action occurs? Correct Answer
Triggers
Your Answer Triggers
Page 6 of 141
SCDL – 4th Semester – Data Mining
Select The Blank Question A web server usually registers ________ entry for every access of a web page
Correct Answer
Weblog
Your Answer Log
Multiple Choice Single Answer Question Bayes Theorem is :-
Correct Answer
P(H|X)=P(X|H)(P)/P(X)
Your Answer P(H|X)=P(X|H)(P)/P(X)
True/False Question Visual display can help user to give clear impression and overview of the data
characteristics in a database. Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Which of the following is based on set of density distribution function clustering?
Correct Answer
DBSCAN
Your Answer DBSCAN
Multiple Choice Multiple Answer Question Metadata in a data warehouse falls into following categories :-
Correct Answer
Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Your Answer Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Multiple Choice Multiple Answer Question Knowledge discovery process includes :-
Correct Answer
Data Cleaning , Data Intergration , Data Selectin
Your Answer Data Cleaning , Data Intergration , Data Selectin
Select The Blank Question Human being have around ________ gene.
Correct Answer
100000
Page 7 of 141
SCDL – 4th Semester – Data Mining
Your Answer 1000000
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
True/False Question A distinguishing feature of Clementine is its object oriented extended module interface.
Correct Answer
True
Your Answer True
Select The Blank Question Creating ________is violation of Normalization principles.
Correct Answer
Array
Your Answer Array
True/False Question Data Mining refers to extracting knowledge from larger amount of data.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Which of the following of Grid based clustering method explorates statistical information?
Correct Answer
STING
Your Answer CLIQUE
Multiple Choice Multiple Answer Question The different definitions of metadata are :-
Correct Answer
Data about data , Catalog of data , Data warehouse roadmap
Your Answer Catalog of data , Data warehouse roadmap , Brain of data
Select The Blank Question In ________ type smoothing, minimum and maximum values in given bin are identified as
bin boundaries. Correct Answer
Smoothing by bin boundaries
Your Answer Smoothing by medians
Page 8 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question Query tool is meant for :-
Correct Answer
Data acquisition
Your Answer Information delivery
True/False Question Data cube stores multidimensional aggregate information.
Correct Answer
True
Your Answer False
Select The Blank Question ________ can store aggregate and detail data at varying levels of resolution or
abstraction. Correct Answer
Index tree
Your Answer R-Tree
Select The Blank Question ________ is the platform for complex data transformation for the purpose of cleanse it
Correct Answer
Separate optimal Platform
Your Answer Legacy platform
Multiple Choice Multiple Answer Question SMP provides the features like :-
Correct Answer
Each node has access to common set of disks , Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus
Your Answer Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus , It is cluster of nodes
Multiple Choice Single Answer Question In intermediate data extraction data capture through transaction log uses transaction
from :- Correct Answer
Recovery from failure
Your Answer Recovery from failure
Multiple Choice Multiple Answer Question In data storage area , DBA uses metadata for processes of :-
Correct Answer
Backup , Recovery , Tuning Database
Your Answer Backup , Recovery , Management
Page 9 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple Answer Question Foundation infrastructure of warehouse includes many elements such as :-
Correct Answer
Basic Computing platform , Hardware and operating system , DBMS and Query
Your Answer Basic Computing platform , DBMS and Query , Query processing components
Match The FollowingQuestion Correct Answer Your Answer
Data producer Responsible for data quality Responsible for data quality
Domain values Prevalent problem Foreign key preserved
Update security Prevention of unauthorized updates
Prevalent problem
Referential integrity Foreign key preserved Prevention of unauthorized updates
Select The Blank Question ________ is density based clustering method which computes on augumented clustering
ordering for automic ordering for automatic and interactive cluster analysis Correct Answer
DBSCAN
Your Answer DBSCAN
Multiple Choice Multiple Answer Question Data compression is to compress the given data by encoding in terms of :-
Correct Answer
Association rule , Decision tree , Cluster
Your Answer Bytes , Association rule , Decision tree
Multiple Choice Multiple Answer Question Knowledge discovery process includes :-
Correct Answer
Data Cleaning , Data Intergration , Data Selectin
Your Answer Data Cleaning , Data Intergration , Data movememnt
Multiple Choice Multiple Answer Question Building blocks of Data Warehouse are :-
Correct Answer
Source Data , Data Staging , Management and Control
Your Answer Data Staging , Data Manager , Management and Control
True/False Question All data extraction, transformation, integration and staging jobs run on selected hardware
under chosen operating system.
Page 10 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Real world databases are highly susceptible to noisy, missing and inconsistent data due
to :- Correct Answer
Huge size of data
Your Answer Huge size of data
Match The FollowingQuestion Correct Answer Your Answer
Clustering tool To group different cases To detect unusual attribute
Data visualization tool Transaction activity using graph To filter unrelated attributes
Linkage analysis tool To identify links To group different cases
Classification tool To filter unrelated attributes To identify links
Multiple Choice Multiple Answer Question Generalized linear model includes :-
Correct Answer
Logistic regression , Poisson regression
Your Answer Poisson regression , Linear regression , Polynomial Regression
True/False Question Metadata acts like a nerve center.
Correct Answer
True
Your Answer False
Multiple Choice Single Answer Question OLAP is used for :-
Correct Answer
Online Analytical Processing
Your Answer Online Application Processing
Select The Blank Question ________ includes Normalization and Aggregation as data preprocessing procedures.
Correct Answer
Data transformation
Page 11 of 141
SCDL – 4th Semester – Data Mining
Your Answer Data integration
Multiple Choice Multiple Answer Question The dimensions of spatial data cube are :-
Correct Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Multiple Choice Single Answer Question Maintenance of cache consistency is the limitation of :-
Correct Answer
MPP
Your Answer NUMA
Select The Blank Question In ________ duplicate sub trees exist within the tree.
Correct Answer
Repetition
Your Answer Replication
Select The Blank Question Indexed ________ engines search index,web pages and build huge keyword based
indices which help to search sets of web pages containing certain keywords Correct Answer
Web Search
Your Answer Web Search
Multiple Choice Single Answer Question Redundancies can be deleted by :-
Correct Answer
Co-relational analysis
Your Answer Coherent analysis
True/False Question To detect money laundering and other financial crimes, it is important to integrate
information for multiple databases. Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Common areas of application for mixed effect model includes :-
Page 12 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
Multiple data , Repeated measures data , Block designs
Your Answer Multiple data , Dimensional data , Block designs
Select The Blank Question In data ________, data encoding or transformations are applied to obtain reduced or
compressed representation. Correct Answer
Compression
Your Answer Compression
Multiple Choice Single Answer Question Grouped data can be analyzed with the technique :-
Correct Answer
Mixed effect model
Your Answer Factor analysis
Select The Blank Question ________ is the navigational map of data warehouse.
Correct Answer
End user Metadata
Your Answer Extraction Metadata
Multiple Choice Multiple Answer Question Business metadata is useful for :-
Correct Answer
Providing support to end users , For external view of data , Provides technical support to search data
Your Answer Providing support to end users , For external view of data , Provides technical support to search data
True/False Question The elements of warehouse infrastructure are classified into operational and physical
infrastructure. Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Data reduction by volume can be used for data representation using which type of
reduction? Correct Answer
Numerosity reduction
Your Answer Histograms
True/False Question Descriptive mining takes perform ingerence on current data which predictive mining
Page 13 of 141
SCDL – 4th Semester – Data Mining
characterize the general properties of data in database Correct Answer
False
Your Answer False
Multiple Choice Single Answer Question Classification rules are extracted from
Correct Answer
Decision Tree
Your Answer Decision Tree
Multiple Choice Single Answer Question Queries run faster to find exact match using which type of indexing?
Correct Answer
Clustered index
Your Answer Sequential index
Multiple Choice Single Answer Question Data can be smoothed by filling the data to function such as :-
Correct Answer
Regression
Your Answer Clustering
True/False Question Data classification is two step process in which first step includes classfication of model
and in second step model describes set of data. Correct Answer
False
Your Answer True
Select The Blank Question In data warehouse architecture, the ________ component interleaves with and connects
other components. Correct Answer
Metadata
Your Answer Metadata
True/False Question Legacy data resides on Hierarchical or Network database.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer
Page 14 of 141
SCDL – 4th Semester – Data Mining
Question Metadata in a data warehouse falls into following categories :-
Correct Answer
Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Your Answer Operational Metadata , Extraction and Transformation metadata , End-user Metadata
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Multiple Answer Question Metadata is essential for IT for :-
Correct Answer
Source data structures , Data summarization
Your Answer Web enabling , Source data structures , Data summarization
Multiple Choice Multiple Answer Question Financial data called for banking and financial industry are often relatively :-
Correct Answer
Complete , Reliable , High Quality
Your Answer Complete , Reliable , Correct
Select The Blank Question ________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer Cluster
Match The FollowingQuestion Correct Answer Your Answer
Operating systems compatibility Security, reliability, availability Security, reliability, availability
Data Acquisition Data Extraction, Transformation, clensing, integration
Data Extraction, Transformation, clensing, integration
Data Storage Data loading , Archiving Data loading , Archiving
Information Delivery Report generation, query processing and complex analysis
Report generation, query processing and complex analysis
True/False Question A cluster is a collection of similar data objects in same cluster and disimilar to objects in
another cluster. Correct Answer
True
Your Answer True
Page 15 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question Which of the following method creates copies of data in distributed environment?
Correct Answer
Replication
Your Answer Replication
True/False Question Data cube stores multidimensional aggregate information.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Capture at data source and that's why this method is quite reliable :-
Correct Answer
Capture by database Triggers
Your Answer Capture in source application
True/False Question The Structure that brings all the components together is known as Architecture.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question For Banking and financial data which type of analysis is used?
Correct Answer
Multidimensional
Your Answer Relational
Multiple Choice Single Answer Question Which of the following methods for regression is used on sparse data :-
Correct Answer
Regression and log-linear model
Your Answer Regression and transformation
Multiple Choice Multiple Answer Question Following data transformation methods are used in analysis of time series data :-
Correct Answer
Scaling , Normalization , Windows Stiching
Your Answer Scaling , Normalization , Windows Stiching
Page 16 of 141
SCDL – 4th Semester – Data Mining
Select The Blank Question ________ function of data staging component involves many forms of combining pieces
of data from different sources. Correct Answer
Data Transformation
Your Answer Data Loading
Multiple Choice Single Answer Question Real world databases are highly susceptible to noisy, missing and inconsistent data due
to :- Correct Answer
Huge size of data
Your Answer Relational data
Select The Blank Question Creating ________is violation of Normalization principles.
Correct Answer
Array
Your Answer Structure
Multiple Choice Multiple Answer Question The tools of metadata falls in following categories :-
Correct Answer
Development tools for IT professional , Information access tool for End user
Your Answer Access tool , Development tools for IT professional , Information access tool for End user
True/False Question Architecture comes first, tools follows it.
Correct Answer
True
Your Answer True
Select The Blank Question ________ is an alternative aggolomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer ROKE
Multiple Choice Single Answer Question Which of the following function involves data cleaning, data standardizing and
summarizing? Correct Answer
Transforming data
Page 17 of 141
SCDL – 4th Semester – Data Mining
Your Answer Transforming data
True/False Question In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by
rectangles Correct Answer
False
Your Answer True
Multiple Choice Multiple Answer Question In data storage area , DBA uses metadata for processes of :-
Correct Answer
Backup , Recovery , Tuning Database
Your Answer Backup , Recovery , Tuning Database
Multiple Choice Single Answer Question Bayes Theorem is :-
Correct Answer
P(H|X)=P(X|H)(P)/P(X)
Your Answer P(X|H)=P(X|H)(PH)/P(X)
True/False Question Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer True
Select The Blank Question ________ are responsible for running queries and reports against data warehouse tables.
Correct Answer
End users
Your Answer End users
Select The Blank Question A web server usually registers ________ entry for every access of a web page
Correct Answer
Weblog
Your Answer Web site
Multiple Choice Multiple Answer Question Data processing techniques are :-
Page 18 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
Cleansing , Integration , Transformation
Your Answer Cleansing , Transformation , Collection
Multiple Choice Single Answer Question Data can be smoothed by filling the data to function such as :-
Correct Answer
Regression
Your Answer Clustering
Multiple Choice Single Answer Question Deviation based outlier detection identifes outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer Examining objects in group
Multiple Choice Single Answer Question Data partitioning, data clustering are the techniques for :-
Correct Answer
Performance enhancement
Your Answer Performance enhancement
Multiple Choice Multiple Answer Question Following are the issues to consider during data integration :-
Correct Answer
Schema integration , Redundancy , Detection and resolution of data values
Your Answer Schema integration , Redundancy , Inconsistency
True/False Question Management architectural component manages and controls data acquisition functions.
Correct Answer
True
Your Answer True
Match The FollowingQuestion Correct Answer Your Answer
Data loading tool Primary key generation Primary key generation
Data modeling tool Reverse Engineering capabilities Reverse Engineering capabilities
Data Extraction tool Bulk extraction for full refresh Bulk extraction for full refresh
Page 19 of 141
SCDL – 4th Semester – Data Mining
Data transformation tool Default values Replication
Multiple Choice Multiple Answer Question DNA sequences are comprised of :-
Correct Answer
Adenine , Gaunine , Thymine
Your Answer Adenine , Cytocine , Gaunine
Multiple Choice Single Answer Question Large number of indexes affects the loading process because :-
Correct Answer
Indexes are created for new records
Your Answer Indexes are created for old records
Select The Blank Question The technique of ________ enables concurrent input/output operations and improves
file's access performance substantially. Correct Answer
File striping
Your Answer Data migration
Multiple Choice Multiple Answer Question Warehouse Operational infrastructure is to support each architecture component consists
of :- Correct Answer
People , Procedures , Management software
Your Answer People , Procedures , Management software
True/False Question In Purning method, postpruning requires more computation than prepruning yet generally
leads to more reliable. Correct Answer
True
Your Answer False
Select The Blank Question ________ technique can be used to reduce the number of values for a given continuous
attribute by dividing range of attributes into interval. Correct Answer
Descretization
Your Answer Compression
True/False Question Data cubes created for varying levels of abstraction are referred as cuboids.
Page 20 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Which of the following approach requires more computation?
Correct Answer
Filter approach
Your Answer Filter approach
Select The Blank Question ________components consists all the different ways of making the information from the
data warehouse available to the user. Correct Answer
Information Delivery
Your Answer Metadata
Multiple Choice Multiple Answer Question Data transformation includes :-
Correct Answer
Smoothing , Aggregation , Generalization
Your Answer Smoothing , Aggregation
Select The Blank Question ________ databases are one of the most poplularly available and rich information
repositories. Correct Answer
Relational
Your Answer Relational
True/False Question In Linear regression data are modeled to fit a straight line.
Correct Answer
True
Your Answer True
Select The Blank Question ________ is the method used to predict the value of response variable from one to more
variables. Correct Answer
Regression
Your Answer Analysis of variance
Multiple Choice Multiple Answer
Page 21 of 141
SCDL – 4th Semester – Data Mining
Question Methods for outlier detection are categorised into following approaches :-
Correct Answer
Statistical , Distance based , Deviation based
Your Answer Statistical , Distance based , Deviation based
Multiple Choice Multiple Answer Question Data base miner provides multiple data mining algorithms including :-
Correct Answer
Discovery driven OLAP analysis , Association , Classification
Your Answer Discovery driven OLAP analysis , Association , Regression
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Single Answer Question Deviation based outlier detection identifes outliers by :-
Correct Answer Examining character of objects in groups
Your Answer Examining character of objects in groups
Select The Blank Question ________ component of warehouse is responsible for coordinating services and
activities within the data warehouse. Correct Answer Management and Control
Your Answer Management and Control
True/False Question Sequential pattern analysis and similarity search techniques have been developed in
data mining. Correct Answer True
Your Answer True
True/False Question A distinct feature of DB Miner is its data cube based online analytical mining.
Correct Answer True
Your Answer True
Select The Blank Question ________ is the user who has system access privileges but no database administration
privileges as well as not for table and views.
Page 22 of 141
SCDL – 4th Semester – Data Mining
Correct Answer Network administrator
Your Answer Network administrator
Select The Blank Question For operational system, the stored data contains ________values.
Correct Answer Current data
Your Answer Current data
True/False Question Intelligent miner is an IBM data mining product.
Correct Answer True
Your Answer True
Select The Blank Question The technique of ________ enables concurrent input/output operations and improves
file's access performance substantially. Correct Answer File striping
Your Answer File striping
Multiple Choice Multiple Answer Question SMP provides the features like :-
Correct Answer Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus , Each node has access to common set of disks
Your Answer Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus
Match The FollowingQuestion Correct Answer Your Answer
Incremental data capture Differed data capture Differed data capture
Initial load of data warehouse "as-is" data capture "as-is" data capture
Static data Capture of data in given point of time
Capture of data in given point of time
Data revision Incremental data capture Incremental data capture
True/False Question In Purning method, postpruning requires more computation than prepruning yet
generally leads to more reliable. Correct Answer True
Page 23 of 141
SCDL – 4th Semester – Data Mining
Your Answer False
True/False Question Data preprocessing is an important step in knowledge discovery process.
Correct Answer True
Your Answer True
Multiple Choice Multiple Answer Question The dimensions of spatial data cube are :-
Correct Answer Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer Non- spatial dimension , Spatial to non spatial , Spatial to spatial
True/False Question Data mining often requires data integration.
Correct Answer True
Your Answer True
Multiple Choice Multiple Answer Question In data storage area , DBA uses metadata for processes of :-
Correct Answer Backup , Recovery , Tuning Database
Your Answer Backup , Recovery , Tuning Database
Multiple Choice Single Answer Question Effect of one attibute value on a given class is independent of values of other attibute is
called Correct Answer Value independence
Your Answer Attirbute conditional independence
Select The Blank Question ________components consists all the different ways of making the information from the
data warehouse available to the user. Correct Answer Information Delivery
Your Answer Information Delivery
Multiple Choice Multiple Answer Question Data processing techniques are :-
Page 24 of 141
SCDL – 4th Semester – Data Mining
Correct Answer Cleansing , Integration , Transformation
Your Answer Integration , Transformation , Cleansing
Multiple Choice Single Answer Question Data matrix is :-
Correct Answer Object by variable structure
Your Answer Two mode matrix
Match The FollowingQuestion Correct Answer Your Answer
Information Delivery Report generation, query processing and complex analysis
Report generation, query processing and complex analysis
Operating systems compatibility Security, reliability, availability Security, reliability, availability
Data Acquisition Data Extraction, Transformation, clensing, integration
Data Extraction, Transformation, clensing, integration
Data Storage Data loading , Archiving Data loading , Archiving
Select The Blank Question In ________ type smoothing, minimum and maximum values in given bin are identified
as bin boundaries. Correct Answer Smoothing by bin boundaries
Your Answer Smoothing by bin boundaries
Multiple Choice Single Answer Question Data partitioning, data clustering are the techniques for :-
Correct Answer Performance enhancement
Your Answer Data extraction
Select The Blank Question ________ technique is the statistical technique for analyzing data.
Correct Answer Time series
Your Answer Survival analysis
Multiple Choice Single Answer Question Association rules mining is based on :-
Correct Answer Clustering and Employing rules for classification
Page 25 of 141
SCDL – 4th Semester – Data Mining
Your Answer Clustering and Employing rules for classification
Select The Blank Question Most of the warehouses employ ________ database Management System.
Correct Answer Relational
Your Answer Relational
True/False Question NUMA provides better scalability than SMP.
Correct Answer True
Your Answer True
Multiple Choice Single Answer Question Classification rules are extracted from
Correct Answer Decision Tree
Your Answer Decision Tree
Multiple Choice Single Answer Question Data migration affects performance requiring multiple blocks to be read which can be
adjusted by :- Correct Answer Block percent free
Your Answer Block percent free
Multiple Choice Multiple Answer Question Source Data Component may be grouped into following categories :-
Correct Answer Production Data , Internal External Data
Your Answer Production Data , Analyzed data , Non Analyzed data
Multiple Choice Single Answer Question Redundancies can be deleted by :-
Correct Answer Co-relational analysis
Your Answer Co-relational analysis
True/False Question A distinguishing feature of Clementine is its object oriented extended module interface.
Correct Answer True
Page 26 of 141
SCDL – 4th Semester – Data Mining
Your Answer True
Multiple Choice Multiple Answer Question The functions of data acquisition are :-
Correct Answer Data Transformation , Data Extraction
Your Answer Data Extraction , Data Transformation , Data cleansing
Multiple Choice Single Answer Question SMP stands for :-
Correct Answer Symmetric Multiprocessing
Your Answer Symmetric Multiprocessing
Multiple Choice Multiple Answer Question Mining values can be removed by :-
Correct Answer Filling values manually , Use of global constant , Use of attribute mean
Your Answer Filling values manually , Use of attribute mean
Multiple Choice Single Answer Question Which from the following is used for classification and prediction?
Correct Answer Regression trees
Your Answer Regression
Multiple Choice Multiple Answer Question Before moving data to data warehouse is has to go through :-
Correct Answer Transformation , Integration , Consolidation
Your Answer Transformation , Integration , Consolidation
Select The Blank Question ________ is the navigational map of data warehouse.
Correct Answer End user Metadata
Your Answer Operational Metadata
True/False Question Architecture comes first, tools follows it.
Correct Answer True
Page 27 of 141
SCDL – 4th Semester – Data Mining
Your Answer True
Multiple Choice Single Answer Question Which technique analyze experimental data?
Correct Answer Analysis of variance
Your Answer Regression
Multiple Choice Multiple Answer Question The need for metadata is for :-
Correct Answer Using data warehouse , Building data warehouse , Administration of warehouse
Your Answer Building data warehouse , Administration of warehouse
Multiple Choice Single Answer Question Development and deployment of your data warehouse is joint effort between :-
Correct Answer IT staff and user representatives
Your Answer IT staff and user representatives
Select The Blank Question ________ function of data staging component involves many forms of combining pieces
of data from different sources. Correct Answer Data Transformation
Your Answer Data Transformation
Multiple Choice Single Answer Question Bayes Theorem is :-
Correct Answer P(H|X)=P(X|H)(P)/P(X)
Your Answer P(X|H)=P(X|H)(PH)/P(X)
Multiple Choice Multiple Answer Question When you use tool for design and development, following things take place with
metadata :- Correct Answer Metadata is no longer passive document , Metadata takes part in process , Metadata
aids in automation of data warehouse process Your Answer Metadata is no longer passive document , Metadata takes part in process , Metadata
aids in automation of data warehouse process
Multiple Choice Multiple Answer Question The main categories of Metadata in warehouse are :-
Correct Answer Operational , Extraction and transformation Metadata , End user Metadata
Page 28 of 141
SCDL – 4th Semester – Data Mining
Your Answer Operational , Extraction and transformation Metadata , End user Metadata
Select The Blank Question ________ is the type of pilot for early delivery with broader scope and may be
integrated. Correct Answer Broad business pilot
Your Answer Proof of concept pilot
True/False Question A process of grouping a set of physical or abstract objects into classes of similar objects
is called clusiering Correct Answer True
Your Answer True
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Single Answer Question Which type of Grid clustering depends on the granularity of lowest level of grid structure?
Correct Answer
STING
Your Answer OPTICS
Multiple Choice Single Answer Question Which of the following option of data extraction is known as application assisted data
capture? Correct Answer
Capture in source application
Your Answer Capture by comparing files
True/False Question Moving data into staging area and performing data transformation function is a part of
data acquisition. Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question The objective for physical design of data warehouse are :-
Correct Answer
Improve performance , Ensure scalability , Manage store
Your Answer Improve performance , Ensure scalability , Manage database
Page 29 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple Answer Question User must have proper access to metadata for performing responsibilities of :-
Correct Answer
Design , Administration
Your Answer Design , Administration , Management
Multiple Choice Multiple Answer Question In Intelligent miner the data mining product provides data mining algorithm including
Correct Answer
Association , Classification , Regression
Your Answer Association , Regression , Aggregation
Multiple Choice Single Answer Question The big difference between data warehouse and any operational system is its :-
Correct Answer
Usage
Your Answer Organization
True/False Question Loan payment prediction and customer credit analysis are critical to business of bank.
Correct Answer
True
Your Answer False
Multiple Choice Single Answer Question Which of the option is not considered as the major function needed to get data ready?
Correct Answer
Storing data
Your Answer Extracting data
True/False Question In the data acquisition area, the data flow begins at the data sources and pauses at
staging area. Correct Answer
True
Your Answer True
Select The Blank Question Most of the warehouses employ ________ database Management System.
Correct Answer
Relational
Your Answer Relational
Page 30 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question Which of the following is based on set of density distribution function clustering?
Correct Answer
DBSCAN
Your Answer DBSCAN
True/False Question NUMA provides better scalability than SMP.
Correct Answer
True
Your Answer True
Select The Blank Question Human being have around ________ gene.
Correct Answer
100000
Your Answer 100000
True/False Question COBWEB is a method of incremental conceptual clustering.
Correct Answer
True
Your Answer True
Match The FollowingQuestion Correct Answer Your Answer
Interactive visual data mining Visualization tool Audio signal
Data visualization Visual display Graphical display
Data mining result visualization Presentation of knowledge Visualization tool
Data mining process visualization Data mining in visual format Data mining in visual format
Multiple Choice Single Answer Question Deliberate splitting of a table and its index data into manageable part is known as :-
Correct Answer
Partitioning
Your Answer Decomposing
Multiple Choice Multiple Answer
Page 31 of 141
SCDL – 4th Semester – Data Mining
Question Data mining is applicable to :-
Correct Answer
Relational Database , Data Warehouse , Transaction Database
Your Answer Relational Database , Data Warehouse , Transaction Database
True/False Question Data mining is not that much powerful tool for vast data such as gene sequences in DNA
analysis. Correct Answer
True
Your Answer False
True/False Question Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Which from the following is used for classification and prediction?
Correct Answer
Regression trees
Your Answer Generalized linear model
Multiple Choice Multiple Answer Question Data cleansing routines work to clean the data by :-
Correct Answer
Filling missing values , Smoothing noisy data
Your Answer Filling missing values , Smoothing noisy data , Resolving inconsistency
Select The Blank Question ________ is the method used to predict the value of response variable from one to more
variables. Correct Answer
Regression
Your Answer Factor analysis
Select The Blank Question ________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer
Broad business pilot
Your Answer Proof of concept pilot
Page 32 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple Answer Question In physical design of data warehouse administration provides features like :-
Correct Answer
Avoiding reorganizing of tables , Support backup and recovery , Query processing
Your Answer Support backup and recovery , Manage store area , Query processing
Select The Blank Question ________ dimension of database in which primitive level data are spatial but
generalization becomes non spatial. Correct Answer
Spatial to non spatial
Your Answer Spatial
Multiple Choice Single Answer Question The data warehouse DBMS executes on :-
Correct Answer
Data server component
Your Answer Data server component
True/False Question A process of grouping a set of physical or abstract objects into classes of similar objects is
called clusiering Correct Answer
True
Your Answer False
Select The Blank Question ________ component of warehouse is responsible for coordinating services and activities
within the data warehouse. Correct Answer
Management and Control
Your Answer Management and Control
Multiple Choice Single Answer Question Large number of indexes affects the loading process because :-
Correct Answer
Indexes are created for new records
Your Answer Records are reshuffled
Match The FollowingQuestion Correct Answer Your Answer
Chasm Challenges Method to solve problem
Early majority Nature technology Technology to die out
Page 33 of 141
SCDL – 4th Semester – Data Mining
Innovators Method to solve problem Challenges
Early adaptors Increased interest Increased interest
Select The Blank Question ________ is an alternative aggolomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer ROKE
Multiple Choice Single Answer Question Which technique is used to predict categorical response variable?
Correct Answer
Discriminant analysis
Your Answer Factor analysis
Multiple Choice Single Answer Question Deviation based outlier detection identifes outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer Examining character of objects in groups
Multiple Choice Multiple Answer Question The information delivery methods from data warehouse are :-
Correct Answer
Complex queries , MD Analysis , Statistical Analysis
Your Answer Complex queries , MD Analysis , ETS System
True/False Question To remove noise from data is called as Smoothing.
Correct Answer
True
Your Answer True
Select The Blank Question ________ does not handle categorical attributes.
Correct Answer
CURE
Your Answer Chameleon
Multiple Choice Multiple Answer
Page 34 of 141
SCDL – 4th Semester – Data Mining
Question Data warehouse environment is functionally divided into following areas :-
Correct Answer
Data acquisition , Data storage , Information delivery
Your Answer Data storage , Information delivery , Data transformation
True/False Question Data mining often requires data integration.
Correct Answer
True
Your Answer True
Select The Blank Question ________ method of regression is useful when errors fails to satisfy normal conditions.
Correct Answer
Robust
Your Answer Polynomial
Select The Blank Question With the widespread option of ________ real-time connection is viable for data
warehouse. Correct Answer
TCP/IP
Your Answer TCP/IP
Multiple Choice Multiple Answer Question The areas of classification for metadata are :-
Correct Answer
Development/usage , Technical/business , BackRoom/Front Room
Your Answer Development/usage , Technical/business , Administration
Multiple Choice Multiple Answer Question Data base miner provides multiple data mining algorithms including :-
Correct Answer
Discovery driven OLAP analysis , Association , Classification
Your Answer Association , Classification , Regression
Select The Blank Question The ________ record is one-to-many relationship with corresponding fact table record.
Correct Answer
Dimension tables
Your Answer Fact table
Page 35 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question For Incremental data loads the sequence is :-
Correct Answer
Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing
Your Answer Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing
Multiple Choice Multiple Answer Question The platform of Data warehouse consists of :-
Correct Answer
Basic hardware components , Operating System , Network and Network software
Your Answer Basic hardware components , Network and Network software , Utility software
Multiple Choice Multiple Answer Question The smoothing techniques are :-
Correct Answer
Binning , Clustering , Regression
Your Answer Clustering , Regression , Insertion
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Select The Blank Question ________ method of regression is useful when errors fails to satisfy normal conditions.
Correct Answer
Robust
Your Answer Robust
True/False Question Data classification is two step process in which first step includes classfication of model
and in second step model describes set of data. Correct Answer
False
Your Answer True
True/False Question Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Following factors play important role in financial analysis :-
Page 36 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
Data warehouse , Data cubes , Outliner analysis
Your Answer Data warehouse , Data cubes , Data accuracy
Multiple Choice Single Answer Question Data matrix is :-
Correct Answer
Object by variable structure
Your Answer Object by object structure
Multiple Choice Multiple Answer Question The dimensions of spatial data cube are :-
Correct Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Multiple Choice Single Answer Question Query tool is meant for :-
Correct Answer
Data acquisition
Your Answer Data acquisition
Multiple Choice Single Answer Question OLAP is used for :-
Correct Answer
Online Analytical Processing
Your Answer Online Analytical Processing
True/False Question Metadata acts like a nerve center.
Correct Answer
True
Your Answer True
Match The FollowingQuestion Correct Answer Your Answer
Constructive merge New record supercedes Populating data warehouse table first time
Initial Load Populating data warehouse table first time
Populating data warehouse table first time
Incremental Load Applying ongoing changes Applying ongoing changes
Page 37 of 141
SCDL – 4th Semester – Data Mining
Load Image To correspond to target files Applying data
Multiple Choice Single Answer Question Disparity is the significant & disturbing characteristic of which type of data?
Correct Answer
Production data
Your Answer Production data
Multiple Choice Single Answer Question Effect of one attibute value on a given class is independent of values of other attibute is
called Correct Answer
Value independence
Your Answer Class Conditional independence
True/False Question Audio data mining can be an interesting alternative to visual mining.
Correct Answer
True
Your Answer True
Select The Blank Question ________ platform is the platform on which the data warehouse DBMS runs and
database exist. Correct Answer
Data storage
Your Answer Data storage
True/False Question Smoothing by bin means each value in bin is replaced by the mean value of the bucket.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Following clustering method is classified as being agglomerative or divisive :-
Correct Answer
Grid based
Your Answer Hierarchical Method
Multiple Choice Multiple Answer Question Data processing is done for :-
Correct Improving the efficiency , Ease of mining
Page 38 of 141
SCDL – 4th Semester – Data Mining
Answer Your Answer Improving the efficiency , Ease of mining , Removing redundancy
Multiple Choice Single Answer Question For Banking and financial data which type of analysis is used?
Correct Answer
Multidimensional
Your Answer Relational
Select The Blank Question Semantic integration of ________ genome database is the important task of DNA
analysis. Correct Answer
Heterogeneous and distributed
Your Answer Homogenous and distributed
Multiple Choice Multiple Answer Question Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of
classification & prediction are :- Correct Answer
Data Cleaning , Relevance Analysis , Data Transformation
Your Answer Data Cleaning , Relevance Analysis , Data Transformation
Multiple Choice Multiple Answer Question The functions of data acquisition are :-
Correct Answer
Data Extraction , Data Transformation
Your Answer Data Extraction , Data Transformation , Data cleansing
Multiple Choice Single Answer Question Data partitioning, data clustering are the techniques for :-
Correct Answer
Performance enhancement
Your Answer Performance enhancement
Multiple Choice Single Answer Question Main advantage of following which method is it's fast processing?
Correct Answer
Grid based
Your Answer Partioning based
Multiple Choice Multiple Answer Question The Main areas of Data Warehouse are :-
Page 39 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
Data acquisition , Data Storage , Information Delivery
Your Answer Data acquisition , Data Storage , Information Delivery
True/False Question From a Dataware house perspective data mining canbe viewed as an advanced stage of
Online Analytical Programming. Correct Answer
True
Your Answer True
Select The Blank Question ________ is an alternative aggolomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer ROCK
Select The Blank Question ________ is the platform for complex data transformation for the purpose of cleanse it
Correct Answer
Separate optimal Platform
Your Answer Separate optimal Platform
Multiple Choice Multiple Answer Question Metadata recorded in information delivery functional area is related to :-
Correct Answer
Predefined queries , Input parameter definition , Reports
Your Answer Predefined queries , Reports
True/False Question Data cubes created for varying levels of abstraction are referred as cuboids.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Association rules mining is based on :-
Correct Answer
Clustering and Employing rules for classification
Your Answer Clustering and Employing rules for classification
Select The Blank
Page 40 of 141
SCDL – 4th Semester – Data Mining
Question ________ includes Normalization and Aggregation as data preprocessing procedures.
Correct Answer
Data transformation
Your Answer Data transformation
True/False Question Moving data into staging area and performing data transformation function is a part of
data acquisition. Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Methods for outlier detection are categorised into following approaches :-
Correct Answer
Statistical , Distance based , Deviation based
Your Answer Statistical , Distance based , Deviation based
Multiple Choice Single Answer Question The first step of attibute oriented induction is :-
Correct Answer
Data focusing
Your Answer Data Collection
True/False Question Legacy data resides on Hierarchical or Network database.
Correct Answer
True
Your Answer True
Select The Blank Question ________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer Cluster
Multiple Choice Single Answer Question Data can be smoothed by filling the data to function such as :-
Correct Answer
Regression
Your Answer Regression
Page 41 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question Many methods for data smoothing are also methods for data reduction involving :-
Correct Answer
Discretization
Your Answer Regression
Multiple Choice Multiple Answer Question Data mining is applicable to :-
Correct Answer
Relational Database , Data Warehouse , Transaction Database
Your Answer Relational Database , Data Warehouse , Transaction Database
Multiple Choice Multiple Answer Question The different definitions of metadata are :-
Correct Answer
Data about data , Catalog of data , Data warehouse roadmap
Your Answer Data about data , Catalog of data , Data warehouse roadmap
Multiple Choice Single Answer Question The data warehouse DBMS executes on :-
Correct Answer
Data server component
Your Answer Data server component
Multiple Choice Multiple Answer Question Source Data Component may be grouped into following categories :-
Correct Answer
Production Data , Internal External Data
Your Answer Production Data , Internal External Data , Analyzed data
Select The Blank Question ________ databases are one of the most poplularly available and rich information
repositories. Correct Answer
Relational
Your Answer Relational
Match The FollowingQuestion Correct Answer Your Answer
Metadata Roadmap for user Details of summary
Page 42 of 141
SCDL – 4th Semester – Data Mining
Data storage Data management Data management
Data staging Workbench for data Workbench for data
Data Mining Knowledge discovery Knowledge discovery
True/False Question Data Mining refers to extracting knowledge from larger amount of data.
Correct Answer
True
Your Answer True
Select The Blank Question Most of the warehouses employ ________ database Management System.
Correct Answer
Relational
Your Answer Relational
Multiple Choice Single Answer Question Classification rules are extracted from
Correct Answer
Decision Tree
Your Answer Decision Tree
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Single Answer Question The technique of data clustering facilitates :-
Correct Answer
Serial access
Your Answer Indexed access
Select The Blank Question In ________ type smoothing, minimum and maximum values in given bin are identified as
bin boundaries. Correct Answer
Smoothing by bin boundaries
Your Answer Smoothing by bin boundaries
Multiple Choice Multiple Answer Question The ways of Intra query parallelization are :-
Correct Answer
Horizontal parallelization , Vertical Parallelization , Hybrid parallelization
Page 43 of 141
SCDL – 4th Semester – Data Mining
Your Answer Vertical Parallelization , Homogenous parallelization
Select The Blank Question ________ technique is the statistical technique for analyzing data.
Correct Answer
Time series
Your Answer Time series
True/False Question One of the most important search problem in genetic analysis is similarity search and
comparison among DNA sequence. Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question User must have proper access to metadata for performing responsibilities of :-
Correct Answer
Design , Administration
Your Answer Administration , Management , Accessing
Multiple Choice Single Answer Question Association rules mining is based on :-
Correct Answer
Clustering and Employing rules for classification
Your Answer Clustering and Employing rules for classification
Select The Blank Question ________ is the platform for complex data transformation for the purpose of cleanse it
Correct Answer
Separate optimal Platform
Your Answer Legacy platform
Multiple Choice Multiple Answer Question Classification and Prediction have following applications :-
Correct Answer
Credit approval , Medical Diagnosis , Performance Prediction
Your Answer Credit approval , Selective Marketing
Multiple Choice Multiple Answer Question In data storage area , DBA uses metadata for processes of :-
Correct Tuning Database , Backup , Recovery
Page 44 of 141
SCDL – 4th Semester – Data Mining
Answer Your Answer Tuning Database , Management
Multiple Choice Single Answer Question Data can be smoothed by filling the data to function such as :-
Correct Answer
Regression
Your Answer Binning
True/False Question Tools perform major functions in data warehouse environment.
Correct Answer
True
Your Answer True
Select The Blank Question ________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer Cluster
True/False Question Data staging and data storage may start out on same computing platform.
Correct Answer
True
Your Answer False
Match The FollowingQuestion Correct Answer Your Answer
Middleware & connectivity tool Transparent access to source system
Assist data ware house administration
Data Quality tool Locating data errors Locating data errors
OLAP tools Channel queries Channel queries
Alert system tool Users attention on exceptions Users attention on exceptions
Multiple Choice Single Answer Question Attribute construction is the part of :-
Correct Answer
Transformation
Page 45 of 141
SCDL – 4th Semester – Data Mining
Your Answer Smoothing
Multiple Choice Single Answer Question Which from the following are special programs that are stored on database and fired when
certain predefined action occurs? Correct Answer
Triggers
Your Answer Triggers
True/False Question Data cube stores multidimensional aggregate information.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Deliberate splitting of a table and its index data into manageable part is known as :-
Correct Answer
Partitioning
Your Answer Partitioning
Multiple Choice Single Answer Question Effect of one attibute value on a given class is independent of values of other attibute is
called Correct Answer
Value independence
Your Answer Attirbute conditional independence
Multiple Choice Single Answer Question Simple matching approach is used for computing disimilarity between two objects for :-
Correct Answer
Nominal variable
Your Answer Invariant variable
True/False Question Data mining is not that much powerful tool for vast data such as gene sequences in DNA
analysis. Correct Answer
True
Your Answer False
Multiple Choice Single Answer Question Following clustering method is classified as being agglomerative or divisive :-
Page 46 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
Grid based
Your Answer Density based
Select The Blank Question ________ is the user who has system access privileges but no database administration
privileges as well as not for table and views. Correct Answer
Network administrator
Your Answer Network administrator
Select The Blank Question ________ clustering method follows statistical and neural network approach.
Correct Answer
Model based
Your Answer Grid based
True/False Question COBWEB is a method of incremental conceptual clustering.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question The different analysis tools which are useful to detect unusual patterns such as large
amount of cash flow at certain period by certain group of people are :- Correct Answer
Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Your Answer Linkage analysis tool , Outlier analysis tool , Complexity definition tool
Multiple Choice Multiple Answer Question DNA sequences are comprised of :-
Correct Answer
Adenine , Gaunine , Thymine
Your Answer Adenine , Cytocine , Gaunine , Thymine
True/False Question Management architectural component manages and controls data acquisition functions.
Correct Answer
True
Your Answer False
Multiple Choice Single Answer
Page 47 of 141
SCDL – 4th Semester – Data Mining
Question If many indexes are needed, then on which table which option is more preferable?
Correct Answer
Splitting of tables
Your Answer Splitting of tables
True/False Question To detect money laundering and other financial crimes, it is important to integrate
information for multiple databases. Correct Answer
True
Your Answer True
Select The Blank Question It is good practice to drop ________ before initial load.
Correct Answer
Index
Your Answer Index
Multiple Choice Single Answer Question Classification rules are extracted from
Correct Answer
Decision Tree
Your Answer Decision Tree
True/False Question All data extraction, transformation, integration and staging jobs run on selected hardware
under chosen operating system. Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Metadata in a data warehouse falls into following categories :-
Correct Answer
Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Your Answer Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Multiple Choice Single Answer Question Deviation based outlier detection identifes outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer Examining character of objects in groups
Page 48 of 141
SCDL – 4th Semester – Data Mining
Select The Blank Question ________ method of regression is useful when errors fails to satisfy normal conditions.
Correct Answer
Robust
Your Answer Polynomial
Multiple Choice Multiple Answer Question The functional areas of metadata are :-
Correct Answer
Data Acquisition , Data storage , Information delivery
Your Answer Data Acquisition , Data storage , Information delivery
Match The FollowingQuestion Correct Answer Your Answer
Load Utility High performance data loading, recovery
High performance data loading, recovery
Query Governer Abort runaway query Balancing extraction of query
Query Optimizer Parsing, optimizing query Parsing, optimizing query
Query Management Balancing extraction of query Execution and rescheduling queries
Multiple Choice Single Answer Question The first step of attibute oriented induction is :-
Correct Answer
Data focusing
Your Answer Data Classification
Multiple Choice Single Answer Question Dimensionality reduction reduces the data set size by removing :-
Correct Answer
Irrelevant attributes
Your Answer Irrelevant attributes
True/False Question Architecture comes first, tools follows it.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer
Page 49 of 141
SCDL – 4th Semester – Data Mining
Question Data cleansing routines work to clean the data by :-
Correct Answer
Filling missing values , Smoothing noisy data
Your Answer Smoothing noisy data , Resolving inconsistency
Select The Blank Question ________ is the method used to predict the value of response variable from one to more
variables. Correct Answer
Regression
Your Answer Factor analysis
Select The Blank Question Most of the warehouses employ ________ database Management System.
Correct Answer
Relational
Your Answer Multidimensional
Multiple Choice Single Answer Question Which of the following method creates copies of data in distributed environment?
Correct Answer
Replication
Your Answer Replication
Select The Blank Question Human being have around ________ gene.
Correct Answer
100000
Your Answer 100000
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Multiple Answer Question DNA sequences are comprised of :-
Correct Answer
Gaunine , Thymine , Adenine
Your Answer Gaunine , Thymine , Adenine , Cytocine
Multiple Choice Multiple Answer Question Source Data Component may be grouped into following categories :-
Correct Production Data , Internal External Data
Page 50 of 141
SCDL – 4th Semester – Data Mining
Answer Your Answer Production Data , Internal External Data
True/False Question Loan payment prediction and customer credit analysis are critical to business of bank.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of
classification & prediction are :- Correct Answer
Data Cleaning , Relevance Analysis , Data Transformation
Your Answer Data Cleaning , Relevance Analysis , Data Transformation
Multiple Choice Single Answer Question The big difference between data warehouse and any operational system is its :-
Correct Answer
Usage
Your Answer Usage
True/False Question Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer True
True/False Question Moving data into staging area and performing data transformation function is a part of
data acquisition. Correct Answer
True
Your Answer True
Select The Blank Question ________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer Cluster
Select The Blank
Page 51 of 141
SCDL – 4th Semester – Data Mining
Question For operational system, the stored data contains ________values.
Correct Answer
Current data
Your Answer Current data
Multiple Choice Multiple Answer Question Splitting of data into smaller partition decision tree induction is prone to :-
Correct Answer
Fragmentation , Replication , Repetation
Your Answer Fragmentation , Generalization
Select The Blank Question ________ includes Normalization and Aggregation as data preprocessing procedures.
Correct Answer
Data transformation
Your Answer Data transformation
Multiple Choice Single Answer Question Bitmapped indexes are more suitable for data warehouse environment than for an OLTP
system Correct Answer
Bitmapped index
Your Answer Clustered index
True/False Question Data updates are common place in an operational database.
Correct Answer
True
Your Answer True
Select The Blank Question ________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer
Broad business pilot
Your Answer User tool appreciation
Match The FollowingQuestion Correct Answer Your Answer
Data Mining Knowledge discovery Knowledge discovery
Metadata Roadmap for user Roadmap for user
Page 52 of 141
SCDL – 4th Semester – Data Mining
Data storage Data management Data management
Data staging Workbench for data Workbench for data
Multiple Choice Single Answer Question A gene is usually comprised of hundreds of individual :-
Correct Answer
Nucleotides
Your Answer Nucleotides
True/False Question The Structure that brings all the components together is known as Architecture.
Correct Answer
True
Your Answer True
True/False Question NUMA provides better scalability than SMP.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Deviation based outlier detection identifes outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer Examining distance between objects
Select The Blank Question ________ is density based clustering method which computes on augumented clustering
ordering for automic ordering for automatic and interactive cluster analysis Correct Answer
DBSCAN
Your Answer DBSCAN
Multiple Choice Single Answer Question Enterprise miner technique provides data mining algorithms including distinguishing
feature as :- Correct Answer
Advanced Statistical and advanced visualization tool
Your Answer Advanced Statistical and classification tool
Match The Following
Page 53 of 141
SCDL – 4th Semester – Data Mining
Question Correct Answer Your Answer
Load Image To correspond to target files To correspond to target files
Constructive merge New record supercedes New record supercedes
Initial Load Populating data warehouse table first time
Populating data warehouse table first time
Incremental Load Applying ongoing changes Applying ongoing changes
Multiple Choice Multiple Answer Question Advantages of Wavelet transformation for clustering are :-
Correct Answer
Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast
Your Answer Unsupervised clustering , Clustering is fast
Select The Blank Question With the widespread option of ________ real-time connection is viable for data
warehouse. Correct Answer
TCP/IP
Your Answer HTTP
True/False Question A process of grouping a set of physical or abstract objects into classes of similar objects is
called clusiering Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Development and deployment of your data warehouse is joint effort between :-
Correct Answer
IT staff and user representatives
Your Answer IT staff and user representatives
Multiple Choice Single Answer Question Attribute construction is the part of :-
Correct Answer
Transformation
Your Answer Aggregation
Multiple Choice Single Answer Question Which of the following data warehouse component includes dependent data marts,
special multidimensional database and full range of query and reporting facilities?
Page 54 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
Information Delivery component
Your Answer Data Staging component
Multiple Choice Single Answer Question Which technique analyze experimental data?
Correct Answer
Analysis of variance
Your Answer Analysis of variance
Select The Blank Question ________ function of data staging component involves many forms of combining pieces
of data from different sources. Correct Answer
Data Transformation
Your Answer Data Transformation
Multiple Choice Multiple Answer Question Metadata is essential for IT for :-
Correct Answer
Source data structures , Data summarization
Your Answer Source data structures , Data summarization , Aggregation
Multiple Choice Multiple Answer Question Methods for outlier detection are categorised into following approaches :-
Correct Answer
Statistical , Distance based , Deviation based
Your Answer Statistical , Distance based , Deviation based
Select The Blank Question ________ are responsible for running queries and reports against data warehouse tables.
Correct Answer
End users
Your Answer Query tool specialist
Select The Blank Question ________ is the user who has system access privileges but no database administration
privileges as well as not for table and views. Correct Answer
Network administrator
Your Answer End user
Multiple Choice Multiple Answer
Page 55 of 141
SCDL – 4th Semester – Data Mining
Question Data base miner provides multiple data mining algorithms including :-
Correct Answer
Discovery driven OLAP analysis , Association , Classification
Your Answer Discovery driven OLAP analysis , Association , Classification
Multiple Choice Multiple Answer Question Knowledge discovery process includes :-
Correct Answer
Data Cleaning , Data Intergration , Data Selectin
Your Answer Data Cleaning , Data Intergration , Data Selectin
True/False Question In Linear regression data are modeled to fit a straight line.
Correct Answer
True
Your Answer True
True/False Question Data in data warehouse cuts across application.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question If many indexes are needed, then on which table which option is more preferable?
Correct Answer
Splitting of tables
Your Answer Rearranging of tables
Multiple Choice Single Answer Question Which technique is used to predict categorical response variable?
Correct Answer
Discriminant analysis
Your Answer Discriminant analysis
Multiple Choice Multiple Answer Question Following data transformation methods are used in analysis of time series data :-
Correct Answer
Scaling , Normalization , Windows Stiching
Your Answer Scaling , Normalization , Windows Stiching
Page 56 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question Concept Description generates description for :-
Correct Answer
Charaterisation and Comparison
Your Answer Charaterisation and Comparison
True/False Question Data preprocessing is an important step in knowledge discovery process.
Correct Answer
True
Your Answer True
Select The Blank Question ________ databases are one of the most poplularly available and rich information
repositories. Correct Answer
Relational
Your Answer Relational
Multiple Choice Multiple Answer Question Data Mining means :-
Correct Answer
Knowledge mining from database , Data /Pattern analysis , Data Archelogy
Your Answer Knowledge mining from database , Data /Pattern analysis , Data Archelogy
Multiple Choice Single Answer Question What improves accuracy and speed of subsequent mining process?
Correct Answer
Integration
Your Answer Integration
Multiple Choice Multiple Answer Question Data mining is applicable to :-
Correct Answer
Relational Database , Data Warehouse , Transaction Database
Your Answer Relational Database , Data Warehouse , Transaction Database
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
True/False
Page 57 of 141
SCDL – 4th Semester – Data Mining
Question Data cube stores multidimensional aggregate information.
Correct Answer
True
Your Answer True
Select The Blank Question ________ is a summarization of general characteristics or features of a target class of data.
Correct Answer
Data Characterization
Your Answer Data Generalization
Multiple Choice Single Answer Question The pilot which is useful for user and project team both as it touches all important functions
is :- Correct Answer
Expanded seed pilot
Your Answer User tool appreciation pilot
Multiple Choice Single Answer Question Which of the following technique involves placing and managing related units of data in
same physical block of storage Correct Answer
Clustering
Your Answer Clustering
Multiple Choice Multiple Answer Question History of metadata includes :-
Correct Answer
Changes to source system , Data extraction methods , Data transformation algorithm
Your Answer Changes to source system , Data extraction methods
Multiple Choice Single Answer Question Which of the following approach requires more computation?
Correct Answer
Filter approach
Your Answer Filter approach
True/False Question The substantial part of historical data comes form antiquated legacy system.
Correct Answer
True
Your Answer True
Page 58 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple Answer Question Data reduction includes :-
Correct Answer
Single value decomposition , Wavelets , Regression
Your Answer Single value decomposition , Wavelets , Regression
Multiple Choice Single Answer Question Bayes Theorem is :-
Correct Answer
P(H|X)=P(X|H)(P)/P(X)
Your Answer P(H|X)=P(X|H)(P)/P(X)
Multiple Choice Single Answer Question Establish the importance of data quality, Form data quality steering committee, Institute a
data quality framework, Assign roles and responsibilities. These are the steps of :- Correct Answer
Data purification
Your Answer Data quality control
Select The Blank Question With the widespread option of ________ real-time connection is viable for data warehouse.
Correct Answer
TCP/IP
Your Answer TCP/IP
Multiple Choice Single Answer Question Which is the typical example of Grid based clustering method
Correct Answer
STING
Your Answer STING
Match The FollowingQuestion Correct Answer Your Answer
Normalization Scattered data Constructing small units of data
Smoothing Removal of noisy data Removal of noisy data
Aggregation Summary operations Constructing new attributes
Generalization Data hierarchies Data hierarchies
Multiple Choice Single Answer
Page 59 of 141
SCDL – 4th Semester – Data Mining
Question Association rules mining is based on :-
Correct Answer
Clustering and Employing rules for classification
Your Answer Clustering and Employing rules for classification
True/False Question Bitmapped indexing does not apply to fault tables.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question For processing metadata in informal delivery area, data can be referred back for :-
Correct Answer
Source data configuration , Data structure , Data transformation
Your Answer Source data configuration , Data structure , Data transformation
True/False Question The precision measure is the % of retrieved documents that are in fact relevant to query.
Correct Answer
True
Your Answer False
Multiple Choice Single Answer Question Main advantage of following which method is it's fast processing?
Correct Answer
Grid based
Your Answer Grid based
Select The Blank Question Analysis of frequent sequential patterns is important in analysis ________ in generic
sequence. Correct Answer
Dismilarity and similarity
Your Answer Similarity
Select The Blank Question ________ is the clustering method which encounters difficultes regarding the selection of
merge/split points Correct Answer
Hierachical
Your Answer Hierachical
Page 60 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question Following clustering method is classified as being agglomerative or divisive :-
Correct Answer
Grid based
Your Answer Grid based
Multiple Choice Multiple Answer Question Normalization improves :-
Correct Answer
Efficiency , Accuracy
Your Answer Efficiency , Accuracy
Multiple Choice Single Answer Question A Wavelet transformation is :-
Correct Answer
Single processing Technique that decomposes signals into different frequency subbands
Your Answer Single processing Technique that decomposes signals into different frequency subbands
Multiple Choice Single Answer Question The Clustering method DBSCAN stands for :-
Correct Answer
Desity Based Spatial clustering of Application with Noise
Your Answer Desity Based Spatial clustering of Application with Noise
Select The Blank Question ________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer
Index tree
Your Answer Index tree
Multiple Choice Single Answer Question Behavioral data of objects can be derived by the application of :-
Correct Answer
Method
Your Answer Method
Select The Blank Question ________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer
Broad business pilot
Page 61 of 141
SCDL – 4th Semester – Data Mining
Your Answer Broad business pilot
Multiple Choice Multiple Answer Question Metadata types can be classified as :-
Correct Answer
Business metadata , Technical metadata
Your Answer Business metadata , Technical metadata
Multiple Choice Single Answer Question Simple matching approach is used for computing disimilarity between two objects for :-
Correct Answer
Nominal variable
Your Answer Nominal variable
Multiple Choice Single Answer Question Effect of one attibute value on a given class is independent of values of other attibute is
called Correct Answer
Value independence
Your Answer Value independence
Multiple Choice Multiple Answer Question The different analysis tools which are useful to detect unusual patterns such as large
amount of cash flow at certain period by certain group of people are :- Correct Answer
Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Your Answer Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Multiple Choice Single Answer Question When DDL statements are created using database software, so to create an index system
creates :- Correct Answer
B-Tree index
Your Answer B-Tree index
Multiple Choice Multiple Answer Question Data processing techniques are :-
Correct Answer
Cleansing , Integration , Transformation
Your Answer Cleansing , Integration , Transformation
Match The FollowingQuestion Correct Answer Your Answer
Page 62 of 141
SCDL – 4th Semester – Data Mining
Load Utility High performance data loading, recovery
High performance data loading, recovery
Query Governer Abort runaway query Abort runaway query
Query Optimizer Parsing, optimizing query Parsing, optimizing query
Query Management Balancing extraction of query Balancing extraction of query
Select The Blank Question Indexed ________ engines search index, web pages and build huge keyword based indices
which help to search sets of web pages containing certain keywords Correct Answer
Web Search Engines
Your Answer Web Search Engines
True/False Question To detect money laundering and other financial crimes, it is important to integrate
information for multiple databases. Correct Answer
True
Your Answer True
Select The Blank Question ________ is the time consuming and less feasible approach for filling missing values.
Correct Answer
Filling missing values manually
Your Answer Filling missing values manually
Multiple Choice Single Answer Question Which from the following is used for classification and prediction?
Correct Answer
Regression trees
Your Answer Regression trees
Multiple Choice Multiple Answer Question Multimedia database stores and manages large collection of database such as :-
Correct Answer
Audio and Video , Sequence data , Text Markup and linkage
Your Answer Audio and Video , Sequence data
Select The Blank Question ________ is an alternative aggolomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Page 63 of 141
SCDL – 4th Semester – Data Mining
Your Answer ROCK
Multiple Choice Single Answer Question Association rules mining is based on :-
Correct Answer
Clustering and Employing rules for classification
Your Answer Clustering and Employing rules for classification
Multiple Choice Single Answer Question Data matrix is :-
Correct Answer
Object by variable structure
Your Answer Object by variable structure
Select The Blank Question ________ architecture is more concerned with data access than memory access.
Correct Answer
MPP
Your Answer MPP
True/False Question Architecture comes first, tools follows it.
Correct Answer
True
Your Answer True
True/False Question Task of selection in data transformation forms part of extraction function.
Correct Answer
True
Your Answer False
Multiple Choice Single Answer Question Classification rules are extracted from
Correct Answer
Decision Tree
Your Answer Decision Tree
Select The Blank Question ________ includes Normalization and Aggregation as data preprocessing procedures.
Page 64 of 141
SCDL – 4th Semester – Data Mining
Correct Answer
Data transformation
Your Answer Data transformation
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
True/False Question Matching the choice of DBMS with selected server hardware is not important for
warehouse. Correct Answer
False
Your Answer False
Match The FollowingQuestion Correct Answer Your Answer
Metadata Roadmap for user Roadmap for user
Data storage Data management Data management
Data staging Workbench for data Workbench for data
Data Mining Knowledge discovery Knowledge discovery
True/False Question Database systems, data warehouse system and world wide web have become
mainstream information system. Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Bitmapped indexes are more suitable for data warehouse environment than for an
OLTP system Correct Answer
Bitmapped index
Your Answer Bitmapped index
Multiple Choice Single Answer Question The big difference between data warehouse and any operational system is its :-
Correct Answer
Usage
Your Answer Usage
Page 65 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question One major effort within data transformation is :-
Correct Answer
Improvement of data quality
Your Answer Analysis of data quality
Multiple Choice Multiple Answer Question Advantages of Wavelet transformation for clustering are :-
Correct Answer
Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast
Your Answer Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast
Multiple Choice Single Answer Question Which of the following technique is used to display group summary statistics?
Correct Answer
Quality control
Your Answer Survival analysis
Select The Blank Question ________ platform is the platform on which the data warehouse DBMS runs and
database exist. Correct Answer
Data storage
Your Answer Data storage
Multiple Choice Multiple Answer Question Class Comparison is performed through following steps :-
Correct Answer
Data Collection , Dimension relevance analysis , Presentation of derived comparison
Your Answer Data Collection , Dimension relevance analysis , Presentation of derived comparison
Select The Blank Question It is good practice to drop ________ before initial load.
Correct Answer
Index
Your Answer Index
Select The Blank Question ________ is the time consuming and less feasible approach for filling missing
values. Correct Answer
Filling missing values manually
Page 66 of 141
SCDL – 4th Semester – Data Mining
Your Answer Filling missing values manually
Multiple Choice Multiple Answer Question Basic Heuristic method of attribute subset selection includes following techniques :-
Correct Answer
Stepwise forward selection , Stepwise backward elimination
Your Answer Stepwise forward selection , Stepwise backward elimination , Combination of forward selection and backward elimination
True/False Question For maintaining the quality of data proper naming conventions help to make data
elements well understood by users. Correct Answer
True
Your Answer True
Select The Blank Question In ________ duplicate sub trees exist within the tree.
Correct Answer
Repetition
Your Answer Repetition
Select The Blank Question The technique of ________ enables concurrent input/output operations and
improves file's access performance substantially. Correct Answer
File striping
Your Answer File striping
Select The Blank Question ________ does not handle categorical attributes.
Correct Answer
CURE
Your Answer CURE
Select The Blank Question Creating ________is violation of Normalization principles.
Correct Answer
Array
Your Answer Array
True/False
Page 67 of 141
SCDL – 4th Semester – Data Mining
Question Data in warehouse is primarily for query.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Preprocessing steps of data in order to help improve accuracy, efficiency and
scalability of classification & prediction are :- Correct Answer
Data Cleaning , Relevance Analysis , Data Transformation
Your Answer Data Cleaning , Relevance Analysis , Data Transformation
Multiple Choice Single Answer Question Which task in data transformation includes types of data manipulation on selected
parts of source data? Correct Answer
Splitting/Joining
Your Answer Splitting/Joining
True/False Question Business metadata is like a roadmap or easy to use information directory showing
contents and how to get there. Correct Answer
True
Your Answer True
True/False Question Data error discovery and data correction are two parts of data cleansing process.
Correct Answer
True
Your Answer False
Multiple Choice Multiple Answer Question The dimensions of spatial data cube are :-
Correct Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Select The Blank Question ________ technique is known as snapshot differential technique.
Correct Answer
Capture based on comparing files
Page 68 of 141
SCDL – 4th Semester – Data Mining
Your Answer Capture based on comparing files
Multiple Choice Multiple Answer Question The benefits of improved data quality are :-
Correct Answer
Better customer service , Improved productivity , Reliable strategic decision making
Your Answer Better customer service , Improved productivity , Reliable strategic decision making
Multiple Choice Single Answer Question Which technique of data extraction is available to non relational databases?
Correct Answer
Capture through transaction log
Your Answer Capture of static data
Select The Blank Question ________ technique is the statistical technique for analyzing data.
Correct Answer
Time series
Your Answer Time series
True/False Question Noise in data means error or variance in measured variable.
Correct Answer
True
Your Answer True
Multiple Choice Multiple Answer Question Data mining at home can help to mine data related to :-
Correct Answer
Medical History , Cancer , Chromosome abnormalities
Your Answer Medical History , Chromosome abnormalities , Physiological conditions
True/False Question Data Mining refers to extracting knowledge from larger amount of data.
Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question Simple matching approach is used for computing disimilarity between two objects
for :- Correct Nominal variable
Page 69 of 141
SCDL – 4th Semester – Data Mining
Answer Your Answer Nominal variable
Multiple Choice Multiple Answer Question Following are the reasons for getting data polluted :-
Correct Answer
Data aging , Input errors , Fraud
Your Answer Data aging , Input errors , Processing errors
Select The Blank Question ________ is the type of pilot for early delivery with broader scope and may be
integrated. Correct Answer
Broad business pilot
Your Answer Broad business pilot
Multiple Choice Multiple Answer Question Following are the issues to consider during data integration :-
Correct Answer
Schema integration , Redundancy , Detection and resolution of data values
Your Answer Schema integration , Redundancy , Detection and resolution of data values
Match The FollowingQuestion Correct Answer Your Answer
Rough set Approach Noisy Data Previously unseen data
k-Nearest Neighbour Classifiers Learning Analogy Noisy Data
Class based Testing Instanace Based Learning Analogy
Generic Algorithms Natural Evolution Natural Evolution
Multiple Choice Single Answer Question When DDL statements are created using database software, so to create an index
system creates :- Correct Answer
B-Tree index
Your Answer B-Tree index
True/False Question The difficulties encountered in data transformation function relate to heterogeneity
of the source system. Correct Answer
True
Page 70 of 141
SCDL – 4th Semester – Data Mining
Your Answer False
True/False Question Data mining is not that much powerful tool for vast data such as gene sequences in
DNA analysis. Correct Answer
True
Your Answer True
Multiple Choice Single Answer Question When current extent on disk storage for a file is full, DBMS finds new extent and
allows an insertion of new record is known as :- Correct Answer
Dynamic extension
Your Answer Dynamic extension
Multiple Choice Multiple Answer Question Following are the types of normalization :-
Correct Answer
Min-Max Normalization , Z-score normalization , Normalization by scaling
Your Answer Min-Max Normalization , Z-score normalization , Normalization by scaling
Multiple Choice Multiple Answer Question In generation of numerical hierarchies for cluster analysis following techniques are
useful :- Correct Answer
Binning , Histogram analysis , Clustering
Your Answer Binning , Histogram analysis , Segmentation
Select The Blank Question ________ is an alternative aggolomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer ROCK
Multiple Choice Multiple Answer Question Generalized linear model includes :-
Correct Answer
Logistic regression , Poisson regression
Your Answer Logistic regression , Poisson regression
Multiple Choice Single Answer Question Inherently Architected, Single, central storage of data about content, Centralized
rules and control, Seek quick result, these are the advantages of which type of data
Page 71 of 141
SCDL – 4th Semester – Data Mining
extraction? Correct Answer
Top down approach
Your Answer Top down approach
Multiple Choice Single Answer Question Data matrix is :-
Correct Answer
Object by variable structure
Your Answer Object by variable structure
Multiple Choice Single Answer Question Queries run faster to find exact match using which type of indexing?
Correct Answer
Clustered index
Your Answer Clustered index
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Select The BlankQuestion: ________ function of data staging component involves many forms of combining pieces of data from different sources. Correct Answer: Data Transformation Your Answer: Data Transformation Multiple Choice Multiple AnswerQuestion: The Main areas of Data Warehouse are :- Correct Answer: Data acquisition , Data Storage , Information Delivery Your Answer: Data acquisition , Data Storage , Information Delivery Select The BlankQuestion: Data cleansing and ________ methods of data mining helps in integration of genetic data and construction of warehouse for genetic data analysis. Correct Answer: Integration Your Answer: Integration Multiple Choice Multiple AnswerQuestion: The dimensions of spatial data cube are :- Correct Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial Your Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial Multiple Choice Single AnswerQuestion: In data reduction, the cluster representations of data are used to :- Correct Answer: Replace data Your Answer: Represent actual data Multiple Choice Multiple AnswerQuestion: Distinguishing characteristics of data warehouse architecture are :-
Page 72 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: Different Objective Scope , Data Content , Flexible and Dynamic Your Answer: Different Objective Scope , Complete Analysis and Quick Response , Flexible and Dynamic Select The BlankQuestion: In data warehouse architecture, the ________ component interleaves with and connects other components. Correct Answer: Metadata Your Answer: Metadata Multiple Choice Multiple AnswerQuestion: Methods for outlier detection are categorised into following approaches :- Correct Answer: Statistical , Distance based , Deviation based Your Answer: Statistical , Distance based , Deviation based True/FalseQuestion: Metadata describes all the pertinent aspects of the data in data warehouse. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: Financial data called for banking and financial industry are often relatively :- Correct Answer: Complete , Reliable , High Quality Your Answer: Complete , Reliable , High Quality Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :- Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction Your Answer: Credit approval , Medical Diagnosis , Performance Prediction True/FalseQuestion: Data Integration means multiple resourses may be combined. Correct Answer: True Your Answer: True Select The BlankQuestion: ________ can store aggregate and detail data at varying levels of resolution or abstraction. Correct Answer: Index tree Your Answer: Multidimensional index tree True/FalseQuestion: Moving data into staging area and performing data transformation function is a part of data acquisition. Correct Answer: True Your Answer: True True/FalseQuestion: Lower the level of detail, finer the data granularity. Correct Answer: True Your Answer: True Select The BlankQuestion: ________ is an alternative aggolomerative hierarchical clustering algorithm. Correct Answer: ROCK Your Answer: ROCK
Page 73 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single AnswerQuestion: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :- Correct Answer: Huge size of data Your Answer: Huge size of data Multiple Choice Single AnswerQuestion: Bayes Theorem is :- Correct Answer: P(H|X)=P(X|H)(P)/P(X) Your Answer: P(H|X)=P(X|H)(P)/P(X) Multiple Choice Multiple AnswerQuestion: Data mining Functionalities are :- Correct Answer: Charactrization and Discrimination , Association Analysis , Cluster Analysis Your Answer: Charactrization and Discrimination , Association Analysis , Cluster Analysis Select The BlankQuestion: ________ does not handle categorical attributes. Correct Answer: CURE Your Answer: ROCK Multiple Choice Single AnswerQuestion: Classification rules are extracted from Correct Answer: Decision Tree Your Answer: Decision Tree Select The BlankQuestion: The ________ record is one-to-many relationship with corresponding fact table record. Correct Answer: Dimension tables Your Answer: Dimension tables True/FalseQuestion: In Database system multidimensional index trees are primarily used for providing fast data access. Correct Answer: True Your Answer: True Match The FollowingQuestion Correct Answer Your AnswerData Mining Knowledge discovery Knowledge discovery Metadata Roadmap for user Roadmap for user Data storage Data management Data management Data staging Workbench for data Workbench for data True/FalseQuestion: COBWEB is a method of incremental conceptual clustering. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :- Correct Answer: Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool Your Answer: Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Page 74 of 141
SCDL – 4th Semester – Data Mining
Match The FollowingQuestion Correct Answer Your AnswerDisparate data Production data Production data Non volatile data Query and analysis Query and analysis Data granularity Level of detail Level of detail Data from external External data External data source Multiple Choice Multiple AnswerQuestion: Advantages of Wavelet transformation for clustering are :- Correct Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast Your Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast Multiple Choice Single AnswerQuestion: Association rules mining is based on :- Correct Answer: Clustering and Employing rules for classification Your Answer: Clustering and Employing rules for classification Multiple Choice Single AnswerQuestion: Data can be smoothed by filling the data to function such as :- Correct Answer: Regression Your Answer: Regression Multiple Choice Multiple AnswerQuestion: In physical design of data warehouse administration provides features like :- Correct Answer: Support backup and recovery , Query processing , Avoiding reorganizing of tables Your Answer: Avoiding reorganizing of tables , Support backup and recovery , Query processing True/FalseQuestion: MDDBMS stands for - Multilevel Database Management System. Correct Answer: False Your Answer: False Multiple Choice Single AnswerQuestion: Effect of one attibute value on a given class is independent of values of other attibute is called Correct Answer: Value independence Your Answer: Value independence Multiple Choice Multiple AnswerQuestion: When you use tool for design and development, following things take place with metadata :- Correct Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process Your Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process Multiple Choice Single AnswerQuestion: Data partitioning, data clustering are the techniques for :- Correct Answer: Performance enhancement Your Answer: Performance enhancement Multiple Choice Multiple AnswerQuestion: Knowledge discovery process includes :- Correct Answer: Data Cleaning , Data Intergration , Data Selectin
Page 75 of 141
SCDL – 4th Semester – Data Mining
Your Answer: Data Cleaning , Data Intergration , Data Selectin Multiple Choice Single AnswerQuestion: Query tool is meant for :- Correct Answer: Data acquisition Your Answer: Data acquisition Multiple Choice Multiple AnswerQuestion: The functions of data acquisition are :- Correct Answer: Data Extraction , Data Transformation Your Answer: Data Extraction , Data Transformation Select The BlankQuestion: ________ databases are one of the most poplularly available and rich information repositories. Correct Answer: Relational Your Answer: Relational True/FalseQuestion: From a Dataware house perspective data mining canbe viewed as an advanced stage of Online Analytical Programming. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: Which of the following clustering analysis method uses multiresolution approach? Correct Answer: STING , Wave Cluster Your Answer: STING , Wave Cluster True/FalseQuestion: The Structure that brings all the components together is known as Architecture. Correct Answer: True Your Answer: True Select The BlankQuestion: Human being have around ________ gene. Correct Answer: 100000 Your Answer: 100000 Select The BlankQuestion: ________ is the method used to predict the value of response variable from one to more variables. Correct Answer: Regression Your Answer: Regression Multiple Choice Single AnswerQuestion: Which of the following type executes query operations in pipeline manner? Correct Answer: Vertical parallelism Your Answer: Vertical parallelism True/FalseQuestion: Data cleansing means removing noisy and inconsistent data. Correct Answer: True Your Answer: True Multiple Choice Single Answer
Page 76 of 141
SCDL – 4th Semester – Data Mining
Question: When DDL statements are created using database software, so to create an index system creates :- Correct Answer: B-Tree index Your Answer: B-Tree index
LIST OF ATTEMPTED QUESTIONS AND ANSWERS True/FalseQuestion: Architecture comes first, tools follows it. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: Following are the theories for the basis of data mining :- Correct Answer: Pattern discovery , Probability theory , Microeconomic view Your Answer: Pattern discovery , Probability theory , Microeconomic view True/FalseQuestion: Data preprocessing is an important step in knowledge discovery process. Correct Answer: True Your Answer: True True/FalseQuestion: A distinguishing feature of Clementine is its object oriented extended module interface. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: The Architecture defines :- Correct Answer: Measurements , Standard , General Design Your Answer: Measurements , Standard , Standard Techniques Select The BlankQuestion: ________ technique contribute to machine learning, neural network, association mining, sequential pattern mining. Correct Answer: Pattern discovery Your Answer: Pattern discovery Match The FollowingQuestion Correct Answer Your AnswerClassification tool To filter unrelated attributes To characterize unusual access
sequence Clustering tool To group different cases Transaction activity using graph Data visualization Transaction activity using To group different cases Tool graphLinkage analysis tool To identify links To identify links Multiple Choice Multiple AnswerQuestion: Data processing techniques are :- Correct Answer: Cleansing , Integration , Transformation Your Answer: Cleansing , Integration , Transformation Select The BlankQuestion: Creating ________is violation of Normalization principles.
Page 77 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: Array Your Answer: Cluster Multiple Choice Multiple AnswerQuestion: Building blocks of Data Warehouse are :- Correct Answer: Source Data , Data Staging , Management and Control Your Answer: Source Data , Data Staging , Data Manager Multiple Choice Single AnswerQuestion: OPTICS regarding clustering stands for :- Correct Answer: Ordering Points to identify the clustering Structure Your Answer: Ordering Points to identify the clustering Structure Select The BlankQuestion: ________ that unable massive quantities of data to be transported from one platform to another. Correct Answer: Data ports Your Answer: Data ports True/FalseQuestion: Sequential pattern analysis and similarity search techniques have been developed in data mining. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: The stored values of an attribute represents the value of attribute at this moment of time is :- Correct Answer: Current value Your Answer: Value of attribute Match The FollowingQuestion Correct Answer Your AnswerData loading tool Primary key generation Bulk extraction for full refresh Data modeling tool Reverse Engineering Reverse Engineering capabilities CapabilitiesData Extraction tool Bulk extraction for full Default values
refreshData transformation Default values Primary key generation tool True/FalseQuestion: Audio data mining can be an interesting alternative to visual mining. Correct Answer: True Your Answer: True Select The BlankQuestion: Most of the warehouses employ ________ database Management System. Correct Answer: Relational Your Answer: Hierarchical Multiple Choice Single AnswerQuestion: Which from the following are special programs that are stored on database and fired when certain predefined action occurs? Correct Answer: Triggers
Page 78 of 141
SCDL – 4th Semester – Data Mining
Your Answer: Events Multiple Choice Multiple AnswerQuestion: For processing metadata in informal delivery area, data can be referred back for :- Correct Answer: Data structure , Data transformation , Source data configuration Your Answer: Source data configuration , Data structure , Data transformation Multiple Choice Multiple AnswerQuestion: Following are the types of normalization :- Correct Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling Your Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling Multiple Choice Single AnswerQuestion: Following clustering method is classified as being agglomerative or divisive :- Correct Answer: Grid based Your Answer: Partioning based Multiple Choice Single AnswerQuestion: The big difference between data warehouse and any operational system is its :- Correct Answer: Usage Your Answer: Structure Multiple Choice Multiple AnswerQuestion: Following are the data movement options in data warehouse :- Correct Answer: Shared disk , Mass data transmission , Real time connection Your Answer: Shared disk , Mass data transmission , Real time connection True/FalseQuestion: Data Mining refers to extracting knowledge from larger amount of data. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: Main advantage of following which method is it's fast processing? Correct Answer: Grid based Your Answer: Density based Select The BlankQuestion: Indexed ________ engines search index,web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords Correct Answer: Web Search Your Answer: Web Search Multiple Choice Multiple AnswerQuestion: Data base miner provides multiple data mining algorithms including :- Correct Answer: Discovery driven OLAP analysis , Association , Classification Your Answer: Discovery driven OLAP analysis , Association , Regression Select The BlankQuestion: ________ method of regression is useful when errors fails to satisfy normal conditions. Correct Answer: Robust Your Answer: Non parametric True/False
Page 79 of 141
SCDL – 4th Semester – Data Mining
Question: All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: Deviation based outlier detection identifes outliers by :- Correct Answer: Examining character of objects in groups Your Answer: Examining character of objects in groups Select The BlankQuestion: It is good practice to drop ________ before initial load. Correct Answer: Index Your Answer: Index Select The BlankQuestion: Most of DBMS have ________ index techniques as default index techniques. Correct Answer: B-Tree Your Answer: B-Tree Select The BlankQuestion: In ________ duplicate sub trees exist within the tree. Correct Answer: Repetition Your Answer: Fragmentation Multiple Choice Single AnswerQuestion: Which is the typical example of Grid based clustering method Correct Answer: STING Your Answer: DBSCAN True/FalseQuestion: In the data acquisition area, the data flow begins at the data sources and pauses at staging area. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: In data storage area , DBA uses metadata for processes of :- Correct Answer: Backup , Recovery , Tuning Database Your Answer: Backup , Recovery True/FalseQuestion: Descriptive mining takes perform ingerence on current data which predictive mining characterize the general properties of data in database Correct Answer: False Your Answer: True Select The BlankQuestion: When data block contains excessive amount of free space, performance ________ Correct Answer: Degenerates Your Answer: Degenerates Select The BlankQuestion: ________ platform is the platform on which the data warehouse DBMS runs and database exist. Correct Answer: Data storage
Page 80 of 141
SCDL – 4th Semester – Data Mining
Your Answer: Legacy Multiple Choice Multiple AnswerQuestion: Data integration means :- Correct Answer: Integrating database , Integrating cubes , Integrating files Your Answer: Integrating database , Integrating cubes , Integrating files Multiple Choice Single AnswerQuestion: Which technique analyze experimental data? Correct Answer: Analysis of variance Your Answer: Analysis of variance True/FalseQuestion: Smoothing by bin means each value in bin is replaced by the mean value of the bucket. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: Maintenance of cache consistency is the limitation of :- Correct Answer: MPP Your Answer: SMP Multiple Choice Multiple AnswerQuestion: Substantial portion of Business metadata originates from :- Correct Answer: Textual documents , Spreadsheets , Business rules Your Answer: Textual documents , Spreadsheets , Business rules Multiple Choice Single AnswerQuestion: Redundancies can be deleted by :- Correct Answer: Co-relational analysis Your Answer: Relational analysis Multiple Choice Single AnswerQuestion: Data reduction obtains a reduced representation of data set that is :- Correct Answer: Much smaller Your Answer: Much smaller
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Select The BlankQuestion: Data cleansing and ________ methods of data mining helps in integration of genetic data and construction of warehouse for genetic data analysis. Correct Answer: Integration Your Answer: Integration Select The BlankQuestion: ________ method of regression is useful when errors fails to satisfy normal conditions. Correct Answer: Robust Your Answer: Robust Multiple Choice Single AnswerQuestion: Bitmap index takes significantly less space than which type of index? Correct Answer: B-Tree index
Page 81 of 141
SCDL – 4th Semester – Data Mining
Your Answer: B-Tree index Select The BlankQuestion: ________components consists all the different ways of making the information from the data warehouse available to the user. Correct Answer: Information Delivery Your Answer: Information Delivery True/FalseQuestion: Architecture comes first, tools follows it. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: The Main areas of Data Warehouse are :- Correct Answer: Data acquisition , Data Storage , Information Delivery Your Answer: Data acquisition , Data Storage , Information Delivery Select The BlankQuestion: ________ is density based clustering method which computes on augumented clustering ordering for automic ordering for automatic and interactive cluster analysis Correct Answer: DBSCAN Your Answer: DBSCAN Match The FollowingQuestion Correct Answer Your AnswerLoad Utility High performance data High performance data loading,
loading, recovery recovery Query Governer Abort runaway query Active data catalog/directory Query Optimizer Parsing, optimizing query Parsing, optimizing query Query Management Balancing extraction of query Execution and rescheduling queries Multiple Choice Multiple AnswerQuestion: Source Data Component may be grouped into following categories :- Correct Answer: Production Data , Internal External Data Your Answer: Production Data , Internal External Data Select The BlankQuestion: ________ is the type of pilot for early delivery with broader scope and may be integrated. Correct Answer: Broad business pilot Your Answer: Broad business pilot Multiple Choice Multiple AnswerQuestion: The smoothing techniques are :- Correct Answer: Binning , Clustering , Regression Your Answer: Clustering , Regression Multiple Choice Single AnswerQuestion: Which of the following data warehouse component includes dependent data marts, special multidimensional database and full range of query and reporting facilities? Correct Answer: Information Delivery component Your Answer: Metadata Component True/FalseQuestion: The Structure that brings all the components together is known as Architecture.
Page 82 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: True Your Answer: True Select The BlankQuestion: The technique of ________ enables concurrent input/output operations and improves file's access performance substantially. Correct Answer: File striping Your Answer: File striping True/FalseQuestion: Management architectural component manages and controls data acquisition functions. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: If many indexes are needed, then on which table which option is more preferable? Correct Answer: Splitting of tables Your Answer: Rearranging of tables Multiple Choice Single AnswerQuestion: Which of the following of Grid based clustering method explorates statistical information? Correct Answer: STING Your Answer: STING Multiple Choice Single AnswerQuestion: Attribute construction is the part of :- Correct Answer: Transformation Your Answer: Aggregation Multiple Choice Multiple AnswerQuestion: DNA sequences are comprised of :- Correct Answer: Adenine , Gaunine , Thymine Your Answer: Gaunine , Thymine , Adenine True/FalseQuestion: In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by rectangles Correct Answer: False Your Answer: False Multiple Choice Single AnswerQuestion: Effect of one attibute value on a given class is independent of values of other attibute is called Correct Answer: Value independence Your Answer: Value independence Multiple Choice Single AnswerQuestion: Association rules mining is based on :- Correct Answer: Clustering and Employing rules for classification Your Answer: Clustering and Employing rules for classification Select The BlankQuestion: Most of DBMS have ________ index techniques as default index techniques. Correct Answer: B-Tree
Page 83 of 141
SCDL – 4th Semester – Data Mining
Your Answer: B-Tree Match The FollowingQuestion Correct Answer Your AnswerDisparate data Production data Production data Non volatile data Query and analysis Query and analysis Data granularity Level of detail Level of detail Data from external External data External data source Multiple Choice Single AnswerQuestion: Dimensionality reduction reduces the data set size by removing :- Correct Answer: Irrelevant attributes Your Answer: Irrelevant attributes Multiple Choice Multiple AnswerQuestion: Data reduction reduces data size by :- Correct Answer: Aggregation , Eliminating redundant features Your Answer: Aggregation , Eliminating redundant features , Restructuring True/FalseQuestion: Data integration merges data from multiple sources into coherent sources. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: The option "capture in source application technique of data extraction degrades performance of source application because :- Correct Answer: Additional processing needs Your Answer: Additional processing needed to capture changes on separate files Multiple Choice Single AnswerQuestion: Which of the following type executes query operations in pipeline manner? Correct Answer: Vertical parallelism Your Answer: Vertical parallelism Multiple Choice Single AnswerQuestion: Data partitioning, data clustering are the techniques for :- Correct Answer: Performance enhancement Your Answer: Performance enhancement True/FalseQuestion: COBWEB is an extension of CLASSIT for incremental clustering of contineous data. Correct Answer: False Your Answer: True Multiple Choice Multiple AnswerQuestion: Following are the issues to consider during data integration :- Correct Answer: Detection and resolution of data values , Schema integration , Redundancy Your Answer: Schema integration , Redundancy , Detection and resolution of data values Multiple Choice Single AnswerQuestion: Classification rules are extracted from Correct Answer: Decision Tree Your Answer: Decision Tree
Page 84 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple AnswerQuestion: Which of the following clustering analysis method uses multiresolution approach? Correct Answer: STING , Wave Cluster Your Answer: STING , Wave Cluster True/FalseQuestion: Lower the level of detail, finer the data granularity. Correct Answer: True Your Answer: True Select The BlankQuestion: ________ does not handle categorical attributes. Correct Answer: CURE Your Answer: CURE Multiple Choice Multiple AnswerQuestion: When you use tool for design and development, following things take place with metadata :- Correct Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process Your Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process Multiple Choice Single AnswerQuestion: Bayes Theorem is :- Correct Answer: P(H|X)=P(X|H)(P)/P(X) Your Answer: P(H|X)=P(X|H)(P)/P(X) True/FalseQuestion: Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis. Correct Answer: True Your Answer: False Multiple Choice Multiple AnswerQuestion: The dimensions of spatial data cube are :- Correct Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial Your Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial True/FalseQuestion: Easily accessible metadata is crucial for end users. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :- Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction Your Answer: Credit approval , Medical Diagnosis , Performance Prediction True/FalseQuestion: All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system. Correct Answer: True Your Answer: False Select The Blank
Page 85 of 141
SCDL – 4th Semester – Data Mining
Question: ________ databases are one of the most poplularly available and rich information repositories. Correct Answer: Relational Your Answer: Relational Multiple Choice Multiple AnswerQuestion: Advantages of Wavelet transformation for clustering are :- Correct Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast Your Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast Select The BlankQuestion: ________ is the platform for complex data transformation for the purpose of cleanse it Correct Answer: Separate optimal Platform Your Answer: Separate optimal Platform Select The BlankQuestion: ________ technique contribute to machine learning, neural network, association mining, sequential pattern mining. Correct Answer: Pattern discovery Your Answer: Pattern discovery
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Multiple Choice Single AnswerQuestion: Data matrix is :- Correct Answer: Object by variable structure Your Answer: Object by variable structure Multiple Choice Multiple AnswerQuestion: Following are the data movement options in data warehouse :- Correct Answer: Shared disk , Mass data transmission , Real time connection Your Answer: Shared disk , Mass data transmission , Real time connection Multiple Choice Single AnswerQuestion: In data reduction, the cluster representations of data are used to :- Correct Answer: Replace data Your Answer: Replace data True/FalseQuestion: Descriptive mining takes perform ingerence on current data which predictive mining characterize the general properties of data in database Correct Answer: False Your Answer: False Multiple Choice Single AnswerQuestion: For Incremental data loads the sequence is :- Correct Answer: Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing Your Answer: Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing True/FalseQuestion: COBWEB incrementally incarporates objects into classification tree. Correct Answer: True Your Answer: True
Page 86 of 141
SCDL – 4th Semester – Data Mining
True/FalseQuestion: Moving data into staging area and performing data transformation function is a part of data acquisition. Correct Answer: True Your Answer: True Select The BlankQuestion: Creating ________is violation of Normalization principles. Correct Answer: Array Your Answer: Cluster Multiple Choice Single AnswerQuestion: Which of the following method is built on Influece function? Correct Answer: DENCLUE Your Answer: STING Multiple Choice Single AnswerQuestion: Which of the following methods for regression is used on sparse data :- Correct Answer: Regression and log-linear model Your Answer: Regression and log-linear model Multiple Choice Multiple AnswerQuestion: Building blocks of Data Warehouse are :- Correct Answer: Source Data , Data Staging , Management and Control Your Answer: Source Data , Data Staging , Management and Control Multiple Choice Multiple AnswerQuestion: Metadata in a data warehouse falls into following categories :- Correct Answer: Operational Metadata , Extraction and Transformation metadata , End-user Metadata Your Answer: Operational Metadata , Extraction and Transformation metadata , End-user Metadata Multiple Choice Single AnswerQuestion: SMP stands for :- Correct Answer: Symmetric Multiprocessing Your Answer: Symmetric Multiprocessing Multiple Choice Multiple AnswerQuestion: Partitioning in physical design of data warehouse consists of :- Correct Answer: Fact tables and dimension tables , Number of partitions for each table , Criteria for dividing table Your Answer: Fact tables and dimension tables , Number of partitions for each table , Criteria for dividing table True/FalseQuestion: Data updates are common place in an operational database. Correct Answer: True Your Answer: True True/FalseQuestion: A cluster is a collection of similar data objects in same cluster and disimilar to objects in another cluster. Correct Answer: True Your Answer: True
Page 87 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple AnswerQuestion: The functional areas of metadata are :- Correct Answer: Data Acquisition , Data storage , Information delivery Your Answer: Data transformation , Data Acquisition , Information delivery Select The BlankQuestion: ________ regression involves finding the best time to fit two variables. Correct Answer: Linear Your Answer: Linear Match The FollowingQuestion Correct Answer Your AnswerAdministration Providing support for all Support for System administration DBA functionsExtensibility Hybrid Extension to OLAP Hybrid Extension to OLTP database
databasePortability Across platform Across platform Query tool APIs For tools from loading Providing support for all DBA vendors
functions Multiple Choice Single AnswerQuestion: Which of the following type of processing provides high concurrency? Correct Answer: SMP Your Answer: ccNUMA Select The BlankQuestion: Semantic integration of ________ genome database is the important task of DNA analysis. Correct Answer: Heterogeneous and distributed Your Answer: Heterogeneous and distributed True/FalseQuestion: To remove noise from data is called as Smoothing. Correct Answer: True Your Answer: True Match The FollowingQuestion Correct Answer Your AnswerData Mining Knowledge discovery Knowledge discovery Metadata Roadmap for user Roadmap for user Data storage Data management Data management Data staging Workbench for data Workbench for data Multiple Choice Multiple AnswerQuestion: Knowledge discovery process includes :- Correct Answer: Data Cleaning , Data Intergration , Data Selectin Your Answer: Data Cleaning , Data Intergration , Data Selectin Multiple Choice Multiple AnswerQuestion: Methods for outlier detection are categorised into following approaches :- Correct Answer: Statistical , Distance based , Deviation based Your Answer: Statistical , Distance based , Deviation based Multiple Choice Single AnswerQuestion: Following clustering method is classified as being agglomerative or divisive :-
Page 88 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: Grid based Your Answer: Grid based Select The BlankQuestion: In data warehouse architecture, the ________ component interleaves with and connects other components. Correct Answer: Metadata Your Answer: Metadata Multiple Choice Multiple AnswerQuestion: The ways of Intra query parallelization are :- Correct Answer: Horizontal parallelization , Vertical Parallelization , Hybrid parallelization Your Answer: Horizontal parallelization , Vertical Parallelization , Hybrid parallelization Multiple Choice Multiple AnswerQuestion: The objective for physical design of data warehouse are :- Correct Answer: Improve performance , Ensure scalability , Manage store Your Answer: Improve performance , Ensure scalability , Manage database True/FalseQuestion: Metadata is building block of data warehouse. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: What improves accuracy and speed of subsequent mining process? Correct Answer: Integration Your Answer: Regression Select The BlankQuestion: ________ are responsible for running queries and reports against data warehouse tables. Correct Answer: End users Your Answer: End users Select The BlankQuestion: For operational system, the stored data contains ________values. Correct Answer: Current data Your Answer: Current data Multiple Choice Single AnswerQuestion: Enterprise miner technique provides data mining algorithms including distinguishing feature as :- Correct Answer: Advanced Statistical and advanced visualization tool Your Answer: Robust Graphics tools Multiple Choice Multiple AnswerQuestion: Splitting of query by DBMS in intra query parallelization is for :- Correct Answer: Index read , Data read , Data joint Your Answer: Index read , Data read , Data joint Multiple Choice Single AnswerQuestion: Which of the following approach requires more computation? Correct Answer: Filter approach Your Answer: Filter approach
Page 89 of 141
SCDL – 4th Semester – Data Mining
True/FalseQuestion: Data in warehouse is primarily for query. Correct Answer: True Your Answer: False Multiple Choice Single AnswerQuestion: Simple matching approach is used for computing disimilarity between two objects for :- Correct Answer: Nominal variable Your Answer: Invariant variable Select The BlankQuestion: ________ are the inter platform devices that unable massive quantities of data to be transported from one platform to another. Correct Answer: Data ports Your Answer: Data ports Multiple Choice Multiple AnswerQuestion: Following are the types of normalization :- Correct Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling Your Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling Multiple Choice Multiple AnswerQuestion: The different definitions of metadata are :- Correct Answer: Data about data , Catalog of data , Data warehouse roadmap Your Answer: Data about data , Catalog of data , Data warehouse roadmap Select The BlankQuestion: ________ technique can be used to reduce the number of values for a given continuous attribute by dividing range of attributes into interval. Correct Answer: Descretization Your Answer: Descretization True/FalseQuestion: MDDBMS stands for - Multilevel Database Management System. Correct Answer: False Your Answer: False Multiple Choice Single AnswerQuestion: Main advantage of following which method is it's fast processing? Correct Answer: Grid based Your Answer: Grid based Select The BlankQuestion: ________ can store aggregate and detail data at varying levels of resolution or abstraction. Correct Answer: Index tree Your Answer: Index tree Select The BlankQuestion: ________ architecture is more concerned with data access than memory access. Correct Answer: MPP Your Answer: SMP
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Page 90 of 141
SCDL – 4th Semester – Data Mining
True/False Question: Metadata is building block of data warehouse. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: The Main areas of Data Warehouse are :- Correct Answer: Data acquisition , Data Storage , Information Delivery Your Answer: Data Storage , Information Delivery , Data acquisition Select The BlankQuestion: ________ is the navigational map of data warehouse. Correct Answer: End user Metadata Your Answer: End user Metadata Multiple Choice Multiple AnswerQuestion: Data mining Functionalities are :- Correct Answer: Charactrization and Discrimination , Association Analysis , Cluster Analysis Your Answer: Charactrization and Discrimination , Association Analysis , Cluster Analysis Multiple Choice Single AnswerQuestion: Which of the following option is to share data by placing data at common place :- Correct Answer: Shared disk Your Answer: Shared disk Multiple Choice Multiple AnswerQuestion: Data mining is applicable to :- Correct Answer: Relational Database , Data Warehouse , Transaction Database Your Answer: Relational Database , Data Warehouse , Transaction Database Multiple Choice Single AnswerQuestion: Which of the following approach requires more computation? Correct Answer: Filter approach Your Answer: Filter approach Match The FollowingQuestion Correct Answer Your AnswerClustering Data tuples as objects Data tuples as objects Dimension reduction Removal of irrelevant data Removal of irrelevant data Data compression More computations More computations Wrapper approach Great accuracy Great accuracy
Select The BlankQuestion: According to ________ theory database schema consist of data and patterns that are stored in database. Correct Answer: Inductive databases Your Answer: Inductive databases True/False Question: Data cubes created for varying levels of abstraction are referred as cuboids. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: The Architecture defines :- Correct Answer: Measurements , Standard , General Design
Page 91 of 141
SCDL – 4th Semester – Data Mining
Your Answer: Measurements , Standard , General Design Multiple Choice Multiple AnswerQuestion: Source Data Component may be grouped into following categories :- Correct Answer: Production Data , Internal External Data Your Answer: Production Data , Internal External Data Multiple Choice Multiple AnswerQuestion: When you use tool for design and development, following things take place with metadata :- Correct Answer: Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process Your Answer: Metadata aids in automation of data warehouse process , Metadata is no longer passive document , Metadata takes part in process True/False Question: Metadata describes all the pertinent aspects of the data in data warehouse. Correct Answer: True Your Answer: True Multiple Choice Multiple AnswerQuestion: Before moving data to data warehouse is has to go through :- Correct Answer: Transformation , Integration , Consolidation Your Answer: Transformation , Integration , Consolidation Match The FollowingQuestion Correct Answer Your AnswerDisparate data Production data Production data Non volatile data Query and analysis Query and analysis Data granularity Level of detail Level of detail Data from external External data External data source Select The BlankQuestion: ________ is the time consuming and less feasible approach for filling missing values. Correct Answer: Filling missing values manually Your Answer: Use of row mean Select The BlankQuestion: ________ is an alternative aggolomerative hierarchical clustering algorithm. Correct Answer: ROCK Your Answer: ROCK Multiple Choice Single AnswerQuestion: Which of the following is based on set of density distribution function clustering? Correct Answer: DBSCAN Your Answer: DBSCAN True/False Question: All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system. Correct Answer: True Your Answer: True Select The Blank
Page 92 of 141
SCDL – 4th Semester – Data Mining
Question: ________ component of warehouse is responsible for coordinating services and activities within the data warehouse. Correct Answer: Management and Control Your Answer: Management and Control Select The BlankQuestion: ________ technique can be used to reduce the number of values for a given continuous attribute by dividing range of attributes into interval. Correct Answer: Descretization Your Answer: Descretization Multiple Choice Single AnswerQuestion: Which technique analyze experimental data? Correct Answer: Analysis of variance Your Answer: Analysis of variance Multiple Choice Single AnswerQuestion: Classification rules are extracted from Correct Answer: Decision Tree Your Answer: Decision Tree Select The BlankQuestion: ________components consists all the different ways of making the information from the data warehouse available to the user. Correct Answer: Information Delivery Your Answer: Information Delivery True/False Question: In Linear regression data are modeled to fit a straight line. Correct Answer: True Your Answer: True Select The BlankQuestion: ________ platform is the platform on which the data warehouse DBMS runs and database exist. Correct Answer: Data storage Your Answer: Data storage Multiple Choice Single AnswerQuestion: In data reduction, the cluster representations of data are used to :- Correct Answer: Replace data Your Answer: Replace data Multiple Choice Single AnswerQuestion: The DWT ( Discret Wavlet Transform) is a :- Correct Answer: Linear single processing technique Your Answer: Linear single processing technique Multiple Choice Multiple AnswerQuestion: Substantial portion of Business metadata originates from :- Correct Answer: Textual documents , Spreadsheets , Business rules Your Answer: Textual documents , Spreadsheets , Business rules True/False Question: A distinct feature of DB Miner is its data cube based online analytical mining. Correct Answer: True
Page 93 of 141
SCDL – 4th Semester – Data Mining
Your Answer: True Multiple Choice Multiple AnswerQuestion: Financial data called for banking and financial industry are often relatively :- Correct Answer: Complete , Reliable , High Quality Your Answer: Complete , Reliable , High Quality True/False Question: Smoothing by bin means each value in bin is replaced by the mean value of the bucket. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: SMP stands for :- Correct Answer: Symmetric Multiprocessing Your Answer: Symmetric Multiprocessing Select The BlankQuestion: In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries. Correct Answer: Smoothing by bin boundaries Your Answer: Smoothing by bin boundaries Select The BlankQuestion: ________ is the method used to predict the value of response variable from one to more variables. Correct Answer: Regression Your Answer: Regression Multiple Choice Multiple AnswerQuestion: Data reduction reduces data size by :- Correct Answer: Aggregation , Eliminating redundant features Your Answer: Aggregation , Eliminating redundant features True/False Question: Sequential pattern analysis and similarity search techniques have been developed in data mining. Correct Answer: True Your Answer: True True/False Question: Lower the level of detail, finer the data granularity. Correct Answer: True Your Answer: True Select The BlankQuestion: ________ is the user who has all access privileges like system, database administrator, for table and views. Correct Answer: Security administrator Your Answer: Power user Multiple Choice Multiple AnswerQuestion: Generalized linear model includes :- Correct Answer: Logistic regression , Poisson regression Your Answer: Logistic regression , Poisson regression
Page 94 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple AnswerQuestion: The main categories of Metadata in warehouse are :- Correct Answer: Operational , Extraction and transformation Metadata , End user Metadata Your Answer: Operational , Extraction and transformation Metadata , End user Metadata Multiple Choice Single AnswerQuestion: Data migration affects performance requiring multiple blocks to be read which can be adjusted by :- Correct Answer: Block percent free Your Answer: Block percent free True/False Question: Data Integration means multiple resourses may be combined. Correct Answer: True Your Answer: True Multiple Choice Single AnswerQuestion: Data reduction by volume can be used for data representation using which type of reduction? Correct Answer: Numerosity reduction Your Answer: Numerosity reduction Multiple Choice Single AnswerQuestion: Effect of one attibute value on a given class is independent of values of other attibute is called Correct Answer: Value independence Your Answer: Attirbute conditional independence Multiple Choice Single AnswerQuestion: Which of the following technique involves placing and managing related units of data in same physical block of storage Correct Answer: Clustering Your Answer: Clustering
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Multiple Choice Multiple AnswerQuestion: Data mining is applicable to :-Correct Answer: Transaction Database , Relational Database , Data Warehouse Your Answer: Transaction Database , Relational Database , Data Warehouse Select The BlankQuestion: ________ does not handle categorical attributes.Correct Answer: CUREYour Answer: Chameleon Multiple Choice Single AnswerQuestion: Main advantage of following which method is it's fast processing?Correct Answer: Grid basedYour Answer: Density based Select The Blank
Page 95 of 141
SCDL – 4th Semester – Data Mining
Question: When data block contains excessive amount of free space, performance ________Correct Answer: DegeneratesYour Answer: Degenerates Select The BlankQuestion: ________components consists all the different ways of making the information from the data warehouse available to the user.Correct Answer: Information DeliveryYour Answer: Information Delivery Multiple Choice Single AnswerQuestion: SMP stands for :-Correct Answer: Symmetric MultiprocessingYour Answer: Symmetric Multiprocessing Multiple Choice Multiple AnswerQuestion: The need for metadata is for :-Correct Answer: Using data warehouse , Building data warehouse , Administration of warehouse Your Answer: Building data warehouse , Administration of warehouse , Accessing data in warehouse Select The BlankQuestion: ________ are responsible for running queries and reports against data warehouse tables.Correct Answer: End usersYour Answer: Query tool specialist
Multiple Choice Multiple AnswerQuestion: Distinguishing characteristics of data warehouse architecture are :-Correct Answer: Different Objective Scope , Data Content , Flexible and Dynamic Your Answer: Data Content , Complete Analysis and Quick Response , Flexible and Dynamic Multiple Choice Single AnswerQuestion: Redundancies can be deleted by :-Correct Answer: Co-relational analysisYour Answer: Relational analysis True/FalseQuestion: Moving data into staging area and performing data transformation function is a part of data acquisition.Correct Answer: TrueYour Answer: True
Match The FollowingQuestion Correct Answer Your AnswerLoad Image To correspond to target files Offline data warehouseConstructive merge New record supercedes Populating data warehouse table first
timeInitial Load Populating data warehouse Applying data
table first timeIncremental Load Applying ongoing changes Applying ongoing changes True/FalseQuestion: COBWEB incrementally incarporates objects into classification tree.Correct Answer: True
Page 96 of 141
SCDL – 4th Semester – Data Mining
Your Answer: True Multiple Choice Multiple AnswerQuestion: Building blocks of Data Warehouse are :-Correct Answer: Source Data , Data Staging , Management and Control Your Answer: Source Data , Data Staging , Management and Control True/FalseQuestion: A process of grouping a set of physical or abstract objects into classes of similar objects is called clusieringCorrect Answer: TrueYour Answer: True Multiple Choice Multiple AnswerQuestion: Application server serves following purposes :-Correct Answer: To run middleware and establish connectivity , To execute management and control software , To manage metadata Your Answer: To run middleware and establish connectivity , To execute management and control software , To run OLTP application True/FalseQuestion: Data mining often requires data integration.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: The option "capture in source application technique of data extraction degrades performance of source application because :-Correct Answer: Additional processing needsYour Answer: Additional processing needs Multiple Choice Multiple AnswerQuestion: The main categories of Metadata in warehouse are :-Correct Answer: Operational , Extraction and transformation Metadata , End user Metadata Your Answer: Operational , Execution and Transformation Metadata , End user Metadata Multiple Choice Single AnswerQuestion: Which of the following method creates copies of data in distributed environment?Correct Answer: ReplicationYour Answer: Replication Multiple Choice Multiple AnswerQuestion: Common areas of application for mixed effect model includes :-Correct Answer: Multiple data , Repeated measures data , Block designs Your Answer: Multiple data , Repeated measures data , Block designs Multiple Choice Multiple AnswerQuestion: Following are the issues to consider during data integration :-Correct Answer: Detection and resolution of data values , Schema integration , Redundancy Your Answer: Schema integration , Redundancy , Inconsistency True/FalseQuestion: Smoothing by bin means each value in bin is replaced by the mean value of the bucket.Correct Answer: TrueYour Answer: True
Page 97 of 141
SCDL – 4th Semester – Data Mining
Select The BlankQuestion: In ________ duplicate sub trees exist within the tree.Correct Answer: RepetitionYour Answer: Replication Multiple Choice Multiple AnswerQuestion: The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :-Correct Answer: Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool Your Answer: Linkage analysis tool , Complexity definition tool , Sequential pattern analysis tool Select The BlankQuestion: According to ________ theory database schema consist of data and patterns that are stored in database.Correct Answer: Inductive databasesYour Answer: Data compression Multiple Choice Single AnswerQuestion: Which of the following methods for regression is used on sparse data :-Correct Answer: Regression and log-linear modelYour Answer: Regression and log-linear model Multiple Choice Single AnswerQuestion: The big difference between data warehouse and any operational system is its :-Correct Answer: UsageYour Answer: Structure Multiple Choice Single AnswerQuestion: In intermediate data extraction data capture through transaction log uses transaction from :-Correct Answer: Recovery from failureYour Answer: Logs of successful transaction Multiple Choice Multiple AnswerQuestion: SMP provides the features like :-Correct Answer: Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus , Each node has access to common set of disks Your Answer: Controllers which are accessible to all processors , Each node has access to common set of disks , It is cluster of nodes Match The FollowingQuestion Correct Answer Your AnswerData producer Responsible for data quality Foreign key preservedDomain values Prevalent problem Primary key introducedUpdate security Prevention of unauthorized Prevention of unauthorized
updates updatesReferential integrity Foreign key preserved Responsible for data quality True/FalseQuestion: Management architectural component manages and controls data acquisition functions.Correct Answer: TrueYour Answer: False
Page 98 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single AnswerQuestion: EIS stands for :-Correct Answer: Executive Information SystemYour Answer: Extracted Integrated System True/FalseQuestion: NUMA provides better scalability than SMP.Correct Answer: TrueYour Answer: True Select The BlankQuestion: ________ architecture is more concerned with data access than memory access.Correct Answer: MPPYour Answer: MPP Select The BlankQuestion: Human being have around ________ gene.Correct Answer: 100000Your Answer: 1000000 Select The BlankQuestion: With the widespread option of ________ real-time connection is viable for data warehouse.Correct Answer: TCP/IPYour Answer: TCP/IP True/FalseQuestion: In Linear regression data are modeled to fit a straight line.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: Development and deployment of your data warehouse is joint effort between :-Correct Answer: IT staff and user representativesYour Answer: IT staff and developer True/FalseQuestion: Lower the level of detail, finer the data granularity.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: Which of the following technique involves placing and managing related units of data in same physical block of storageCorrect Answer: ClusteringYour Answer: Indexing True/FalseQuestion: Loan payment prediction and customer credit analysis are critical to business of bank.Correct Answer: TrueYour Answer: True Select The BlankQuestion: ________ is the platform for complex data transformation for the purpose of cleanse itCorrect Answer: Separate optimal PlatformYour Answer: Legacy platform
Page 99 of 141
SCDL – 4th Semester – Data Mining
Select The BlankQuestion: ________ clustering method follows statistical and neural network approach.Correct Answer: Model basedYour Answer: Hierarchical Method Multiple Choice Multiple AnswerQuestion: DNA sequences are comprised of :-Correct Answer: Adenine , Gaunine , Thymine Your Answer: Cytocine , Gaunine , Thymine Multiple Choice Multiple AnswerQuestion: Business metadata is useful for :-Correct Answer: Providing support to end users , For external view of data , Provides technical support to search data Your Answer: Providing support to end users , For external view of data , Provides technical support to search data , Helps in searching data Multiple Choice Single AnswerQuestion: Following clustering method is classified as being agglomerative or divisive :-Correct Answer: Grid basedYour Answer: Grid based
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Multiple Choice Multiple AnswerQuestion: Metadata in a data warehouse falls into following categories :-Correct Answer: End-user Metadata , Operational Metadata , Extraction and Transformation metadata Your Answer: End-user Metadata , Operational Metadata , Extraction and Transformation metadata Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :-Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction Your Answer: Credit approval , Performance Prediction , Selective Marketing Multiple Choice Single AnswerQuestion: Data matrix is :-Correct Answer: Object by variable structureYour Answer: Two mode matrix Match The FollowingQuestion Correct Answer Your AnswerDisparate data Production data Internal dataNon volatile data Query and analysis Production dataData granularity Level of detail Archive dataData from external source External data Query and analysis
Multiple Choice Single AnswerQuestion: Bitmapped indexes are more suitable for data warehouse environment than for an OLTP systemCorrect Answer: Bitmapped indexYour Answer: B-Tree index
Page 100 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple AnswerQuestion: Building blocks of Data Warehouse are :-Correct Answer: Source Data , Data Staging , Management and Control Your Answer: Source Data , Data Staging , Management and Control Multiple Choice Single AnswerQuestion: Queries run faster to find exact match using which type of indexing?Correct Answer: Clustered indexYour Answer: Clustered index True/FalseQuestion: In Purning method, postpruning requires more computation than prepruning yet generally leads to more reliable.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: Which of the following option is to share data by placing data at common place :-Correct Answer: Shared diskYour Answer: Mass data transmission Multiple Choice Single AnswerQuestion: The category in which the value of each attribute is preserved as status every time a change occurs is :-Correct Answer: Periodic statusYour Answer: Periodic status True/FalseQuestion: In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by rectanglesCorrect Answer: FalseYour Answer: False True/FalseQuestion: Intelligent miner is an IBM data mining product.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: Which from the following are special programs that are stored on database and fired when certain predefined action occurs?Correct Answer: TriggersYour Answer: Triggers Multiple Choice Single AnswerQuestion: Attribute construction is the part of :-Correct Answer: TransformationYour Answer: Transformation True/FalseQuestion: Metadata acts like a nerve center.Correct Answer: TrueYour Answer: True Multiple Choice Multiple Answer
Page 101 of 141
SCDL – 4th Semester – Data Mining
Question: Data reduction includes :-Correct Answer: Single value decomposition , Wavelets , Regression Your Answer: Wavelets , Regression True/FalseQuestion: Data cleansing means removing noisy and inconsistent data.Correct Answer: TrueYour Answer: True True/FalseQuestion: Data in warehouse is primarily for query.Correct Answer: TrueYour Answer: True Multiple Choice Multiple AnswerQuestion: Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-Correct Answer: Data Cleaning , Relevance Analysis , Data Transformation Your Answer: Data Cleaning , Data Transformation Multiple Choice Multiple AnswerQuestion: Financial data called for banking and financial industry are often relatively :-Correct Answer: Complete , Reliable , High Quality Your Answer: Complete , Reliable , Correct Multiple Choice Single AnswerQuestion: Which of the option is not considered as the major function needed to get data ready?Correct Answer: Storing dataYour Answer: Extracting data Select The BlankQuestion: ________ technique can be used to reduce the number of values for a given continuous attribute by dividing range of attributes into interval.Correct Answer: DescretizationYour Answer: Reduction Multiple Choice Single AnswerQuestion: Simple matching approach is used for computing disimilarity between two objects for :-Correct Answer: Nominal variableYour Answer: Invariant variable Multiple Choice Multiple AnswerQuestion: The ways of Intra query parallelization are :-Correct Answer: Horizontal parallelization , Vertical Parallelization , Hybrid parallelization Your Answer: Horizontal parallelization , Hybrid parallelization , Homogenous parallelization True/FalseQuestion: Legacy data resides on Hierarchical or Network database.Correct Answer: TrueYour Answer: True Select The BlankQuestion: Data cleansing and ________ methods of data mining helps in integration of genetic data and construction of warehouse for genetic data analysis.Correct Answer: IntegrationYour Answer: Integration
Page 102 of 141
SCDL – 4th Semester – Data Mining
Select The BlankQuestion: ________ dimension of database in which primitive level data are spatial but generalization becomes non spatial.Correct Answer: Spatial to non spatialYour Answer: Spatial to non spatial Select The BlankQuestion: ________ can store aggregate and detail data at varying levels of resolution or abstraction.Correct Answer: Index treeYour Answer: Index tree Multiple Choice Multiple AnswerQuestion: Following factors play important role in financial analysis :-Correct Answer: Data warehouse , Data cubes , Outliner analysis Your Answer: Data warehouse , Data cubes , Outliner analysis Multiple Choice Multiple AnswerQuestion: Following are the types of normalization :-Correct Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling Your Answer: Min-Max Normalization , Normalization by scaling Select The BlankQuestion: ________ are responsible for running queries and reports against data warehouse tables.Correct Answer: End usersYour Answer: End users Multiple Choice Single AnswerQuestion: Which of the following approach requires more computation?Correct Answer: Filter approachYour Answer: Wrapper approach Select The BlankQuestion: When data block contains excessive amount of free space, performance ________Correct Answer: DegeneratesYour Answer: Degenerates Multiple Choice Single AnswerQuestion: Which of the following type of processing provides high concurrency?Correct Answer: SMPYour Answer: MPP Select The BlankQuestion: ________ option of warehouse architecture provides incremental growth.Correct Answer: ClusterYour Answer: Cluster Match The FollowingQuestion Correct Answer Your AnswerConstructive merge New record supercedes New record supercedesInitial Load Populating data warehouse Populating data warehouse
table first time table first timeIncremental Load Applying ongoing changes Applying ongoing changesLoad Image To correspond to target files To correspond to target files
Page 103 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple AnswerQuestion: Data cleansing routines work to clean the data by :-Correct Answer: Filling missing values , Smoothing noisy data Your Answer: Filling missing values , Smoothing noisy data , Resolving inconsistency True/FalseQuestion: From a Dataware house perspective data mining canbe viewed as an advanced stage of Online Analytical Programming.Correct Answer: TrueYour Answer: True Select The BlankQuestion: ________ platform is the platform on which the data warehouse DBMS runs and database exist.Correct Answer: Data storageYour Answer: Data storage Multiple Choice Multiple AnswerQuestion: The smoothing techniques are :-Correct Answer: Binning , Clustering , Regression Your Answer: Clustering , Regression , Insertion True/FalseQuestion: The elements of warehouse infrastructure are classified into operational and physical infrastructure.Correct Answer: TrueYour Answer: True Select The BlankQuestion: It is good practice to drop ________ before initial load.Correct Answer: IndexYour Answer: Splitting Select The BlankQuestion: ________ is an alternative aggolomerative hierarchical clustering algorithm.Correct Answer: ROCKYour Answer: CURE Select The BlankQuestion: Most of DBMS have ________ index techniques as default index techniques.Correct Answer: B-TreeYour Answer: B-Tree True/FalseQuestion: A distinguishing feature of Clementine is its object oriented extended module interface.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: Effect of one attibute value on a given class is independent of values of other attibute is calledCorrect Answer: Value independenceYour Answer: Value independence
Page 104 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple AnswerQuestion: The information delivery methods from data warehouse are :-Correct Answer: Complex queries , MD Analysis , Statistical Analysis
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Multiple Choice Single AnswerQuestion: Capture at data source and that's why this method is quite reliable :-Correct Answer: Capture by database TriggersYour Answer: Capture in source application Multiple Choice Single AnswerQuestion: Association rules mining is based on :-Correct Answer: Clustering and Employing rules for classificationYour Answer: Rules for classification Select The BlankQuestion: A web server usually registers ________ entry for every access of a web pageCorrect Answer: WeblogYour Answer: Weblog Select The BlankQuestion: In data warehouse architecture, the ________ component interleaves with and connects other components.Correct Answer: MetadataYour Answer: Metadata True/FalseQuestion: To remove noise from data is called as Smoothing.Correct Answer: TrueYour Answer: True Select The BlankQuestion: Semantic integration of ________ genome database is the important task of DNA analysis.Correct Answer: Heterogeneous and distributedYour Answer: Homogenous and stagnant True/FalseQuestion: Moving data into staging area and performing data transformation function is a part of data acquisition.Correct Answer: TrueYour Answer: True Select The BlankQuestion: ________ does not handle categorical attributes.Correct Answer: CUREYour Answer: CURE True/FalseQuestion: Tools perform major functions in data warehouse environment.Correct Answer: TrueYour Answer: True
Page 105 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple AnswerQuestion: Common areas of application for mixed effect model includes :-Correct Answer: Multiple data , Repeated measures data , Block designs Your Answer: Multiple data , Dimensional data , Block designs Multiple Choice Single AnswerQuestion: Bitmap index takes significantly less space than which type of index?Correct Answer: B-Tree indexYour Answer: Clustered index Multiple Choice Multiple AnswerQuestion: Data processing is done for :-Correct Answer: Improving the efficiency , Ease of mining Your Answer: Improving the efficiency , Removing redundancy , Removing complexity Select The BlankQuestion: ________ function of data staging component involves many forms of combining pieces of data from different sources.Correct Answer: Data TransformationYour Answer: Data Transformation Multiple Choice Multiple AnswerQuestion: Mining values can be removed by :-Correct Answer: Filling values manually , Use of global constant , Use of attribute mean Your Answer: Filling values manually , Use of global constant , Use of row mean Multiple Choice Multiple AnswerQuestion: The dimensions of spatial data cube are :-Correct Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial Your Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial Select The BlankQuestion: In ________ duplicate sub trees exist within the tree.Correct Answer: RepetitionYour Answer: Replication Select The BlankQuestion: ________ are the inter platform devices that unable massive quantities of data to be transported from one platform to another.Correct Answer: Data portsYour Answer: Data cubes Match The FollowingQuestion Correct Answer Your AnswerData loading tool Primary key generation Formulating and running queriesData modeling tool Reverse Engineering capabilities Primary key generationData Extraction tool Bulk extraction for full refresh Bulk extraction for full
refreshData transformation toolDefault values Formulating and running queries
Select The BlankQuestion: Most of the warehouses employ ________ database Management System.Correct Answer: RelationalYour Answer: Relational Multiple Choice Multiple Answer
Page 106 of 141
SCDL – 4th Semester – Data Mining
Question: Metadata types can be classified as :-Correct Answer: Business metadata , Technical metadata Your Answer: Business metadata , Technical metadata , Logical metadata True/FalseQuestion: COBWEB is an extension of CLASSIT for incremental clustering of contineous data.Correct Answer: FalseYour Answer: True Multiple Choice Single AnswerQuestion: Which type of analysis of DNA facilitates discovery of group of genes and study of interaction and relationship between them?Correct Answer: Association analysisYour Answer: Generic data analysis Multiple Choice Multiple AnswerQuestion: Following are the issues to consider during data integration :-Correct Answer: Schema integration , Redundancy , Detection and resolution of data values Your Answer: Schema integration , Redundancy , Detection and resolution of data values Multiple Choice Single AnswerQuestion: Data migration affects performance requiring multiple blocks to be read which can be adjusted by :-Correct Answer: Block percent freeYour Answer: Block percent vacant Multiple Choice Multiple AnswerQuestion: Normalization improves :-Correct Answer: Efficiency , Accuracy Your Answer: Efficiency , Accuracy True/FalseQuestion: Smoothing by bin means each value in bin is replaced by the mean value of the bucket.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: In intermediate data extraction data capture through transaction log uses transaction from :-Correct Answer: Recovery from failureYour Answer: All Transaction Select The BlankQuestion: Indexed ________ engines search index,web pages and build huge keyword based indices which help to search sets of web pages containing certain keywordsCorrect Answer: Web SearchYour Answer: Web Search Multiple Choice Single AnswerQuestion: The first step of attibute oriented induction is :-Correct Answer: Data focusingYour Answer: Data Collection Multiple Choice Single Answer
Page 107 of 141
SCDL – 4th Semester – Data Mining
Question: Enterprise miner technique provides data mining algorithms including distinguishing feature as :-Correct Answer: Advanced Statistical and advanced visualization toolYour Answer: Robust Graphics tools Select The BlankQuestion: ________ is density based clustering method which computes on augumented clustering ordering for automic ordering for automatic and interactive cluster analysisCorrect Answer: DBSCANYour Answer: Hierachical True/FalseQuestion: A process of grouping a set of physical or abstract objects into classes of similar objects is called clusieringCorrect Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: Grouped data can be analyzed with the technique :-Correct Answer: Mixed effect modelYour Answer: Regression Multiple Choice Multiple AnswerQuestion: Which of the following clustering analysis method uses multiresolution approach?Correct Answer: STING , Wave Cluster Your Answer: STING , Only Wave Cluster True/FalseQuestion: COBWEB is a method of incremental conceptual clustering.Correct Answer: TrueYour Answer: True Multiple Choice Multiple AnswerQuestion: Source Data Component may be grouped into following categories :-Correct Answer: Production Data , Internal External Data Your Answer: Production Data , Internal External Data , Non Analyzed data Multiple Choice Single AnswerQuestion: Which type of indexing do not work with data whose selectivity is low :-Correct Answer: B-Tree indexYour Answer: B-Tree index True/FalseQuestion: Easily accessible metadata is crucial for end users.Correct Answer: TrueYour Answer: False Match The FollowingQuestion Correct Answer Your AnswerClementine Integral solutions SASIntelligent miner IBM IBMEnterprise miner SAS DB miner technologyMineset Silicon Graphics Integral solutions Multiple Choice Single AnswerQuestion: Data can be smoothed by filling the data to function such as :-
Page 108 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: RegressionYour Answer: Clustering True/FalseQuestion: Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.Correct Answer: TrueYour Answer: True Multiple Choice Multiple AnswerQuestion: The need for metadata is for :-Correct Answer: Using data warehouse , Building data warehouse , Administration of warehouse Your Answer: Using data warehouse , Building data warehouse , Administration of warehouse Multiple Choice Multiple AnswerQuestion: The Architecture defines :-Correct Answer: Measurements , Standard , General Design Your Answer: Measurements , General Design , Standard Techniques Multiple Choice Multiple AnswerQuestion: Following are the theories for the basis of data mining :-Correct Answer: Pattern discovery , Probability theory , Microeconomic view Your Answer: Pattern discovery , Probability theory , Macroeconomic view Select The BlankQuestion: In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.Correct Answer: Smoothing by bin boundariesYour Answer: Smoothing by bin boundaries True/FalseQuestion: Data Integration means multiple resourses may be combined.Correct Answer: TrueYour Answer: True Multiple Choice Single AnswerQuestion: Which of the following function involves data cleaning, data standardizing and summarizing?Correct Answer: Transforming dataYour Answer: Transforming data
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Select The BlankQuestion: For operational system, the stored data contains ________values.Correct Answer: Current dataYour Answer: Current data Select The BlankQuestion: ________ is the user who has system access privileges but no database administration privileges as well as not for table and views.Correct Answer: Network administratorYour Answer: Security administrator
Page 109 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single AnswerQuestion: Selection of which part of data warehouse hardware is ' Bit your bottom dollar'?Correct Answer: Server hardwareYour Answer: Workstation hardware Multiple Choice Single AnswerQuestion: The Clustering method DBSCAN stands for :-Correct Answer: Desity Based Spatial clustering of Application with NoiseYour Answer: Desity Based Spatial clustering of Application with Noise Multiple Choice Single AnswerQuestion: Which of the option is not considered as the major function needed to get data ready?Correct Answer: Storing dataYour Answer: Storing data Multiple Choice Single AnswerQuestion: Which from the following are special programs that are stored on database and fired when certain predefined action occurs?Correct Answer: TriggersYour Answer: Triggers Multiple Choice Multiple AnswerQuestion: User must have proper access to metadata for performing responsibilities of :-Correct Answer: Design , Administration Your Answer: Administration , Management True/FalseQuestion: Architecture comes first, tools follows it.Correct Answer: TrueYour Answer: True True/FalseQuestion: In the data acquisition area, the data flow begins at the data sources and pauses at staging area.Correct Answer: TrueYour Answer: False Multiple Choice Single AnswerQuestion: OPTICS regarding clustering stands for :-Correct Answer: Ordering Points to identify the clustering StructureYour Answer: Ordering Points to identify the clustering Structure Multiple Choice Multiple AnswerQuestion: In data storage area metadata recorded by processes is used for :-Correct Answer: Users , Development , Administration Your Answer: Development , Administration Multiple Choice Multiple AnswerQuestion: Data reduction reduces data size by :-Correct Answer: Aggregation , Eliminating redundant features Your Answer: Aggregation , Eliminating redundant features Multiple Choice Single AnswerQuestion: Which of the following is based on set of density distribution function clustering?Correct Answer: DBSCAN
Page 110 of 141
SCDL – 4th Semester – Data Mining
Your Answer: DBSCAN True/FalseQuestion: A distinct feature of DB Miner is its data cube based online analytical mining.Correct Answer: TrueYour Answer: True True/FalseQuestion: Metadata describes all the pertinent aspects of the data in data warehouse.Correct Answer: TrueYour Answer: True Match The FollowingQuestion Correct Answer Your AnswerExtraction is manual/Tool based Method of extraction Method of extractionIdentify source application Source identification Source identificationDenote time window Time window Time windowHandling unextractable input records Exception handling Exception handling Multiple Choice Single AnswerQuestion: The stored values of an attribute represents the value of attribute at this moment of time is :-Correct Answer: Current valueYour Answer: Current attribute True/FalseQuestion: The Structure that brings all the components together is known as Architecture.Correct Answer: TrueYour Answer: True Select The BlankQuestion: ________ is the navigational map of data warehouse.Correct Answer: End user MetadataYour Answer: End user Metadata Multiple Choice Single AnswerQuestion: Simple matching approach is used for computing disimilarity between two objects for :-Correct Answer: Nominal variableYour Answer: Nominal variable Multiple Choice Multiple AnswerQuestion: Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-Correct Answer: Data Cleaning , Relevance Analysis , Data Transformation Your Answer: Data Cleaning , Relevance Analysis Multiple Choice Single AnswerQuestion: Which of the following clustering algorithm integrates density based and grid based clustering?Correct Answer: CLQUEYour Answer: STING True/FalseQuestion: Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.Correct Answer: True
Page 111 of 141
SCDL – 4th Semester – Data Mining
Your Answer: True Select The BlankQuestion: ________ is the time consuming and less feasible approach for filling missing values.Correct Answer: Filling missing values manuallyYour Answer: Filling missing values manually Match The FollowingQuestion Correct Answer Your AnswerDisparate data Production data Production dataNon volatile data Query and analysis Query and analysisData granularity Level of detail Level of detailData from external source External data External data True/FalseQuestion: Sequential pattern analysis and similarity search techniques have been developed in data mining.Correct Answer: TrueYour Answer: True Multiple Choice Multiple AnswerQuestion: Data processing is done for :-Correct Answer: Improving the efficiency , Ease of mining Your Answer: Improving the efficiency , Ease of mining Multiple Choice Multiple AnswerQuestion: The smoothing techniques are :-Correct Answer: Binning , Clustering , Regression Your Answer: Binning , Clustering , Regression Multiple Choice Single AnswerQuestion: Many methods for data smoothing are also methods for data reduction involving :-Correct Answer: DiscretizationYour Answer: Discretization Multiple Choice Single AnswerQuestion: In data reduction, the cluster representations of data are used to :-Correct Answer: Replace dataYour Answer: Represent actual data True/FalseQuestion: In Purning method, postpruning requires more computation than prepruning yet generally leads to more reliable.Correct Answer: TrueYour Answer: True Select The BlankQuestion: ________ component of warehouse is responsible for coordinating services and activities within the data warehouse.Correct Answer: Management and ControlYour Answer: Management and Control Select The BlankQuestion: ________ function of data staging component involves many forms of combining pieces of data from different sources.Correct Answer: Data Transformation
Page 112 of 141
SCDL – 4th Semester – Data Mining
Your Answer: Data Transformation Multiple Choice Single AnswerQuestion: Which type of following clustering computes augumented cluster ordering?Correct Answer: OPTICSYour Answer: CLQUE True/FalseQuestion: Data cleansing means removing noisy and inconsistent data.Correct Answer: TrueYour Answer: True Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :-Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction Your Answer: Credit approval , Medical Diagnosis , Performance Prediction Select The BlankQuestion: Creating ________is violation of Normalization principles.Correct Answer: ArrayYour Answer: Structure Multiple Choice Multiple AnswerQuestion: The areas of classification for metadata are :-Correct Answer: Development/usage , Technical/business , BackRoom/Front Room Your Answer: Development/usage , BackRoom/Front Room , Administration Select The BlankQuestion: ________ databases are one of the most poplularly available and rich information repositories.Correct Answer: RelationalYour Answer: Relational Multiple Choice Multiple AnswerQuestion: The ways of Intra query parallelization are :-Correct Answer: Horizontal parallelization , Vertical Parallelization , Hybrid parallelization Your Answer: Horizontal parallelization , Vertical Parallelization , Hybrid parallelization True/FalseQuestion: Data Mining refers to extracting knowledge from larger amount of data.Correct Answer: TrueYour Answer: True Multiple Choice Multiple AnswerQuestion: Data base miner provides multiple data mining algorithms including :-Correct Answer: Discovery driven OLAP analysis , Association , Classification Your Answer: Association , Classification , Regression Multiple Choice Multiple AnswerQuestion: Data transformation includes :-Correct Answer: Smoothing , Aggregation , Generalization Your Answer: Smoothing , Aggregation , Generalization Select The BlankQuestion: ________ includes Normalization and Aggregation as data preprocessing procedures.Correct Answer: Data transformation
Page 113 of 141
SCDL – 4th Semester – Data Mining
Your Answer: Data transformation Multiple Choice Single AnswerQuestion: Association rules mining is based on :-Correct Answer: Clustering and Employing rules for classificationYour Answer: Clustering and Employing rules for classification Select The BlankQuestion: Semantic integration of ________ genome database is the important task of DNA analysis.Correct Answer: Heterogeneous and distributedYour Answer: Heterogeneous and distributed Select The BlankQuestion: ________ regression involves finding the best time to fit two variables.Correct Answer: LinearYour Answer: Linear
LIST OF ATTEMPTED QUESTIONS AND ANSWERS True/False Question: Data cubes created for varying levels of abstraction are referred as cuboids. Correct Answer: True Your Answer: True True/False Question: Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis. Correct Answer: True Your Answer: True Select The Blank Question: ________ pilot proves validity of data warehousing concept to users and top management. Correct Answer: Proof of concept Your Answer: User tool appreciation Multiple Choice Multiple Answer Question: Mining values can be removed by :- Correct Answer: Filling values manually , Use of global constant , Use of attribute mean Your Answer: Filling values manually , Use of global constant , Use of attribute mean Multiple Choice Single Answer Question: Which of the following type of processing provides high concurrency? Correct Answer: SMP Your Answer: SMP True/False Question: Lower the level of detail, finer the data granularity. Correct Answer: True Your Answer: True Multiple Choice Single Answer
Page 114 of 141
SCDL – 4th Semester – Data Mining
Question: Effect of one attibute value on a given class is independent of values of other attibute is called Correct Answer: Value independence Your Answer: Value independence Select The Blank Question: According to ________ theory database schema consist of data and patterns that are stored in database. Correct Answer: Inductive databases Your Answer: Inductive databases True/False Question: A cluster is a collection of similar data objects in same cluster and disimilar to objects in another cluster. Correct Answer: True Your Answer: True Multiple Choice Multiple Answer Question: Warehouse Operational infrastructure is to support each architecture component consists of :- Correct Answer: People , Procedures , Management software Your Answer: People , Procedures , Management software Multiple Choice Multiple Answer Question: Time variant nature of the data in data warehouse :- Correct Answer: Allows for analysis of the past , Relate information to the present , Enables forecasts for the future Your Answer: Allows for analysis of the past , Relate information to the present , Enables forecasts for the future Multiple Choice Multiple Answer Question: Methods for outlier detection are categorised into following approaches :- Correct Answer: Statistical , Distance based , Deviation based Your Answer: Distance based , Deviation based , Diversion based Select The Blank Question: ________ regression involves finding the best time to fit two variables. Correct Answer: Linear Your Answer: Linear Multiple Choice Single Answer Question: Association rules mining is based on :- Correct Answer: Clustering and Employing rules for classification Your Answer: Clustering and Employing rules for classification True/False Question: Smoothing by bin means each value in bin is replaced by the mean value of the bucket. Correct Answer: True Your Answer: True True/False Question: Metadata describes all the pertinent aspects of the data in data warehouse. Correct Answer: True Your Answer: True
Page 115 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple Answer Question: Following are the theories for the basis of data mining :- Correct Answer: Pattern discovery , Probability theory , Microeconomic view Your Answer: Microeconomic view , Pattern discovery , Probability theory Multiple Choice Single Answer Question: Which technique is used to predict categorical response variable? Correct Answer: Discriminant analysis Your Answer: Analysis of variance Multiple Choice Single Answer Question: EIS stands for :- Correct Answer: Executive Information System Your Answer: Executive Information System Match The Following Question Correct Answer Your AnswerIntegration Data merging from multiple sources Data merging from multiple sources Binning Sorted, neighbourhood data Sorted, neighbourhood data Clustering Similar values Similar values Regression Filtering of data Filtering of data Multiple Choice Single Answer Question: The DWT ( Discret Wavlet Transform) is a :- Correct Answer: Linear single processing technique Your Answer: Linear single processing technique True/False Question: Data mining often requires data integration. Correct Answer: True Your Answer: True Multiple Choice Single Answer Question: Which is the typical example of Grid based clustering method Correct Answer: STING Your Answer: DBSCAN Multiple Choice Single Answer Question: Classification rules are extracted from Correct Answer: Decision Tree Your Answer: Decision Tree Multiple Choice Multiple Answer Question: For processing metadata in informal delivery area, data can be referred back for :- Correct Answer: Source data configuration , Data structure , Data transformation Your Answer: Source data configuration , Data structure , Data transformation Match The Following Question Correct Answer Your AnswerConstructive merge New record supercedes New record supercedes Initial Load Populating data warehouse Populating data warehouse table first
table first time time Incremental Load Applying ongoing changes Applying ongoing changes Load Image To correspond to target files To correspond to target files Select The Blank
Page 116 of 141
SCDL – 4th Semester – Data Mining
Question: ________ is the clustering method which encounters difficultes regarding the selection of merge/split points Correct Answer: Hierachical Your Answer: Hierachical Multiple Choice Multiple Answer Question: Substantial portion of Business metadata originates from :- Correct Answer: Textual documents , Spreadsheets , Business rules Your Answer: Textual documents , Spreadsheets , Business rules True/False Question: In Purning method, postpruning requires more computation than prepruning yet generally leads to more reliable. Correct Answer: True Your Answer: True Select The Blank Question: Human being have around ________ gene. Correct Answer: 100000 Your Answer: 100000 Multiple Choice Single Answer Question: Which of the following type executes query operations in pipeline manner? Correct Answer: Vertical parallelism Your Answer: Vertical parallelism Select The Blank Question: In ________ duplicate sub trees exist within the tree. Correct Answer: Repetition Your Answer: Repetition Multiple Choice Single Answer Question: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :- Correct Answer: Huge size of data Your Answer: Complexity in data Multiple Choice Single Answer Question: The technique of data clustering facilitates :- Correct Answer: Serial access Your Answer: Random access Multiple Choice Multiple Answer Question: Before moving data to data warehouse is has to go through :- Correct Answer: Transformation , Integration , Consolidation Your Answer: Integration , Summarization , Consolidation Multiple Choice Single Answer Question: Bayes Theorem is :- Correct Answer: P(H|X)=P(X|H)(P)/P(X) Your Answer: P(H|X)=P(X)(PH)/P(X|H) True/False Question: MDDBMS stands for - Multilevel Database Management System. Correct Answer: False Your Answer: False
Page 117 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple Answer Question: DNA sequences are comprised of :- Correct Answer: Adenine , Gaunine , Thymine Your Answer: Adenine , Cytocine , Gaunine , Thymine Multiple Choice Multiple Answer Question: Financial data called for banking and financial industry are often relatively :- Correct Answer: Complete , Reliable , High Quality Your Answer: Complete , Reliable , High Quality Multiple Choice Single Answer Question: Deviation based outlier detection identifes outliers by :- Correct Answer: Examining character of objects in groups Your Answer: Examining distance between objects Multiple Choice Multiple Answer Question: The functions of data acquisition are :- Correct Answer: Data Extraction , Data Transformation Your Answer: Data Extraction , Data Transformation , Data cleansing , Data storing Select The Blank Question: ________ databases are one of the most poplularly available and rich information repositories. Correct Answer: Relational Your Answer: Relational Multiple Choice Single Answer Question: A Wavelet transformation is :- Correct Answer: Single processing Technique that decomposes signals into different frequency subbands Your Answer: Single processing Technique that composes signals into different frequency subbands Select The Blank Question: Creating ________is violation of Normalization principles. Correct Answer: Array Your Answer: Array Select The Blank Question: ________ method of regression is useful when errors fails to satisfy normal conditions. Correct Answer: Robust Your Answer: Robust True/False Question: Sequential pattern analysis and similarity search techniques have been developed in data mining. Correct Answer: True Your Answer: True Multiple Choice Single Answer Question: SMP stands for :- Correct Answer: Symmetric Multiprocessing Your Answer: Symmetric Multiprocessing
Page 118 of 141
SCDL – 4th Semester – Data Mining
LIST OF ATTEMPTED QUESTIONS AND ANSWERS sheetu 2 Multiple Choice Multiple Answer Question: Data Mining means :- Correct Answer: Knowledge mining from database , Data /Pattern analysis , Data Archelogy Your Answer: Data Archelogy , Knowledge mining from database , Data /Pattern analysis Select The Blank Question: ________ technique contribute to machine learning, neural network, association mining, sequential pattern mining. Correct Answer: Pattern discovery Your Answer: Pattern discovery Match The Following Question Correct Answer Your AnswerOperating systems Security, reliability, availability Security, reliability, availability CompatibilityData Acquisition Data Extraction, Data Extraction, Transformation,
Transformation, cleansing, cleansing, integrationintegration
Data Storage Data loading , Archiving Data loading , Archiving Information Delivery Report generation, query Report generation, query processing
processing and complex and complex analysis analysis
True/False Question: The Structure that brings all the components together is known as Architecture. Correct Answer: True Your Answer: True Multiple Choice Multiple Answer Question: Advantages of Wavelet transformation for clustering are :- Correct Answer: Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast Your Answer: Unsupervised clustering , Detection of cluster for accuracy , Decomposition of cluster for accuracy Multiple Choice Multiple Answer Question: The Main areas of Data Warehouse are :- Correct Answer: Data acquisition , Data Storage , Information Delivery Your Answer: Data Stage , Data Storage , Information Delivery True/False Question: In decision tree internal nodes are denoted by ovals and leaf nodes are denoted by rectangles Correct Answer: False Your Answer: False True/False Question: In Database system multidimensional index trees are primarily used for providing fast data access. Correct Answer: True Your Answer: True Select The Blank Question: ________ is the platform for complex data transformation for the purpose of cleanse it
Page 119 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: Separate optimal Platform Your Answer: Separate optimal Platform Multiple Choice Single Answer Question: Bitmapped indexes are more suitable for data warehouse environment than for an OLTP system Correct Answer: Bitmapped index Your Answer: Bitmapped index Multiple Choice Single Answer Question: The Clustering method DBSCAN stands for :- Correct Answer: Desity Based Spatial clustering of Application with Noise Your Answer: Desity Based Spatial clustering of Application with Noise Select The Blank Question: ________ is an alternative aggolomerative hierarchical clustering algorithm. Correct Answer: ROCK Your Answer: ROCK Multiple Choice Single Answer Question: Query tool is meant for :- Correct Answer: Data acquisition Your Answer: Information delivery Select The Blank Question: ________ are responsible for running queries and reports against data warehouse tables. Correct Answer: End users Your Answer: End users Multiple Choice Multiple Answer Question: Classification and Prediction have following applications :- Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction Your Answer: Credit approval , Medical Diagnosis , Performance Prediction Select The Blank Question: ________ architecture is more concerned with data access than memory access. Correct Answer: MPP Your Answer: MPP Select The Blank Question: ________ are the inter platform devices that unable massive quantities of data to be transported from one platform to another. Correct Answer: Data ports Your Answer: Data ports Multiple Choice Single Answer Question: Which technique analyze experimental data? Correct Answer: Analysis of variance Your Answer: Regression True/False Question: Data classification is two step process in which first step includes classfication of model and in second step model describes set of data. Correct Answer: False Your Answer: True
Page 120 of 141
SCDL – 4th Semester – Data Mining
Select The Blank Question: ________ clustering method follows statistical and neural network approach. Correct Answer: Model based Your Answer: Model based Multiple Choice Single Answer Question: Which of the following methods for regression is used on sparse data :- Correct Answer: Regression and log-linear model Your Answer: Regression and log-linear model True/False Question: Audio data mining can be an interesting alternative to visual mining. Correct Answer: True Your Answer: True Multiple Choice Single Answer Question: If many indexes are needed, then on which table which option is more preferable? Correct Answer: Splitting of tables Your Answer: Collecting of tables Select The Blank Question: Indexed ________ engines search index,web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords Correct Answer: Web Search Your Answer: Web Search Multiple Choice Multiple Answer Question: Distinguishing characteristics of data warehouse architecture are :- Correct Answer: Different Objective Scope , Data Content , Flexible and Dynamic Your Answer: Different Objective Scope , Data Content , Flexible and Dynamic Multiple Choice Single Answer Question: Which type of analysis of DNA facilitates discovery of group of genes and study of interaction and relationship between them? Correct Answer: Association analysis Your Answer: Association analysis True/False Question: Noise in data means error or variance in measured variable. Correct Answer: True Your Answer: True Select The Blank Question: ________ is the user who has all access privileges like system, database administrator, for table and views. Correct Answer: Security administrator Your Answer: Security administrator Multiple Choice Multiple Answer Question: The main categories of Metadata in warehouse are :- Correct Answer: Operational , Extraction and transformation Metadata , End user Metadata Your Answer: Operational , Extraction and transformation Metadata , End user Metadata Multiple Choice Single Answer Question: Simple matching approach is used for computing disimilarity between two objects for :-
Page 121 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: Nominal variable Your Answer: Nominal variable True/False Question: One of the most important search problem in genetic analysis is similarity search and comparison among DNA sequence. Correct Answer: True Your Answer: True True/False Question: Data cube stores multidimensional aggregate information. Correct Answer: True Your Answer: True Multiple Choice Single Answer Question: Large number of indexes affects the loading process because :- Correct Answer: Indexes are created for new records Your Answer: Searching record becomes difficult Select The Blank Question: Most of the warehouses employ ________ database Management System. Correct Answer: Relational Your Answer: Relational Multiple Choice Single Answer Question: In intermediate data extraction data capture through transaction log uses transaction from :- Correct Answer: Recovery from failure Your Answer: Recovery from failure Multiple Choice Single Answer Question: Redundancies can be deleted by :- Correct Answer: Co-relational analysis Your Answer: Co-relational analysis True/False Question: Descriptive mining takes perform ingerence on current data which predictive mining characterize the general properties of data in database Correct Answer: False Your Answer: True Select The Blank Question: When data block contains excessive amount of free space, performance ________ Correct Answer: Degenerates Your Answer: Degenerates Multiple Choice Multiple Answer Question: The smoothing techniques are :- Correct Answer: Binning , Clustering , Regression Your Answer: Binning , Clustering , Regression True/False Question: A process of grouping a set of physical or abstract objects into classes of similar objects is called clusiering Correct Answer: True Your Answer: True
Page 122 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question: For Banking and financial data which type of analysis is used? Correct Answer: Multidimensional Your Answer: Relational Multiple Choice Multiple Answer Question: The dimensions of spatial data cube are :- Correct Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial Your Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial Multiple Choice Single Answer Question: Which of the following technique involves placing and managing related units of data in same physical block of storage Correct Answer: Clustering Your Answer: Clustering Multiple Choice Multiple Answer Question: Data processing techniques are :- Correct Answer: Cleansing , Integration , Transformation Your Answer: Cleansing , Integration , Transformation Match The Following Question Correct Answer Your AnswerClustering Data tuples as objects Great accuracy Dimension reduction Removal of irrelevant data Removal of irrelevant data Data compression More computations Encoding mechanism Wrapper approach Great accuracy Data reduction Select The Blank Question: ________ can store aggregate and detail data at varying levels of resolution or abstraction. Correct Answer: Index tree Your Answer: Index tree Multiple Choice Multiple Answer Question: Following are the issues to consider during data integration :- Correct Answer: Schema integration , Redundancy , Detection and resolution of data values Your Answer: Schema integration , Redundancy , Detection and resolution of data values
LIST OF ATTEMPTED QUESTIONS AND ANSWERS Multiple Choice Single Answer Question: Histograms, the methods to store reduced representation of data uses :- Correct Answer: Binning Your Answer: Aggregation Multiple Choice Single Answer Question: Which of the following is based on set of density distribution function clustering? Correct Answer: DBSCAN Your Answer: DBSCAN Multiple Choice Multiple Answer Question: Source Data Component may be grouped into following categories :- Correct Answer: Production Data , Internal External Data
Page 123 of 141
SCDL – 4th Semester – Data Mining
Your Answer: Production Data , Internal External Data Select The Blank Question: ________ does not handle categorical attributes. Correct Answer: CURE Your Answer: CURE Select The Blank Question: Semantic integration of ________ genome database is the important task of DNA analysis. Correct Answer: Heterogeneous and distributed Your Answer: Heterogeneous and distributed True/False Question: Data staging and data storage may start out on same computing platform. Correct Answer: True Your Answer: True True/False Question: Data in data warehouse cuts across application. Correct Answer: True Your Answer: False True/False Question: Loan payment prediction and customer credit analysis are critical to business of bank. Correct Answer: True Your Answer: True Multiple Choice Multiple Answer Question: Data integration means :- Correct Answer: Integrating database , Integrating cubes , Integrating files Your Answer: Integrating cubes , Integrating files , Integrating attributes Multiple Choice Multiple Answer Question: Data mining is applicable to :- Correct Answer: Relational Database , Data Warehouse , Transaction Database Your Answer: Relational Database , Data Warehouse , Transaction Database Multiple Choice Multiple Answer Question: The information delivery methods from data warehouse are :- Correct Answer: Complex queries , MD Analysis , Statistical Analysis Your Answer: Complex queries , MD Analysis , Statistical Analysis Multiple Choice Multiple Answer Question: SMP provides the features like :- Correct Answer: Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus , Each node has access to common set of disks Your Answer: Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus , Each node has access to common set of disks Multiple Choice Multiple Answer Question: Splitting of query by DBMS in intra query parallelization is for :- Correct Answer: Index read , Data read , Data joint Your Answer: Index read , Data read , Data joint
Page 124 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single Answer Question: For Incremental data loads the sequence is :- Correct Answer: Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing Your Answer: Triggering ->data extraction ->Filtering -> Transformation ->Integration ->cleansing Multiple Choice Multiple Answer Question: The platform of Data warehouse consists of :- Correct Answer: Basic hardware components , Operating System , Network and Network software Your Answer: Operating System , Network and Network software , Utility software Multiple Choice Multiple Answer Question: Following factors play important role in financial analysis :- Correct Answer: Data warehouse , Data cubes , Outliner analysis Your Answer: Data warehouse , Data cubes , Outliner analysis Multiple Choice Single Answer Question: Which of the following data capture method of data abstraction is time consuming? Correct Answer: Capture by comparing files Your Answer: Capture by comparing files Multiple Choice Single Answer Question: Capture at data source and that's why this method is quite reliable :- Correct Answer: Capture by database Triggers Your Answer: Capture by database Triggers True/False Question: To remove noise from data is called as Smoothing. Correct Answer: True Your Answer: True True/False Question: NUMA provides better scalability than SMP. Correct Answer: True Your Answer: True Multiple Choice Multiple Answer Question: The Architecture defines :- Correct Answer: Measurements , Standard , General Design Your Answer: Measurements , Standard , General Design Multiple Choice Multiple Answer Question: Data reduction includes :- Correct Answer: Single value decomposition , Wavelets , Regression Your Answer: Single value decomposition , Wavelets , Regression Multiple Choice Single Answer Question: Which of the following component includes database Management System? Correct Answer: Data Storage Your Answer: Management and control Match The Following Question Correct Answer Your Answer
Page 125 of 141
SCDL – 4th Semester – Data Mining
Data loading tool Primary key generation Primary key generation Data modeling tool Reverse Engineering Reverse Engineering capabilities
capabilitiesData Extraction tool Bulk extraction for full Bulk extraction for full refresh refreshData transformation Default values Default values tool Multiple Choice Single Answer Question: Which type of following clustering computes augumented cluster ordering? Correct Answer: OPTICS Your Answer: OPTICS Multiple Choice Single Answer Question: Which from the following are special programs that are stored on database and fired when certain predefined action occurs? Correct Answer: Triggers Your Answer: Triggers Multiple Choice Single Answer Question: Attribute construction is the part of :- Correct Answer: Transformation Your Answer: Transformation Multiple Choice Single Answer Question: The stored values of an attribute represents the value of attribute at this moment of time is :- Correct Answer: Current value Your Answer: Current value Multiple Choice Single Answer Question: The option "capture in source application technique of data extraction degrades performance of source application because :- Correct Answer: Additional processing needs Your Answer: Additional processing needs Select The Blank Question: ________ technique is the statistical technique for analyzing data. Correct Answer: Time series Your Answer: Analysis of variance Select The Blank Question: ________ function of data staging component involves many forms of combining pieces of data from different sources. Correct Answer: Data Transformation Your Answer: Data Transformation True/False Question: To detect money laundering and other financial crimes, it is important to integrate information for multiple databases. Correct Answer: True Your Answer: True Multiple Choice Single Answer Question: Which of the following option of data extraction is known as application assisted data capture?
Page 126 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: Capture in source application Your Answer: Capture in source application Multiple Choice Single Answer Question: Dimensionality reduction reduces the data set size by removing :- Correct Answer: Irrelevant attributes Your Answer: Irrelevant attributes Multiple Choice Single Answer Question: Maintenance of cache consistency is the limitation of :- Correct Answer: MPP Your Answer: NUMA Select The Blank Question: ________ is the method used to predict the value of response variable from one to more variables. Correct Answer: Regression Your Answer: Analysis of variance True/False Question: Metadata is building block of data warehouse. Correct Answer: True Your Answer: True Select The Blank Question: ________ is the type of pilot for early delivery with broader scope and may be integrated. Correct Answer: Broad business pilot Your Answer: Broad business pilot Select The Blank Question: In data ________, data encoding or transformations are applied to obtain reduced or compressed representation. Correct Answer: Compression Your Answer: Compression Multiple Choice Multiple Answer Question: Metadata in a data warehouse falls into following categories :- Correct Answer: Operational Metadata , Extraction and Transformation metadata , End-user Metadata Your Answer: Operational Metadata , Extraction and Transformation metadata , End-user Metadata True/False Question: Data integration merges data from multiple sources into coherent sources. Correct Answer: True Your Answer: True Match The Following Question Correct Answer Your AnswerAdministration Providing support for all DBA functions Support for System administration Extensibility Hybrid Extension to OLAP Providing support for all DBA database
functions Portability Across platform APIs For tools from loading vendors Query tool APIs For tools from loading Hybrid Extension to OLTP database
vendors
Page 127 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Multiple Answer Question: Data transformation includes :- Correct Answer: Smoothing , Aggregation , Generalization Your Answer: Smoothing , Aggregation , Generalization Multiple Choice Multiple Answer Question: Knowledge discovery process includes :- Correct Answer: Data Cleaning , Data Intergration , Data Selectin Your Answer: Data Cleaning , Data Intergration , Data Selectin Multiple Choice Single Answer Question: Queries run faster to find exact match using which type of indexing? Correct Answer: Clustered index Your Answer: Clustered index True/False Question: Intelligent miner is an IBM data mining product. Correct Answer: True Your Answer: True
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Multiple AnswerQuestion: Building blocks of Data Warehouse are :-Correct Answer: Management and Control , Source Data , Data Staging Your Answer: Management and Control , Source Data , Data Staging
Multiple Choice Single AnswerQuestion: Substantial portion of available information is stored in :-Correct Answer: Text dataYour Answer: Object oriented database
True/FalseQuestion: The data Warehouse is query-centric.Correct Answer: TrueYour Answer: True
True/FalseQuestion: Data mining is a piece of integrated solutions.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Which of the following data capture method of data abstraction is time consuming?Correct Answer: Capture by comparing filesYour Answer: Capture by comparing files
Select The BlankQuestion: ________ does not handle categorical attributes.Correct Answer: CUREYour Answer: CURE
True/FalseQuestion: In the data acquisition area, the data flow begins at the data sources and pauses at staging area.
Page 128 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Association rules mining is based on :-Correct Answer: Clustering and Employing rules for classificationYour Answer: Clustering and Employing rules for classification
True/FalseQuestion: In physical design of warehouse, developing standard ensures consistency across the various areas.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Bayes Theorem is :-Correct Answer: P(H|X)=P(X|H)(P)/P(X)Your Answer: P(H|X)=P(X|H)(P)/P(X)
Select The BlankQuestion: Indexed ________ engines search index,web pages and build huge keyword based indices which help to search sets of web pages containing certain keywordsCorrect Answer: Web SearchYour Answer: Web Search
Multiple Choice Single AnswerQuestion: Data matrix is :-Correct Answer: Object by variable structureYour Answer: Object by variable structure
Multiple Choice Single AnswerQuestion: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-Correct Answer: Huge size of dataYour Answer: Huge size of data
Multiple Choice Single AnswerQuestion: Simple matching approach is used for computing disimilarity between two objects for :-Correct Answer: Nominal variableYour Answer: Nominal variable
Multiple Choice Multiple AnswerQuestion: Clustering Techniques organised into following categories :-Correct Answer: Partitioning , Density Based , Grid Based Your Answer: Partitioning , Density Based , Grid Based
Select The BlankQuestion: Most of the warehouses employ ________ database Management System.Correct Answer: RelationalYour Answer: Relational
Multiple Choice Single AnswerQuestion: Data cleansing effort can begin with :-Correct Answer: High priority dataYour Answer: High priority data
Page 129 of 141
SCDL – 4th Semester – Data Mining
True/FalseQuestion: Sequential pattern analysis and similarity search techniques have been developed in data mining.Correct Answer: TrueYour Answer: True
Match The FollowingQuestion Correct Answer Your AnswerLoad Utility High performance data High performance data loading,
loading, recovery recoveryQuery Governer Abort runaway query Abort runaway queryQuery Optimizer Parsing, optimizing query Parsing, optimizing queryQuery Management Balancing extraction of query Balancing extraction of query
Multiple Choice Multiple AnswerQuestion: Distinguishing characteristics of data warehouse architecture are :-Correct Answer: Different Objective Scope , Data Content, Flexible and Dynamic Your Answer: Different Objective Scope , Data Content , Flexible and Dynamic
Multiple Choice Single AnswerQuestion: Which type of integrity constraint forces the establishment of parent -child relationship?Correct Answer: Referential integrityYour Answer: Referential integrity
Select The BlankQuestion: An information measures called ________ can be used to recursively partition the values of numeric attribute.Correct Answer: EntropyYour Answer: Entropy
True/FalseQuestion: Metadata is building block of data warehouse.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: In which of the following type of mining frequently occuring patterns related to time and sequence are mined?Correct Answer: Sequential pattern miningYour Answer: Time series data mining
Select The BlankQuestion: ________ is the time consuming and less feasible approach for filling missing values.Correct Answer: Filling missing values manuallyYour Answer: Filling missing values manually
Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :-Correct Answer: Credit approval , Medical Diagnosis, Performance Prediction Your Answer: Credit approval , Medical Diagnosis , Performance Prediction
Multiple Choice Multiple AnswerQuestion: Data processing techniques are :-Correct Answer: Cleansing , Integration , Transformation Your Answer: Cleansing , Integration , Transformation
Page 130 of 141
SCDL – 4th Semester – Data Mining
True/FalseQuestion: Data in warehouse is primarily for query.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Data reduction obtains a reduced representation of data set that is :-Correct Answer: Much smallerYour Answer: Much smaller
Multiple Choice Single AnswerQuestion: Which of the following type executes query operations in pipeline manner?Correct Answer: Vertical parallelismYour Answer: Vertical parallelism
Multiple Choice Single AnswerQuestion: User gets an enterprise wide view of information from the data warehouse due to :-Correct Answer: Improved productivityYour Answer: Newer opportunity
Select The BlankQuestion: ________ databases are one of the most poplularly available and rich information repositories.Correct Answer: RelationalYour Answer: Relational
Multiple Choice Single AnswerQuestion: Which database type stores a large amount of space-related data?Correct Answer: SpatialYour Answer: Spatial
Multiple Choice Multiple AnswerQuestion: DNA sequences are comprised of :-Correct Answer: Adenine , Gaunine , Thymine Your Answer: Adenine , Gaunine , Thymine
Multiple Choice Multiple AnswerQuestion: The strategies for data reduction are :-Correct Answer: Data aggregation , Dimension reduction , Numerocity reduction Your Answer: Data aggregation , Dimension reduction , Numerocity reduction
Select The BlankQuestion: ________ is an effective way to discover knowledge from huge amount of data.Correct Answer: Visual data miningYour Answer: Web mining
Select The BlankQuestion: ________ is the process of grouping data into classes.Correct Answer: ClusteringYour Answer: Classification
Multiple Choice Multiple Answer
Page 131 of 141
SCDL – 4th Semester – Data Mining
Question: Data mining Functionalities are :-Correct Answer: Charactrization and Discrimination, Association Analysis, Cluster Analysis Your Answer: Charactrization and Discrimination , Association Analysis , Cluster Analysis
Select The BlankQuestion: ________ is a summarization of general characteristics or features of a target class of data.Correct Answer: Data CharacterizationYour Answer: Data Characterization
Multiple Choice Single AnswerQuestion: Classification rules are extracted fromCorrect Answer: Decision TreeYour Answer: Decision Tree
Multiple Choice Single AnswerQuestion: Which of the follwing inheritance is supported by Object oriented databases?Correct Answer: Multiple InheritanceYour Answer: Single Inheritance
Select The BlankQuestion: For decision making process ________ process which considers finding only interesting patterns is used.Correct Answer: Microeconomic viewYour Answer: Pattern discovery
Match The FollowingQuestion Correct Answer Your AnswerInitial load of data as-is' data capture as-is' data capturewarehouseStatic data Capture of data in given Capture of data in given point of point of
time timeData revision Incremental data capture Incremental data captureIncremental data Differed data capture Differed data capture
True/FalseQuestion: Business metadata is like a roadmap or easy to use information directory showing contents and how to get there.Correct Answer: TrueYour Answer: True
True/FalseQuestion: Data in data warehouse cuts across application.Correct Answer: TrueYour Answer: True
True/FalseQuestion: Remote deployment of desktop tools is usually faster.Correct Answer: TrueYour Answer: False
Multiple Choice Single AnswerQuestion: Effect of one attibute value on a given class is independent of values of other attibute is calledCorrect Answer: Value independenceYour Answer: Value independence
Page 132 of 141
SCDL – 4th Semester – Data Mining
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Multiple AnswerQuestion: Building blocks of Data Warehouse are :-Correct Answer: Management and Control , Source Data , Data StagingYour Answer: Management and Control , Source Data , Data Staging
Multiple Choice Single AnswerQuestion: Substantial portion of available information is stored in :-Correct Answer: Text dataYour Answer: Object oriented database
True/FalseQuestion: The data Warehouse is query-centric.Correct Answer: TrueYour Answer: True
True/FalseQuestion: Data mining is a piece of integrated solutions.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Which of the following data capture method of data abstraction is time consuming?Correct Answer: Capture by comparing filesYour Answer: Capture by comparing files
Select The BlankQuestion: ________ does not handle categorical attributes.Correct Answer: CUREYour Answer: CURE
True/FalseQuestion: In the data acquisition area, the data flow begins at the data sources and pauses at staging area.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Association rules mining is based on :-Correct Answer: Clustering and Employing rules for classificationYour Answer: Clustering and Employing rules for classification
True/FalseQuestion: In physical design of warehouse, developing standard ensures consistency across the various areas.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Bayes Theorem is :-Correct Answer: P(H|X)=P(X|H)(P)/P(X)Your Answer: P(H|X)=P(X|H)(P)/P(X)
Page 133 of 141
SCDL – 4th Semester – Data Mining
Select The BlankQuestion: Indexed ________ engines search index,web pages and build huge keyword based indices which help to search sets of web pages containing certain keywordsCorrect Answer: Web SearchYour Answer: Web Search
Multiple Choice Single AnswerQuestion: Data matrix is :-Correct Answer: Object by variable structureYour Answer: Object by variable structure
Multiple Choice Single AnswerQuestion: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-Correct Answer: Huge size of dataYour Answer: Huge size of data
Multiple Choice Single AnswerQuestion: Simple matching approach is used for computing disimilarity between two objects for :-Correct Answer: Nominal variableYour Answer: Nominal variable
Multiple Choice Multiple AnswerQuestion: Clustering Techniques organised into following categories :-Correct Answer: Partitioning , Density Based , Grid BasedYour Answer: Partitioning , Density Based , Grid Based
Select The BlankQuestion: Most of the warehouses employ ________ database Management System.Correct Answer: RelationalYour Answer: Relational
Multiple Choice Single AnswerQuestion: Data cleansing effort can begin with :-Correct Answer: High priority dataYour Answer: High priority data
True/FalseQuestion: Sequential pattern analysis and similarity searchtechniques have been developed in data mining.Correct Answer: TrueYour Answer: True
Match The FollowingQuestion Correct Answer Your AnswerLoad Utility High performance data High performance
loading, recovery data loading, recoveryQuery Governer Abort runaway query Abort runaway queryQuery Optimizer Parsing, optimizing query Parsing, optimizing queryQuery Management Balancing extraction of query Balancing extraction of query
Multiple Choice Multiple AnswerQuestion: Distinguishing characteristics of data warehouse architecture are :-Correct Answer: Different Objective Scope, Data Content, Flexible and DynamicYour Answer: Different Objective Scope, Data Content, Flexible and Dynamic
Page 134 of 141
SCDL – 4th Semester – Data Mining
Multiple Choice Single AnswerQuestion: Which type of integrity constraint forces the establishment of parent -child relationship?Correct Answer: Referential integrityYour Answer: Referential integrity
Select The BlankQuestion: An information measures called ________ can be used to recursively partition the values of numeric attribute.Correct Answer: EntropyYour Answer: Entropy
True/FalseQuestion: Metadata is building block of data warehouse.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: In which of the following type of mining frequently occuring patterns related to time and sequence are mined?Correct Answer: Sequential pattern miningYour Answer: Time series data mining
Select The BlankQuestion: ________ is the time consuming and less feasible approach for filling missing values.Correct Answer: Filling missing values manuallyYour Answer: Filling missing values manually
Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :-Correct Answer: Credit approval , Medical Diagnosis, Performance PredictionYour Answer: Credit approval , Medical Diagnosis , Performance Prediction
Multiple Choice Multiple AnswerQuestion: Data processing techniques are :-Correct Answer: Cleansing , Integration , TransformationYour Answer: Cleansing , Integration , Transformation
True/FalseQuestion: Data in warehouse is primarily for query.Correct Answer: TrueYour Answer: True
Multiple Choice Single AnswerQuestion: Data reduction obtains a reduced representation of data set that is :-Correct Answer: Much smallerYour Answer: Much smaller
Multiple Choice Single AnswerQuestion: Which of the following type executes query operations in pipeline manner?Correct Answer: Vertical parallelismYour Answer: Vertical parallelism
Multiple Choice Single AnswerQuestion: User gets an enterprise wide view of information from the data warehouse due to :-
Page 135 of 141
SCDL – 4th Semester – Data Mining
Correct Answer: Improved productivityYour Answer: Newer opportunity
Select The BlankQuestion: ________ databases are one of the most poplularly available and rich information repositories.Correct Answer: RelationalYour Answer: Relational
Multiple Choice Single AnswerQuestion: Which database type stores a large amount of space-related data?Correct Answer: SpatialYour Answer: Spatial
Multiple Choice Multiple AnswerQuestion: DNA sequences are comprised of :-Correct Answer: Adenine , Gaunine , ThymineYour Answer: Adenine , Gaunine , Thymine
Multiple Choice Multiple AnswerQuestion: The strategies for data reduction are :-Correct Answer: Data aggregation , Dimension reduction ,Numerocity reductionYour Answer: Data aggregation , Dimension reduction , Numerocity reduction
Select The BlankQuestion: ________ is an effective way to discover knowledge from huge amount of data.Correct Answer: Visual data miningYour Answer: Web mining
Select The BlankQuestion: ________ is the process of grouping data into classes.Correct Answer: ClusteringYour Answer: Classification
Multiple Choice Multiple AnswerQuestion: Data mining Functionalities are :-Correct Answer: Charactrization and Discrimination, Association Analysis , Cluster AnalysisYour Answer: Charactrization and Discrimination, Association Analysis , Cluster Analysis
Select The BlankQuestion: ________ is a summarization of general characteristics or features of a target class of data.Correct Answer: Data CharacterizationYour Answer: Data Characterization
Multiple Choice Single AnswerQuestion: Classification rules are extracted fromCorrect Answer: Decision TreeYour Answer: Decision Tree
Multiple Choice Single AnswerQuestion: Which of the follwing inheritance is supported by Object oriented databases?Correct Answer: Multiple InheritanceYour Answer: Single Inheritance
Select The Blank
Page 136 of 141
SCDL – 4th Semester – Data Mining
Question: For decision making process ________ process which considers finding only interesting patterns is used.Correct Answer: Microeconomic viewYour Answer: Pattern discovery
Match The FollowingQuestion Correct Answer Your AnswerInitial load of data warehouse as-is' data capture as-is' data captureStatic data Capture of data in given Capture of data in given point
point of time timeData revision Incremental data capture Incremental data captureIncremental data capture Differed data capture Differed data capture
True/FalseQuestion: Business metadata is like a roadmap or easy to use information directory showing contents and how to get there.Correct Answer: TrueYour Answer: True
True/FalseQuestion: Data in data warehouse cuts across application.Correct Answer: TrueYour Answer: True
True/FalseQuestion: Remote deployment of desktop tools is usually faster.Correct Answer: TrueYour Answer: False
Multiple Choice Single AnswerQuestion: Effect of one attibute value on a given class is independent of values of other attibute is calledCorrect Answer: Value independenceYour Answer: Value independence
Unattended QuestionsMatch the Following. Data Quality tool
2 1. Assist data ware house administration
2. OLAP tools 6
2. Locating data errors
3. Alert system tool 5
3. Transparent access to source system
4. Middleware & connectivity tool 3
4. Track on number of queries
5. Users attention on exceptions
6. Channel queries
Select The Blank
Page 137 of 141
SCDL – 4th Semester – Data Mining
clustering method follows statistical and neural network approach.
True/FalseData cleansing means removing noisy and inconsistent data. TRUE
Match The Following1. Non volatile data 2 1. External data
2. Data granularity 4 2. Query and analysis
3. Data from external source 1 3. Production data
4. Disparate data 3 4. Level of detail
5. Archive data
6. Internal data
Match The Following1. Data storage 1 1. Data management
2. Data staging 2 2. Workbench for data
3. Data Mining 5 3. Details of summary
4. Metadata 6 4. Private spreadsheet data
5. Knowledge discovery
6. Roadmap for user
True/FalseThe Structure that brings all the components together is known as Architecture. TRUE/FALSE
Match The Following 1. Data modeling tool 1 1. Reverse Engineering capabilities
2. Data Extraction tool 4 2. Default values
3. Data transformation tool 2 3. Formulating and running queries
4. Data loading tool 5 4. Bulk extraction for full refresh
5. Primary key generation
6. Replication
Match The Following 1. Static data
1. Immediate data capture
2. Data revision
2. Capture of data in given point of time
3. Incremental data capture
3. Incremental data capture
4. Initial load of data warehouse
4. Value of attribute at specific time
Page 138 of 141
SCDL – 4th Semester – Data Mining
5. "as-is" data capture
6. Differed data capture
Match The Following1. Initial Load
4 1. New record supercedes
2. Incremental Load 6
2. Offline data warehouse
3. Load Image 5
3. Applying data
4. Constructive merge 1
4. Populating data warehouse table first time
5. To correspond to target files
6. Applying ongoing changes
Match The Following1. Identify source application 2 1. Method of extraction
2. Denote time window 5 2. Source identification
3. Handling unextractable input records 6 3. Extraction
4. Extraction is manual/Tool based 1 4. Job sequencing
5. Time window
6. Exception handling
Multiple Choice Multiple Answer7. The main categories of Metadata in warehouse are :-a)
b)
c)d)
Operational
Execution and Transformation Metadata
Extraction and transformation Metadata
End user Metadata
Multiple Choice Multiple Answer20.
The ways of Intra query parallelization are :-
a)
b)
c)
d)
Horizontal parallelization
Vertical Parallelization
Hybrid parallelization
Homogenous parallelization
Multiple Choice Single Answer
Page 139 of 141
SCDL – 4th Semester – Data Mining
30.Sequence of physical design of data warehouse is :- a)
b)
c)
d)
Develop standards--Create aggregate plans--determine data partitioning schemem--extablish clustering option--prepare indexing strategy--complete physical model
Develop standards--determine data partitioning scheme--Create aggregate plans--establish clustering option--prepare indexing strategy--complete physical model
Develop standards--prepare indexing strategy--Create aggregate plans--determine data partitioning scheme--establish clustering option---complete physical model
Develop standards--Create aggregate plans--establish clustering option--determine data partitioning scheme--prepare indexing strategy--complete physical model
Multiple Choice Single Answer44.Data migration affects performance requiring multiple blocks to be read which can be
adjusted by :-a)
b)
c)
d)
Block percent free
Block percent used
Block percent occupied
Block percent vacant
True/False48. In Linear regression data are modeled to fit a straight line.
True
False
Select The Blank16. The technique of_____________enables concurrent input/output operations and improves file's access performance substantially.
a) Data migrationb) File striping c) Block utilizationd) Dynamic extension
Match the Following 1. Data visualization
1. Visual display
2. Data mining result visualization
2. Presentation of knowledge
3. Data mining process visualization
3. Data mining in visual format
4. Interactive visual data mining
4. Visualization tool
5. Graphical display
Page 140 of 141
SCDL – 4th Semester – Data Mining
6. Audio signal
Page 141 of 141