Question 1 :
Repeating the holdout many times is called ______
- random subsampling
- cross validation
- bootstrap
- bagging
Question 2 :
The _______numerical measure which tells that two objects are alike
- dissimilarity
- clarity
- non clarity
- simmilarity
Question 3 :
As per the concept of KDD process, which of the following statement is valid ?
- KDD and Data Mining have no connection at all
- KDD is one of the steps in Data Mining
- Data Mining is one of the steps in KDD process
- KDD and Data Mining mean the same
Question 4 :
How many coefficients do you need to estimate in a simple linear regression model (One independent and one dependent variable)?
- 1
- 2
- 3
- 4
Question 5 :
Comparison of the general features of the target class data object against the general features of objects from one or multiple contrasting classes is a process of
- Data Characterization
- Data Classification
- Data discrimination
- Data selection
Question 6 :
To extract rules in supervised learning __________is used
- root node
- sibling
- decision trees
- branches
Question 7 :
__________ may be defined as the data objects that do not comply with the general behavior or model of the data available.
- Evolution Analysis
- Prediction
- Classification
- Outlier Analysis
Question 8 :
A data warehouse can be used to analyze a particular ________
- graph
- chart
- domain
- subject
Question 9 :
After the initial load, the data warehouse is kept up-to-date by two actions: REFRESH and UPDATE. As the number of records increase in a Data Warehouse, cost of update operation ______________ .
- decreases
- increases
- remains constant
- is same as cost of Refresh
Question 10 :
_______________ is a process of taking operational data from one or more sources and mapping it, field by field, onto a new data structure in the data warehouse
- Transformation
- Cleansing
- Integration
- Scrubbing
Question 11 :
Euclidean distance measure is
- A stage of the KDD process in which new data is added to the existing selection.
- The process of finding a solution for a problem simply by enumerating all possible solutions according to some pre-defined order and then testing them
- The distance between two points as calculated using the Pythagoras theorem
- The distance between two points as calculated using interval scale
Question 12 :
Periodic Status is
- data in which changes to existing records cause the previous version of the records to be eliminated
- the value of the attribute is preserved as the status every time a change occurs
- data that are never altered or deleted once they have been added
- the value of the attribute at this moment of time.
Question 13 :
Binary attribute are
- This takes only two values. In general, these values will be 0 and 1 and .they can be coded as one bit
- This takes only three values.
- This takes only four values.
- It cant take any value.
Question 14 :
The following rule is an example of which association rule.{ age (X, '20…...29') ^ occupation(X, 'student')→ buys(X, 'laptop') }.
- multilevel association rules
- interlevel association rules
- multidimensional association rules
- intralevel association rules
Question 15 :
Converting data from different sources into a common format for processing is called as ________.
- Selection
- Preprocessing
- Transformation
- Interpretation
Question 16 :
Which algorithm requires fewer scans of data?
- Apriori
- FP growth
- Apriori and FP Growth
- decision
Question 17 :
Given two objects represented by the tuples (22, 1, 42, 10) and (20, 0, 36, 8):Compute the Euclidean distance between the two objects.
- 6.32
- 6.71
- 6.15
- 6.22
Question 18 :
The values of an ________ attribute provide enough information to order objects.
- ratio
- Binary
- Interval
- ordinal
Question 19 :
information stored in the data warehouse.
- additive atleast over one dimension
- Only numeric measures are used
- All possible summaries are used
- It is additive over every dimension of its dimensionality
Question 20 :
Removing duplicate records is a data mining process called ____________ .
- Data isolation
- Recovery
- Data Cleaning
- Data dredging
Question 21 :
It is measured on a scale of equal size units,these attributes allows us to compare such as temperature in C or F and thus values of attributes have order.
- Interval Scaled attribute
- Ratio scaled attribute
- Binary attribute
- Ternary attribute
Question 22 :
The mapping or classification of a class with some predefined group or class is known as?
- Data Characterization
- Data Discrimination
- Data Subset
- Data set
Question 23 :
Which of the following is not a valid Visualization technique ?
- Scatter plot
- Decision Tree
- Box plot
- Histogram
Question 24 :
which information is not provided by Information packages
- Define the common subject areas
- date of full refresh
- Establish data granularity
- Estimate data warehouse size
Question 25 :
from the given options______ is a predictive model
- Clustering
- Regression
- Summarization
- Association rules