Top 150+ Solved Bigdata MCQ Questions Answer
Q. Which of the following is false?
a. data visualization include the ability to absorb information quickly
b. Data visualization is another form of visual art
c. Data visualization decrease the insights and take solwer decisions
d. None Of the above
Q. Common use cases for data visualization include?
a. Politics
b. Sales and marketing
c. Healthcare
d. All of the above
Q. Which of the following plots are often used for checking randomness intime series?
a. Autocausation
b. Autorank
c. Autocorrelation
d. None of the above
Q. To find the minimum or the maximum of a function, we set the gradient to zero because:
a. The value of the gradient at extrema of a function is always zero
b. Depends on the type of problem
c. Both A and B
d. None of the above
Q. Which of the following techniques can not be used for normalization in text mining?
a. Stemming
b. Lemmatization
c. Stop Word Removal
d. None of the above
Q. In which of the following cases will K-means clustering fail to give goodresults? 1) Data points with outliers 2) Data points with different densities 3) Data points with nonconvex shapes
a. 1 and 2
b. 2 and 3
c. 1 and 3
d. All of the above
Q. Which of the following is a reasonable way to select the number ofprincipal components "k"?
a. Choose k to be the smallest value so that at least 99% of the varinace is retained.
b. Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).
c. Choose k to be the largest value so that 99% of the variance is retained.
d. Use the elbow metho
Q. Which of the following is false?
a. Subsetting can be used to select and exclude variables and observations
b. Raw data should be processed only one time.
c. Merging concerns combining datasets on the same observations to produce a result with more variables
d. None Of the above
Q. According to analysts, for what can traditional IT systems provide a foundation when they’re integrated with big data technologies like Hadoop?
a. Big data management and data mining
b. Data warehousing and business intelligence
c. Management of Hadoop clusters
d. Collecting and storing unstructured data
Q. File containing R scripts end with extension _______.
a. .R
b. .S
c. .bigdata
d. All of the above
Q. Which of the following is a subset of machine learning?
a. Numpy
b. SciPy
c. Deep Learning
d. All of the above