Top 150+ Solved Bigdata MCQ Questions Answer

From 61 to 75 of 103

Q. The model will be trained with data in one single batch is known as ?

a. Batch learning

b. Offline learning

c. Both A and B

d. None of the above

  • c. Both A and B

Q. Which of the following are ML methods?

a. based on human supervision

b. supervised Learning

c. semi-reinforcement Learning

d. All of the above

  • a. based on human supervision

Q. In Model based learning methods, an iterative process takes place on theML models that are built based on various model parameters, called ?

a. mini-batches

b. optimizedparameters

c. hyperparameters

d. superparameters

  • c. hyperparameters

Q. Which of the following is a widely used and effective machine learningalgorithm based on the idea of bagging?

a. Decision Tree

b. Regression

c. Classification

d. Random Forest

  • d. Random Forest

Q. Which of the following is a disadvantage of decision trees?

a. Factor analysis

b. Decision trees are robust to outliers

c. Decision trees are prone to be overfit

d. None of the above

  • c. Decision trees are prone to be overfit

Q. How do you handle missing or corrupted data in a dataset?

a. Drop missing rows or columns

b. Replace missing values with mean/median/mode

c. Assign a unique category to missing values

d. All of the above

  • d. All of the above

Q. When performing regression or classification, which of the following is thecorrect way to preprocess the data?

a. Normalize the data -> PCA -> training

b. PCA -> normalize PCA output -> training

c. Normalize the data -> PCA -> normalize PCA output -> training

d. None of the above

  • a. Normalize the data -> PCA -> training

Q. Which of the following statements about regularization is not correct?

a. Using too large a value of lambda can cause your hypothesis to underfit the dat

b. Using too large a value of lambda can cause your hypothesis to overfit the data

c. Using a very large value of lambda cannot hurt the performance of your hypothesis.

d. None of the above

  • d. None of the above

Q. What is a sentence parser typically used for?

a. It is used to parse sentences to check if they are utf-8 compliant.

b. It is used to parse sentences to derive their most likely syntax tree structures.

c. It is used to parse sentences to assign POS tags to all tokens.

d. It is used to check if sentences can be parsed into meaningful tokens.

  • b. It is used to parse sentences to derive their most likely syntax tree structures.

Q. To find the minimum or the maximum of a function, we set the gradient tozero because:

a. The value of the gradient at extrema of a function is always zero

b. Depends on the type of problem

c. Both A and B

d. None of the above

  • a. The value of the gradient at extrema of a function is always zero

Q. Which of the following techniques can not be used for normalization intext mining?

a. Stemming

b. Lemmatization

c. Stop Word Removal

d. None of the above

  • c. Stop Word Removal

Q. Which of the following is a reasonable way to select the number of principal components "k"?

a. Choose k to be the smallest value so that at least 99% of the varinace is retained.

b. Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).

c. Choose k to be the largest value so that 99% of the variance is retained.

d. Use the elbow metho

  • a. Choose k to be the smallest value so that at least 99% of the varinace is retained.

Q. Data Analysis is a process of?

a. inspecting data

b. cleaning data

c. transforming data

d. All of the above

  • d. All of the above

Q. Which of the following is not a major data analysis approaches?

a. Data Mining

b. Predictive Intelligence

c. Business Intelligence

d. Text Analytics

  • b. Predictive Intelligence
Subscribe Now

Get All Updates & News