Midterm Lab Exam_ Attempt Review

6/26/24, 8:34 AM Midterm Lab Exam: Attempt review
Home / My courses / UGRD-CYBS6101-2333T / MIDTERM EXAMINATION / Midterm Lab Exam
Started on Friday, 21 June 2024, 10:55 AM

State Finished
Completed on Friday, 21 June 2024, 11:05 AM
Time taken 9 mins 29 secs
Marks 43.00/50.00
Grade 86.00 out of 100.00
Question 1
Correct
Mark 1.00 out of 1.00
The KL distance between two discrete probability distributions P and Q is defined as:
Select one:
a. The sum of the ratio of the probabilities of each event in P and Q
b. The sum of the differences between the probabilities of each event in P and Q
c. The sum of the logarithm of the ratio of the probabilities of each event in P and Q
d. The sum of the products of the probabilities of each event in P and Q
Question 2
Correct
Hierarchical clustering is a type of ______________ technique.
Select one:
a. Regression
b. Dimensionality reduction
c. Classification
d. Clustering
https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 1/17
Question 3
Correct
Can the least squares method be used for multiple linear regression?
Select one:
a. It depends on the method used to transform the data set
b. No
c. It depends on the data set
d. Yes
Question 4
Correct
In information theory, the KL distance can be used to measure the information lost when approximating one distribution with
another. Which of the following is NOT a property of the KL distance in this context?
Select one:
a. It is non-negative
b. It is zero only when the two distributions are identical
c. It is always positive
d. It is non-symmetric
Question 5
Correct
What is the disadvantage of the Naive Bayes classifier?
Select one:
a. It is slower to train and predict
b. It is unable to handle large amounts of data
c. It is inflexible
d. It is less accurate
Question 6
Correct
What is the process of evaluating the performance of a trained perceptron on unseen data called?
Select one:
a. Testing
b. Pruning
c. Validation
d. Training
Question 7
Correct
What is the process of removing unnecessary weights from a trained perceptron called?
Select one:
a. Validation
b. Pruning
c. Testing
d. Training
Question 8
Correct
What is the Kullback-Leibler (KL) distance used for?
Select one:
a. To measure the uncertainty of a probability distribution
b. To measure the similarity between two probability distributions
c. To measure the predictability of a probability distribution
d. To measure the dissimilarity between two probability distributions
Question 9
Incorrect
How is the line of best fit calculated using the least squares method?
Select one:
a. By minimizing the mean of the data set
b. By minimizing the sum of the squares of the errors between the data points and the line of best fit
c. By minimizing the variance of the data set
d. By minimizing the sum of the absolute values of the errors between the data points and the line of best fit 
Question 10
Correct
Which of the following is NOT a feature of KNIME?
Select one:
a. Data storage
b. Machine learning
c. Data transformation
d. Flow-based programming
Question 11
Correct
What is the main goal of the k-means algorithm?
Select one:
a. To partition a dataset into a specified number of clusters
b. To classify data into predefined categories
c. To predict the value of a continuous target variable
d. To discover patterns or relationships within a dataset
Question 12
Correct
What is the Hebb rule?
Select one:
a. A rule used to adjust the weights in a neural network
b. A rule used to determine the structure of a neural network
c. A rule used to calculate the output of a neural network
d. A rule used to determine the input to a neural network
Question 13
Correct
How does the k-means algorithm determine which data points belong to which cluster?
Select one:
a. By evaluating the variance of each cluster
b. By computing the distance between data points and the centroid of each cluster
c. By evaluating the probability that a data point belongs to each cluster
d. By comparing the data point to the characteristics of each cluster
Question 14
Correct
What is the equation for the Hebb rule?
Select one:
a. w(new) = w(old) + η(output)x(input)
b. w(new) = w(old) + η(output - target)x(input)
c. w(new) = w(old) + η(input - output)x(target)
d. w(new) = w(old) + η(target - output)x(input)
Question 15
Correct
What is an edge in a Bayesian network?
Select one:
a. A point in the network where two or more nodes meet
b. None of the above
c. A variable in the system being modeled
d. A probabilistic relationship between two variables
Question 16
Correct
How does the least squares method handle outliers in the data set?
Select one:
a. It removes them
b. It gives them more weight
c. It gives them less weight
d. It ignores them
Question 17
Correct
What is the process of adjusting the weights of a perceptron based on the output of the previous layer in a multi-layer perceptron
called?
Select one:
a. Training
b. Testing
c. Backpropagation
d. Validation
Question 18
Correct
What is a directed acyclic graph (DAG)?
Select one:
a. A graph in which the edges do not have a direction and there are no cycles
b. A graph in which the edges have a direction and there are no cycles
c. A graph in which the edges have a direction and there are cycles
d. A graph in which the edges do not have a direction and there are cycles
Question 19
Correct
What is an example of a regression task in supervised learning?
Select one:
a. Grouping customers into different segments based on their spending habits
b. Predicting the stock price for the next day based on historical data
c. Determining whether an email is spam or not
d. Predicting the price of a house based on its characteristics
Question 20
Correct
What is the advantage of using the Gaussian Naive Bayes classifier over other types of Naive Bayes classifiers?
Select one:
a. It is able to handle continuous features
b. It is faster to train and predict
c. It is more accurate
d. It is able to handle categorical features
Question 21
Correct
How is KNIME different from other data analysis tools?
Select one:
a. It has a user-friendly interface
b. It is free
c. It allows users to build custom data pipelines
d. It is open source
Question 22
Correct
The KL distance can be used to measure the difference between two probability distributions in terms of the information content of
the distributions. In this context, the KL distance is also known as:
Select one:
a. The information distance
b. The information gain
c. The information divergence
d. The information ratio
Question 23
Correct
What is the assumption made by the Naive Bayes classifier?
Select one:
a. That the features in the data are dependent on each other
b. That the features in the data are uniformly distributed
c. That the features in the data are normally distributed
d. That the features in the data are independent of each other
Question 24
Correct
What is an example of a batch learning algorithm used for classification tasks?
Select one:
a. K-nearest neighbors
b. Decision tree
c. Linear regression
d. Support vector machine
Question 25
Correct
What is the EM algorithm used to optimize in the "M" step?
Select one:
a. The likelihood of the model
b. The model parameters
c. The prediction accuracy of the model
d. The latent variables
Question 26
Incorrect
How is the final set of clusters determined in the k-means algorithm?
Select one:
a. By selecting the set of clusters that maximize the sum of squared errors
b. By selecting the set of clusters that minimize the within-cluster variance 
c. By selecting the set of clusters that maximize the within-cluster variance
d. By selecting the set of clusters that minimize the sum of squared errors
Question 27
Correct
What is the technical term for a node in a directed acyclic graph (DAG)?
Select one:
a. Vertex
b. Graph
c. Edge
d. Cycle
Question 28
Correct
What is the process of adjusting the weights of a perceptron based on the input data and the desired output called?
Select one:
a. Training
b. Validation
c. Pruning
d. Testing
Question 29
Incorrect
What is the process of calculating the error between the desired output and the actual output of a perceptron called?
Select one:
a. Pruning
b. Validation
c. Training
d. Testing 
Question 30
Correct
Hierarchical clustering is a type of ______________ clustering.
Select one:
a. Deep
b. Hybrid
c. Hierarchical
d. Flat
Question 31
Incorrect
What is an example of a real-world application of directed acyclic graphs (DAGs)?
Select one:
a. All of the above
b. Computer networks
c. Data pipelines 
d. Social media networks
Question 32
Correct
Is the least squares method a deterministic or a probabilistic method?
Select one:
a. Deterministic
b. Probabilistic
c. Both deterministic and probabilistic
d. Neither deterministic nor probabilistic
Question 33
Correct
What is an example of a batch learning algorithm?
Select one:
a. Support vector machine
b. K-nearest neighbors
c. All of the above
d. Linear regression
Question 34
Correct
What is a key characteristic of Bayesian networks?
Select one:
a. They are trained on large amounts of data
b. They use linear algebra for prediction
c. They are based on probability theory
d. They use decision trees for prediction
Question 35
Correct
Which of the following is NOT a limitation of the k-means algorithm?
Select one:
a. It is sensitive to the initial placement of centroids
b. It may produce suboptimal results if the clusters are not spherical
c. It requires the user to specify the number of clusters in advance
d. It is not affected by the scale of the variables
Question 36
Correct
How is the Hebb rule used in the training of a neural network?
Select one:
a. It is used to determine the structure of the neural network
b. It is used to determine the input to the neural network
c. It is used to calculate the output of the neural network
d. It is used to adjust the weights of the neural network based on the input and output
Question 37
Correct
What is the "M" step in the EM algorithm?
Select one:
a. The step where the prediction accuracy of the model is calculated
b. The step where the expectation of the latent variables is calculated
c. The step where the model parameters are updated
d. The step where the likelihood of the model is maximized
Question 38
Correct
What is the process of transforming data into a consistent format called?
Select one:
a. Cleaning
b. Sampling
c. Normalizing
d. Filtering
Question 39
Correct
The ______________ linkage criterion is a popular choice for hierarchical clustering, which merges the two clusters based on the mean
distance between their points.
Select one:
a. Average
b. Single
c. Complete
d. Centroid
Question 40
Correct
Which of the following file types can be imported into KNIME?
Select one:
a. Excel
b. All of the above
c. CSV
d. XML
Question 41
Correct
Which of the following is NOT a common application of the k-means algorithm?
Select one:
a. Customer segmentation
b. Anomaly detection
c. Image compression
d. Regression analysis
Question 42
Incorrect
How does supervised learning differ from unsupervised learning?
Select one:
a. Supervised learning involves predicting a continuous value, while unsupervised learning involves predicting a categorical 
value
b. Supervised learning involves predicting a value, while unsupervised learning involves clustering data
c. Supervised learning involves labeled data, while unsupervised learning involves unlabeled data
d. Supervised learning involves clustering data, while unsupervised learning involves predicting a value
Question 43
Correct
What is the EM algorithm used to estimate in the "E" step?
Select one:
a. The prediction accuracy of the model
b. The model parameters
c. The likelihood of the model
d. The latent variables
Question 44
Correct
What is the advantage of the Naive Bayes classifier over other classifiers?
Select one:
a. It is faster to train and predict
b. It is more accurate
c. It is able to handle large amounts of data
d. It is more flexible
Question 45
Incorrect
What is the role of the learning rate (η) in the Hebb rule?
Select one:
a. It determines the speed of the weight update
b. It determines the size of the weight update 
c. It determines the direction of the weight update
d. It determines the convergence of the weight update
Question 46
Incorrect
The ______________ linkage criterion is a popular choice for hierarchical clustering, which merges the two clusters that have the
maximum distance between them.
Select one:
a. Centroid
b. Single
c. Complete 
d. Average
Question 47
Correct
What is an example of a batch learning algorithm used for regression tasks?
Select one:
a. Support vector machine
b. Decision tree
c. K-nearest neighbors
d. Linear regression
Question 48
Correct
The KL distance can be used to measure the information lost when approximating one distribution with another. In this context, the
distribution being approximated is known as the:
Select one:
a. Target distribution
b. Base distribution
c. Approximation distribution
d. Reference distribution
Question 49
Correct
What is an example of a batch learning algorithm used for feature selection tasks?
Select one:
a. Mutual information
b. Variance threshold
c. All of the above
d. Recursive feature elimination
Question 50
Correct
The KL distance is often used in machine learning to evaluate the performance of a classification model. In this context, a low KL
distance indicates that the model's predicted class probabilities are:
Select one:
a. Very similar to the true class probabilities
b. Very different from the true class probabilities
c. Somewhat similar to the true class probabilities
d. Somewhat different from the true class probabilities
◄ Midterm Exam
Jump to...

Midterm Lab Exam_ Attempt Review

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Midterm Lab Exam_ Attempt Review

Uploaded by

Copyright:

Available Formats

6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Home / My courses / UGRD-CYBS6101-2333T / MIDTERM EXAMINATION / Midterm Lab Exam

Started on Friday, 21 June 2024, 10:55 AM

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

Hierarchical clustering is a type of ______________ technique.

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

What is the disadvantage of the Naive Bayes classifier?

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

What is the Kullback-Leibler (KL) distance used for?

Mark 0.00 out of 1.00

Mark 1.00 out of 1.00

Which of the following is NOT a feature of KNIME?

Mark 1.00 out of 1.00

What is the main goal of the k-means algorithm?

Mark 1.00 out of 1.00

What is the Hebb rule?

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

What is the equation for the Hebb rule?

Mark 1.00 out of 1.00

What is an edge in a Bayesian network?

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

What is a directed acyclic graph (DAG)?

Mark 1.00 out of 1.00

What is an example of a regression task in supervised learning?

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

How is KNIME different from other data analysis tools?

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

What is the assumption made by the Naive Bayes classifier?

Mark 1.00 out of 1.00

What is an example of a batch learning algorithm used for classification tasks?

Mark 1.00 out of 1.00

What is the EM algorithm used to optimize in the "M" step?

Mark 0.00 out of 1.00

How is the final set of clusters determined in the k-means algorithm?

Mark 1.00 out of 1.00

Mark 1.00 out of 1.00

Mark 0.00 out of 1.00

Mark 1.00 out of 1.00

Hierarchical clustering is a type of ______________ clustering.

Mark 0.00 out of 1.00

What is an example of a real-world application of directed acyclic graphs (DAGs)?

Mark 1.00 out of 1.00

Is the least squares method a deterministic or a probabilistic method?

Mark 1.00 out of 1.00

What is an example of a batch learning algorithm?

Mark 1.00 out of 1.00

What is a key characteristic of Bayesian networks?

Mark 1.00 out of 1.00

Which of the following is NOT a limitation of the k-means algorithm?

Mark 1.00 out of 1.00

How is the Hebb rule used in the training of a neural network?

Mark 1.00 out of 1.00

What is the "M" step in the EM algorithm?

Mark 1.00 out of 1.00