Download as pdf or txt
Download as pdf or txt
You are on page 1of 17

6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Home / My courses / UGRD-CYBS6101-2333T / MIDTERM EXAMINATION / Midterm Lab Exam

Started on Friday, 21 June 2024, 10:55 AM


State Finished
Completed on Friday, 21 June 2024, 11:05 AM
Time taken 9 mins 29 secs
Marks 43.00/50.00
Grade 86.00 out of 100.00

Question 1
Correct

Mark 1.00 out of 1.00

The KL distance between two discrete probability distributions P and Q is defined as:

Select one:
a. The sum of the ratio of the probabilities of each event in P and Q
b. The sum of the differences between the probabilities of each event in P and Q
c. The sum of the logarithm of the ratio of the probabilities of each event in P and Q
d. The sum of the products of the probabilities of each event in P and Q

Question 2

Correct

Mark 1.00 out of 1.00

Hierarchical clustering is a type of ______________ technique.

Select one:
a. Regression
b. Dimensionality reduction
c. Classification
d. Clustering

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 1/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 3

Correct

Mark 1.00 out of 1.00

Can the least squares method be used for multiple linear regression?

Select one:
a. It depends on the method used to transform the data set
b. No
c. It depends on the data set
d. Yes

Question 4

Correct

Mark 1.00 out of 1.00

In information theory, the KL distance can be used to measure the information lost when approximating one distribution with
another. Which of the following is NOT a property of the KL distance in this context?

Select one:
a. It is non-negative
b. It is zero only when the two distributions are identical
c. It is always positive
d. It is non-symmetric

Question 5

Correct

Mark 1.00 out of 1.00

What is the disadvantage of the Naive Bayes classifier?

Select one:
a. It is slower to train and predict
b. It is unable to handle large amounts of data
c. It is inflexible
d. It is less accurate

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 2/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 6

Correct

Mark 1.00 out of 1.00

What is the process of evaluating the performance of a trained perceptron on unseen data called?

Select one:
a. Testing
b. Pruning
c. Validation
d. Training

Question 7

Correct

Mark 1.00 out of 1.00

What is the process of removing unnecessary weights from a trained perceptron called?

Select one:
a. Validation
b. Pruning
c. Testing
d. Training

Question 8
Correct

Mark 1.00 out of 1.00

What is the Kullback-Leibler (KL) distance used for?

Select one:
a. To measure the uncertainty of a probability distribution
b. To measure the similarity between two probability distributions
c. To measure the predictability of a probability distribution
d. To measure the dissimilarity between two probability distributions

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 3/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 9

Incorrect

Mark 0.00 out of 1.00

How is the line of best fit calculated using the least squares method?

Select one:
a. By minimizing the mean of the data set
b. By minimizing the sum of the squares of the errors between the data points and the line of best fit
c. By minimizing the variance of the data set
d. By minimizing the sum of the absolute values of the errors between the data points and the line of best fit 

Question 10

Correct

Mark 1.00 out of 1.00

Which of the following is NOT a feature of KNIME?

Select one:
a. Data storage
b. Machine learning
c. Data transformation
d. Flow-based programming

Question 11
Correct

Mark 1.00 out of 1.00

What is the main goal of the k-means algorithm?

Select one:
a. To partition a dataset into a specified number of clusters
b. To classify data into predefined categories
c. To predict the value of a continuous target variable
d. To discover patterns or relationships within a dataset

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 4/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 12

Correct

Mark 1.00 out of 1.00

What is the Hebb rule?

Select one:
a. A rule used to adjust the weights in a neural network
b. A rule used to determine the structure of a neural network
c. A rule used to calculate the output of a neural network
d. A rule used to determine the input to a neural network

Question 13

Correct

Mark 1.00 out of 1.00

How does the k-means algorithm determine which data points belong to which cluster?

Select one:
a. By evaluating the variance of each cluster
b. By computing the distance between data points and the centroid of each cluster
c. By evaluating the probability that a data point belongs to each cluster
d. By comparing the data point to the characteristics of each cluster

Question 14
Correct

Mark 1.00 out of 1.00

What is the equation for the Hebb rule?

Select one:
a. w(new) = w(old) + η(output)x(input)
b. w(new) = w(old) + η(output - target)x(input)
c. w(new) = w(old) + η(input - output)x(target)
d. w(new) = w(old) + η(target - output)x(input)

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 5/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 15

Correct

Mark 1.00 out of 1.00

What is an edge in a Bayesian network?

Select one:
a. A point in the network where two or more nodes meet
b. None of the above
c. A variable in the system being modeled
d. A probabilistic relationship between two variables

Question 16

Correct

Mark 1.00 out of 1.00

How does the least squares method handle outliers in the data set?

Select one:
a. It removes them
b. It gives them more weight
c. It gives them less weight
d. It ignores them

Question 17
Correct

Mark 1.00 out of 1.00

What is the process of adjusting the weights of a perceptron based on the output of the previous layer in a multi-layer perceptron
called?

Select one:
a. Training
b. Testing
c. Backpropagation
d. Validation

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 6/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 18

Correct

Mark 1.00 out of 1.00

What is a directed acyclic graph (DAG)?

Select one:
a. A graph in which the edges do not have a direction and there are no cycles
b. A graph in which the edges have a direction and there are no cycles
c. A graph in which the edges have a direction and there are cycles
d. A graph in which the edges do not have a direction and there are cycles

Question 19

Correct

Mark 1.00 out of 1.00

What is an example of a regression task in supervised learning?

Select one:
a. Grouping customers into different segments based on their spending habits
b. Predicting the stock price for the next day based on historical data
c. Determining whether an email is spam or not
d. Predicting the price of a house based on its characteristics

Question 20
Correct

Mark 1.00 out of 1.00

What is the advantage of using the Gaussian Naive Bayes classifier over other types of Naive Bayes classifiers?

Select one:
a. It is able to handle continuous features
b. It is faster to train and predict
c. It is more accurate
d. It is able to handle categorical features

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 7/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 21

Correct

Mark 1.00 out of 1.00

How is KNIME different from other data analysis tools?

Select one:
a. It has a user-friendly interface
b. It is free
c. It allows users to build custom data pipelines
d. It is open source

Question 22

Correct

Mark 1.00 out of 1.00

The KL distance can be used to measure the difference between two probability distributions in terms of the information content of
the distributions. In this context, the KL distance is also known as:

Select one:
a. The information distance
b. The information gain
c. The information divergence
d. The information ratio

Question 23

Correct

Mark 1.00 out of 1.00

What is the assumption made by the Naive Bayes classifier?

Select one:
a. That the features in the data are dependent on each other
b. That the features in the data are uniformly distributed
c. That the features in the data are normally distributed
d. That the features in the data are independent of each other

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 8/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 24

Correct

Mark 1.00 out of 1.00

What is an example of a batch learning algorithm used for classification tasks?

Select one:
a. K-nearest neighbors
b. Decision tree
c. Linear regression
d. Support vector machine

Question 25

Correct

Mark 1.00 out of 1.00

What is the EM algorithm used to optimize in the "M" step?

Select one:
a. The likelihood of the model
b. The model parameters
c. The prediction accuracy of the model
d. The latent variables

Question 26
Incorrect

Mark 0.00 out of 1.00

How is the final set of clusters determined in the k-means algorithm?

Select one:
a. By selecting the set of clusters that maximize the sum of squared errors
b. By selecting the set of clusters that minimize the within-cluster variance 
c. By selecting the set of clusters that maximize the within-cluster variance
d. By selecting the set of clusters that minimize the sum of squared errors

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 9/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 27

Correct

Mark 1.00 out of 1.00

What is the technical term for a node in a directed acyclic graph (DAG)?

Select one:
a. Vertex
b. Graph
c. Edge
d. Cycle

Question 28

Correct

Mark 1.00 out of 1.00

What is the process of adjusting the weights of a perceptron based on the input data and the desired output called?

Select one:
a. Training
b. Validation
c. Pruning
d. Testing

Question 29
Incorrect

Mark 0.00 out of 1.00

What is the process of calculating the error between the desired output and the actual output of a perceptron called?

Select one:
a. Pruning
b. Validation
c. Training
d. Testing 

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 10/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 30

Correct

Mark 1.00 out of 1.00

Hierarchical clustering is a type of ______________ clustering.

Select one:
a. Deep
b. Hybrid
c. Hierarchical
d. Flat

Question 31

Incorrect

Mark 0.00 out of 1.00

What is an example of a real-world application of directed acyclic graphs (DAGs)?

Select one:
a. All of the above
b. Computer networks
c. Data pipelines 
d. Social media networks

Question 32
Correct

Mark 1.00 out of 1.00

Is the least squares method a deterministic or a probabilistic method?

Select one:
a. Deterministic
b. Probabilistic
c. Both deterministic and probabilistic
d. Neither deterministic nor probabilistic

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 11/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 33

Correct

Mark 1.00 out of 1.00

What is an example of a batch learning algorithm?

Select one:
a. Support vector machine
b. K-nearest neighbors
c. All of the above
d. Linear regression

Question 34

Correct

Mark 1.00 out of 1.00

What is a key characteristic of Bayesian networks?

Select one:
a. They are trained on large amounts of data
b. They use linear algebra for prediction
c. They are based on probability theory
d. They use decision trees for prediction

Question 35
Correct

Mark 1.00 out of 1.00

Which of the following is NOT a limitation of the k-means algorithm?

Select one:
a. It is sensitive to the initial placement of centroids
b. It may produce suboptimal results if the clusters are not spherical
c. It requires the user to specify the number of clusters in advance
d. It is not affected by the scale of the variables

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 12/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 36

Correct

Mark 1.00 out of 1.00

How is the Hebb rule used in the training of a neural network?

Select one:
a. It is used to determine the structure of the neural network
b. It is used to determine the input to the neural network
c. It is used to calculate the output of the neural network
d. It is used to adjust the weights of the neural network based on the input and output

Question 37

Correct

Mark 1.00 out of 1.00

What is the "M" step in the EM algorithm?

Select one:
a. The step where the prediction accuracy of the model is calculated
b. The step where the expectation of the latent variables is calculated
c. The step where the model parameters are updated
d. The step where the likelihood of the model is maximized

Question 38
Correct

Mark 1.00 out of 1.00

What is the process of transforming data into a consistent format called?

Select one:
a. Cleaning
b. Sampling
c. Normalizing
d. Filtering

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 13/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 39

Correct

Mark 1.00 out of 1.00

The ______________ linkage criterion is a popular choice for hierarchical clustering, which merges the two clusters based on the mean
distance between their points.

Select one:
a. Average
b. Single
c. Complete
d. Centroid

Question 40
Correct

Mark 1.00 out of 1.00

Which of the following file types can be imported into KNIME?

Select one:
a. Excel
b. All of the above
c. CSV
d. XML

Question 41

Correct

Mark 1.00 out of 1.00

Which of the following is NOT a common application of the k-means algorithm?

Select one:
a. Customer segmentation
b. Anomaly detection
c. Image compression
d. Regression analysis

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 14/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 42

Incorrect

Mark 0.00 out of 1.00

How does supervised learning differ from unsupervised learning?

Select one:
a. Supervised learning involves predicting a continuous value, while unsupervised learning involves predicting a categorical 
value
b. Supervised learning involves predicting a value, while unsupervised learning involves clustering data
c. Supervised learning involves labeled data, while unsupervised learning involves unlabeled data
d. Supervised learning involves clustering data, while unsupervised learning involves predicting a value

Question 43

Correct

Mark 1.00 out of 1.00

What is the EM algorithm used to estimate in the "E" step?

Select one:
a. The prediction accuracy of the model
b. The model parameters
c. The likelihood of the model
d. The latent variables

Question 44
Correct

Mark 1.00 out of 1.00

What is the advantage of the Naive Bayes classifier over other classifiers?

Select one:
a. It is faster to train and predict
b. It is more accurate
c. It is able to handle large amounts of data
d. It is more flexible

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 15/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 45

Incorrect

Mark 0.00 out of 1.00

What is the role of the learning rate (η) in the Hebb rule?

Select one:
a. It determines the speed of the weight update
b. It determines the size of the weight update 
c. It determines the direction of the weight update
d. It determines the convergence of the weight update

Question 46

Incorrect

Mark 0.00 out of 1.00

The ______________ linkage criterion is a popular choice for hierarchical clustering, which merges the two clusters that have the
maximum distance between them.

Select one:
a. Centroid
b. Single
c. Complete 
d. Average

Question 47

Correct

Mark 1.00 out of 1.00

What is an example of a batch learning algorithm used for regression tasks?

Select one:
a. Support vector machine
b. Decision tree
c. K-nearest neighbors
d. Linear regression

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 16/17
6/26/24, 8:34 AM Midterm Lab Exam: Attempt review

Question 48

Correct

Mark 1.00 out of 1.00

The KL distance can be used to measure the information lost when approximating one distribution with another. In this context, the
distribution being approximated is known as the:

Select one:
a. Target distribution
b. Base distribution
c. Approximation distribution
d. Reference distribution

Question 49
Correct

Mark 1.00 out of 1.00

What is an example of a batch learning algorithm used for feature selection tasks?

Select one:
a. Mutual information
b. Variance threshold
c. All of the above
d. Recursive feature elimination

Question 50

Correct

Mark 1.00 out of 1.00

The KL distance is often used in machine learning to evaluate the performance of a classification model. In this context, a low KL
distance indicates that the model's predicted class probabilities are:

Select one:
a. Very similar to the true class probabilities
b. Very different from the true class probabilities
c. Somewhat similar to the true class probabilities
d. Somewhat different from the true class probabilities

◄ Midterm Exam

Jump to...

https://trimestralexam.amaesonline.com/2333B/mod/quiz/review.php?attempt=52508&cmid=12925&showall=1 17/17

You might also like