Data Analysis Powerpoint 11
MACHINE LEARNING
NAME:
INSTITUTION:
The Lake is Our Home
• The accuracy of the regression is 0.729 (72.9%). This indicates how well the
chosen regression model fits the data.
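A minimal sketch of how an accuracy figure like this could be computed in R, assuming the regression is a logistic regression and that accuracy means the share of correct classifications at a 0.5 cutoff. The slides do not state either point, and the data frame `loans` with its columns `income`, `debt_ratio` and `creditworthy` is simulated purely for illustration.

```r
# Hypothetical data: the original dataset is not shown, so one is simulated.
set.seed(42)
n <- 500
loans <- data.frame(
  income     = rnorm(n, mean = 50, sd = 15),  # hypothetical predictor
  debt_ratio = runif(n, min = 0, max = 1)     # hypothetical predictor
)

# Simulate a binary outcome (1 = creditworthy) from a known logistic model.
log_odds <- -2 + 0.05 * loans$income - 1.5 * loans$debt_ratio
loans$creditworthy <- rbinom(n, size = 1, prob = plogis(log_odds))

# Fit the logistic regression.
fit <- glm(creditworthy ~ income + debt_ratio, data = loans, family = binomial)

# Turn fitted probabilities into classes with an assumed 0.5 cutoff.
pred_prob  <- predict(fit, type = "response")
pred_class <- ifelse(pred_prob > 0.5, 1, 0)

# Accuracy = proportion of observations whose predicted class matches the actual one.
accuracy <- mean(pred_class == loans$creditworthy)

print(table(Predicted = pred_class, Actual = loans$creditworthy))
print(round(accuracy, 3))
```

Accuracy computed on the same data used for fitting tends to be optimistic; a held-out test set gives a fairer estimate.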
Discriminant Analysis
• The discriminant analysis divided loan status into three groups: below 200 million,
between 200 and 400 million, and above that range. The highest indicator of
creditworthiness, as determined by the LDA, was found in the group with loan status
below 200 million, at 47.04%.
• The group with loan status between 200 and 400 million had a somewhat lower
percentage, at 40.6%. The group with the highest loan status had the lowest
percentage, at 12.3%.
• It is important to note that these results are specific to the dataset and variables used
in the analysis, and should not be generalized to other situations without further
investigation and analysis.
R CODE for the Analysis
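A minimal sketch of how a three-group discriminant analysis like the one described above might be run in R with `lda()` from the MASS package. The data are simulated, the column names (`income`, `years_employed`, `loan_amount`, `loan_group`) are hypothetical, and the upper cutoff of 400 is an assumption; none of this is the original analysis code.

```r
library(MASS)  # provides lda()

# Hypothetical data: simulated because the original dataset is not shown.
set.seed(1)
n <- 300
credit <- data.frame(
  income         = rnorm(n, mean = 60, sd = 20),  # hypothetical predictor
  years_employed = rpois(n, lambda = 6)           # hypothetical predictor
)

# Hypothetical loan amount (in millions), loosely driven by the predictors.
credit$loan_amount <- pmax(
  0,
  100 + 2.5 * credit$income + 10 * credit$years_employed + rnorm(n, sd = 80)
)

# Bin loan amount into three groups; the 400 cutoff is assumed.
credit$loan_group <- cut(
  credit$loan_amount,
  breaks = c(-Inf, 200, 400, Inf),
  labels = c("below_200", "200_to_400", "above_400")
)

# Fit the linear discriminant analysis.
fit <- lda(loan_group ~ income + years_employed, data = credit)

# Prior probabilities: by default, the share of observations in each group.
print(fit$prior)

# Share of observations assigned to their own group (in-sample).
pred <- predict(fit)$class
print(mean(pred == credit$loan_group))
```

By default `lda()` reports the proportion of observations in each group as prior probabilities. The slides do not say whether the percentages quoted above come from that part of the output.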