IMPLEMENTASI K-NEAREST NEIGHBORD PADA RAPIDMINER UNTUK PREDIKSI KELULUSAN MAHASISWA

Authors

  • Sumarlin Sumarlin
  • Dewi Anggraini

DOI:

https://doi.org/10.52972/hoaq.vol10no1.p35-41

Keywords:

K-Nearest Neighbor, Cross Validation, Confusion Matrix, Kurva ROC

Abstract

Data on graduate students is an important part in determining the quality of a private and public university. Graduate data is included in important assessments in the accreditation process. Data from Uyelindo Kupang STIKOM graduates every year will continue to grow and accumulate like neglected data because it is rarely used. To maximize student data into information that can be used by universities, the data must be processed in this case used as training data in a study using data mining to obtain information in the form of predictions of graduation from Kupang Uyelindo STIKOM students. The method used in this study is K-Nearest Neighbor using rapidminer software to measure K-Nearest Neighbor's accuracy against student graduate data. The criteria used were in the form of student names, gender, cumulative achievement index (GPA) from semester 1 to 6. In applying the K-Nearest Neighbor algorithm can be used to produce predictions of student graduation. To measure the performance of the k-nearest neighbor algorithm, the Cross Validation, Confusion Matrix and ROC Curves methods are used, in this study using a 5-fold cross validation to predict student graduation. From 100 student dataset records Uyelindo Kupang STIKOM graduates obtained accuracy rate reached 82% and included a very good classification because it has an AUC value between 0.90-1.00, which is 0.971, so it can be concluded that the accuracy of testing of student graduation models using K-Nearest Neighbor (K-NN) algorithm is influenced by the number of data clusters. Accuracy and the highest AUC value of 5-fold validation is to cluster data k = 4 with the accuracy value of 90%.

References

Azwar. (2004). Penyusunan Skala Psikologi. Yogyakarta: Pustaka Pelajar.

Buku VI Matriks Penilaian Instrumen Akreditasi Program Studi Badan Akreditasi Nasional Perguruan Tinggi. (2008).

Bramer, M., 2007. Principles of Data Mining, Springer, London.

Ernastuti, S. &. (2010). Graduation Prediction of Gunadarma University Students Using Algorithm and Naïve Bayes C4.5 Algoritmh

Gorunescu, F., 2011. Data Mining : Concepts, Models, and Techniques. Verlag Berlin Heidelberg, Springer.

Han, J., Kamber, M., 2006. Data Mining Concept and Tehniques, San Fransisco, Morgan Kauffman.

Kohavi, R., 1995. A study of cross validation and bootstrap for accuracy estimation and model selection. Proceedings of the International JointConference on Articial Intelligence, vol. 2, 1137-1143.

Liao., 2007. Recent Advances in Data Mining of Enterprise Data Algoritms and Application. World Scientific Publishing, Singapore.

M. Mohankumar, S. Amuthakkani and G. Jeyamala, “Comparative Analysis of Decision Tree Algorithms for The Prediction of Eligibility of A Man for Availing Bank Loan,” International Journal of Advanced Research in Biology Engineering Science and Technology (IJARBEST), vol. 2, no. 15, pp. 360-366, 2016.

Ogor. (2007). Student Academic Performance Monitoring and Evaluation Using Data Mining Techniques.

Oyelade, O. &. (2010). Application of kmeans Clustering algorithm for predicting of Students Academic Performace. International Journal of Computer Science and Information Security, 292-295.

Saaty, T.L., 1980. The Analytic Hierarchy Process, McGraw-Hill, New York

Santoso, B. (2007). Data Mining Teknik Pemanfaatan Data untuk Keperluan Bisnis (1 ed.). Yogyakarta: Graha Ilmu.

S. D. Jadhav and H. Channe, “Comparative Study of K-NN, Naive Bayes and Decision Tree Classification Techniques,” International Journal of Science and Research (IJSR), vol. 5, no. 1, pp. 1842-1845, January 2016.

Vercellis. (2009). Business Intelligence: Data Mining and Optimization for Decision Making Decision Making. John Willey & Sons Inc: Southern Gate.

Vrettos, K. &. (2009). Sentivity Analysis of Neural Network for Identifying the Factors for Collage Students Success. World Congress on Computer Science and Information Engineering. (978-0-7695-3507-4).

Yingkuachat, P. K. (2007). An Application Probabilitic Model to the Prediction of Student Graduation Using Bayesian Belief Network. ECTI Transaction on Computer and Technology, 63-71.

Downloads

Published

31-05-2018

How to Cite

Sumarlin, S., & Anggraini, D. (2018). IMPLEMENTASI K-NEAREST NEIGHBORD PADA RAPIDMINER UNTUK PREDIKSI KELULUSAN MAHASISWA. HOAQ (High Education of Organization Archive Quality) : Jurnal Teknologi Informasi, 10(1), 35–41. https://doi.org/10.52972/hoaq.vol10no1.p35-41

Most read articles by the same author(s)