ANALISIS CLUSTERING UNTUK PENGELOMPOKAN JUDUL SKRIPSI MAHASISWA MENGGUNAKAN METODE TF-IDF DAN ALGORITMA K-MEANS (STUDI KASUS : STT WASTUKANCANA)
Abstract
Pada STT. Wastukancana mahasiswa yang telah lulus pasti telah mendaftarkan judul skripsinya ke dalam sistem yang bernama e-research, pada sistem tersebut judul skripsi di kelompokkan menurut tahun ajaran judul tersebut diajukan oleh mahasiswa. Hal itu membuat mahasiswa yang akan mengajukan judul skripsi kesulitan dalam menentukan judul baru yang sebelumnya belum pernah diajukan. Masalah ini dapat diatasi dengan menerapkan metode pengelompokkan clustering terhadap judul skripsi yang ada.Tujuan dari penelitian ini yaitu melakukan analisis clusteringjudul skripsi mahasiswa dengan membandingkan kemiripan kata yang terdapat dalam judul skripsi tersebut menggunakan metode TF-IDF dan algoritma K-Means.Metode analisis yang digunakan yaitu pengumpulan data, text preprocessing, feature Selection, TF-IDF, dan text mining.Algoritma clustering yang digunakan yaitu algoritma K-Means dimana clustering ini bertujuan untuk mengelompokkan judul skripsi ke dalam cluster berdasarkan kemiripan kata yang terdapat pada judul skripsi tersebut.Hasil dari penelitian ini adalah pengelompokkan judul skripsi mahasiswa yang didapat berdasarkan cluster yang terbentuk. Hasil dari cluster ini dapat menjadi acuan sebagai rekomendasi dalam penyimpanan skripsi yang sudah dibuat dan penentuan judul skripsi yang akan datang.
References
H. Yu, L. Y. Chen, J. T. Yao, and X. N. Wang, “A three-way clustering method based on an improved DBSCAN algorithm,” Phys. A Stat. Mech. its Appl., vol. 535, p. 122289, 2019, doi: 10.1016/j.physa.2019.122289.
M. R. L. Iin Parlina, Agus Perdana Windarto, Anjar Wanto, “Memanfaatkan Algoritma K-Means Dalam Menentukan Pegawai Yang Layak Mengikuti Asessment Center,” Memanfaatkan Algoritm. K-Means Dalam Menentukan Pegawai Yang Layak Mengikuti Asessment Cent. Untuk Clust. Progr. Sdp, vol. 3, no. 1, pp. 87–93, 2018.
R. R. A. Siregar, F. A. Sinaga, and R. Arianto, “Aplikasi Penentuan Dosen Penguji Skripsi Menggunakan Metode TF-IDF dan Vector Space Model,” Comput. J. Comput. Sci. Inf. Syst., vol. 1, no. 2, p. 171, 2017, doi: 10.24912/computatio.v1i2.1014.
N. Alamsyah, “Perbandingan Algoritma Winnowing Dengan Algoritma Rabin Karp Untuk Mendeteksi Plagiarisme Pada Kemiripan Teks Judul Skripsi,” Technol. J. Ilm., vol. 8, no. 3, p. 124, 2017, doi: 10.31602/tji.v8i3.1116.
V. Amrizal, “Penerapan Metode Term Frequency Inverse Document Frequency (Tf-Idf) Dan Cosine Similarity Pada Sistem Temu Kembali Informasi Untuk Mengetahui Syarah Hadits Berbasis Web (Studi Kasus: Hadits Shahih Bukhari-Muslim),” J. Tek. Inform., vol. 11, no. 2, pp. 149–164, 2018, doi: 10.15408/jti.v11i2.8623.
P. H. Saputro, M. Aristin, and Dy. L. Tyas, “Berdasarkan Lirik Menggunakan Metode Tf-,” J. Teknoloi Inform. dan Terap., vol. 4, no. 1, pp. 45–50, 2017.
F. S. Sholihuda, B. Yuwono, and H. C. Rustamadji, “Pemanfaatan Text Mining Pada Sistem Pengolahan Skripsi Menggunakan Algoritma Naïve Bayes Classifier Dan Simple Additive Weighting,” Telematika, vol. 17, no. 2, p. 120, 2020, doi: 10.31315/telematika.v1i1.3379.
R. T. Wahyuni, D. Prastiyanto, and E. Supraptono, “Penerapan Algoritma Cosine Similarity dan Pembobotan TF-IDF pada Sistem Klasifikasi Dokumen Skripsi,” J. Tek. Elektro, vol. 9, no. 1, pp. 18–23, 2017, doi: 10.15294/jte.v9i1.10955.
C. S. Journal, “Implementasi Text Mining Pada Twitter Dengan Algoritma K-Means Clustering Sebagai Dasar,” vol. 9, no. 2, pp. 138–147, 2020.
A. Priyanto and M. R. Ma’arif, “Implementasi Web Scrapping dan Text Mining untuk Akuisisi dan Kategorisasi Informasi dari Internet (Studi Kasus: Tutorial Hidroponik),” Indones. J. Inf. Syst., vol. 1, no. 1, pp. 25–33, 2018, doi: 10.24002/ijis.v1i1.1664.
S. Budi, “Text Mining Untuk Analisis Sentimen Review Film Menggunakan Algoritma K-Means,” Techno.Com, vol. 16, no. 1, pp. 1–8, 2017, doi: 10.33633/tc.v16i1.1263.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright Notice
The copyright of the received article shall be assigned to the publisher of the journal. The intended copyright includes the right to publish the article in various forms (including reprints). The journal maintains the publishing rights to published articles. Therefore, the author must submit a statement of the Copyright Transfer Agreement.*)
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
In line with the license, authors and any users (readers and other researchers) are allowed to share and adapt the material only for non-commercial purposes. In addition, the material must be given appropriate credit, provided with a link to the license, and indicated if changes were made. If authors remix, transform or build upon the material, authors must distribute their contributions under the same license as the original.
Please find the rights and licenses in RABIT : Jurnal Teknologi dan Sistem Informasi Univrab. By submitting the article/manuscript of the article, the author(s) accept this policy.
1. License
The non-commercial use of the article will be governed by the Creative Commons Attribution license as currently displayed on Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
2. Author’s Warranties
The author warrants that the article is original, written by stated author(s), has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author(s).
3. User Rights
RABIT's spirit is to disseminate articles published are as free as possible. Under the Creative Commons license, RABIT permits users to copy, distribute, display, and perform the work for non-commercial purposes only. Users will also need to attribute authors and RABIT on distributing works in the journal.
4. Rights of Authors
Authors retain all their rights to the published works, such as (but not limited to) the following rights;
- Copyright and other proprietary rights relating to the article, such as patent rights,
- The right to use the substance of the article in own future works, including lectures and books,
- The right to reproduce the article for own purposes,
- The right to self-archive the article,
- The right to enter into separate, additional contractual arrangements for the non-exclusive distribution of the article's published version (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal (RABIT : Jurnal Teknologi dan Sistem Informasi Univrab).
5. Co-Authorship
If the article was jointly prepared by other authors, any authors submitting the manuscript warrants that he/she has been authorized by all co-authors to be agreed on this copyright and license notice (agreement) on their behalf, and agrees to inform his/her co-authors of the terms of this policy. RABIT will not be held liable for anything that may arise due to the author(s) internal dispute. RABIT will only communicate with the corresponding author.
6. Royalties
This agreement entitles the author to no royalties or other fees. To such extent as legally permissible, the author waives his or her right to collect royalties relative to the article in respect of any use of the article by RABIT.
7. Miscellaneous
RABIT will publish the article (or have it published) in the journal if the article’s editorial process is successfully completed. RABIT's editors may modify the article to a style of punctuation, spelling, capitalization, referencing and usage that deems appropriate. The author acknowledges that the article may be published so that it will be publicly accessible and such access will be free of charge for the readers as mentioned in point 3.