Comparative Analysis of Deep Learning and Traditional Machine Learning Models for Turkish Text Classification
Yükleniyor...
Tarih
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
İstanbul Sabahattin Zaim Üniversitesi
Erişim Hakkı
info:eu-repo/semantics/openAccess
Özet
In this study, using the word embedding method Doc2vec, the Turkish Text Classification 3600 (TTC-3600) dataset consisting of Turkish news texts was classified based on deep learning. Most commonly used classifiers were selected: Convolutional Neural Network (CNN), Gauss Naive Bayes (GNB), Random Forest (RF), Naïve Bayes (NB) and Support Vector Machine (SVM). While investigating the effect of text preprocessing steps on the success rate in the study, the results are compared with the previous studies with the TTC-3600 dataset. In the proposed model, a better accuracy rate was achieved with a result of 94.17% compared to the studies in the literature.
Açıklama
1st International Conference on Computing and Machine Intelligence (ICMI-2021) February 19-20, 2021, Istanbul, Turkey -- Editorial Board Dr. Akhtar JAMIL Dr. Alaa Ali HAMEED -- ISBN: 9786050667578 -- Istanbul Sabahattin Zaim University Yayınları; No. 57.
Anahtar Kelimeler
Turkish Text Classification, Doc2Vec, Text Preprocessing, Machine Learning, Deep Learning
Kaynak
1st International Conference on Computing and Machine Intelligence









