Comparative Analysis of Deep Learning and Traditional Machine Learning Models for Turkish Text Classification

Araştırma projeleri

Organizasyon Birimleri

Dergi sayısı

Özet

In this study, using the word embedding method Doc2vec, the Turkish Text Classification 3600 (TTC-3600) dataset consisting of Turkish news texts was classified based on deep learning. Most commonly used classifiers were selected: Convolutional Neural Network (CNN), Gauss Naive Bayes (GNB), Random Forest (RF), Naïve Bayes (NB) and Support Vector Machine (SVM). While investigating the effect of text preprocessing steps on the success rate in the study, the results are compared with the previous studies with the TTC-3600 dataset. In the proposed model, a better accuracy rate was achieved with a result of 94.17% compared to the studies in the literature.

Açıklama

1st International Conference on Computing and Machine Intelligence (ICMI-2021) February 19-20, 2021, Istanbul, Turkey -- Editorial Board Dr. Akhtar JAMIL Dr. Alaa Ali HAMEED -- ISBN: 9786050667578 -- Istanbul Sabahattin Zaim University Yayınları; No. 57.

Anahtar Kelimeler

Turkish Text Classification, Doc2Vec, Text Preprocessing, Machine Learning, Deep Learning

Kaynak

1st International Conference on Computing and Machine Intelligence

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Onay

İnceleme

Ekleyen

Referans Veren