Deep Learning-Based Classification of News Texts Using Doc2Vec Model

Yükleniyor...
Küçük Resim

Tarih

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Araştırma projeleri

Organizasyon Birimleri

Dergi sayısı

Özet

The rapid increment in internet usage has also resulted in bulk gerenation of text data. Therefore, investigation of new techniques for automatic classification of textual content is needed as manually managing unstructured text is challenging. The main objective of text classification is to train a model such that it should place an unseen text into correct category. In this study, text classification was performed using the Doc2vec word embedding method on the Turkish Text Classification 3600 (TTC-3600) dataset consisting of Turkish news texts and the BBC-News dataset consisting of English news texts. As the classification method, deep learning-based CNN and traditional machine learning classification methods Gauss Naive Bayes (GNB), Random Forest (RF), Naive Bayes (NB) and Support Vector Machine (SVM) are used. In the proposed model, the highest result was obtained as 94.17% in the Turkish dataset and 96.41% in the English dataset in the classification made with CNN. © 2021 IEEE.

Açıklama

1st International Conference on Artificial Intelligence and Data Analytics, CAIDA 2021 -- 6 April 2021 through 7 April 2021 --

Anahtar Kelimeler

Deep Learning, Doc2Vec, Machine Learning, Text Classification, Text Preprocessing, Advanced Analytics, Barium compounds, Classification (of information), Classifiers, Decision trees, Embeddings, Learning systems, Support vector machines, Text processing, Automatic classification, Classification methods, Embedding method, Internet usage, Machine learning classification, Text classification, Textual content, Unstructured texts, Deep learning

Kaynak

2021 1st International Conference on Artificial Intelligence and Data Analytics, CAIDA 2021

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Onay

İnceleme

Ekleyen

Referans Veren