Multi-Label Classification of Text Documents Using Deep Learning

Recently, studies in the field of Natural Language Processing and its related applications continue to mount up. Machine learning is proven to be predominantly data-driven in the sense that generic model building methods are used and then tailored to specific application domains. Needless to say, this has proven to be a very effective approach in modeling the complicated data dependencies we frequently experience in practice, making very few assumptions, and allowing the information to talk for themselves. Examples of these applications can be found in chemical process engineering, climate science, healthcare, and linguistic processing systems for natural languages, to name a few. Text classification is one of the important machine learning tasks that is used in many digital applications today; such as in document filtering, search engines, document management systems, and many more. Text classification is the process of categorizing of text documents into a given set of labels. Furthermore, multi-label text classification is the task of categorization of text documents into one or more labels simultaneously. Over the years, many methods for classifying text documents have been proposed, including the popularly known bag of words (BoW) method, support vector machine (SVM), tree induction, and label-vector embedding, to mention a few. These kinds of tools can be used in many digital applications, such as document filtering, search engines, document management systems, etc. Lately, deep learning-based approaches are getting more attention, especially in extreme multi-label text classification case. Deep learning has proven to be one of the major solutions to many machine learning applications, especially those involving high-dimensional and unstructured data. However, it is of paramount importance in many applications to be able to reason accurately about the uncertainties associated with the predictions of the models. In this paper, we explore and compare the recent deep learning-based methods for multi-label text classification. We investigate two scenarios. First, multi-label classification model with ordinary embedding layer, and second with Glove, word2vec, and FastText as pre-trained embedding corpus for the given models. We evaluated these different neural network model performances in terms of multi-label evaluation metrics for the two approaches, and compare the results with the previous studies.

ORCID

Görür, Abdül Kadir

Keywords

Multi-Label Text Classification, Machine Learning, Deep Learning, Natural Language Processing, Word Embedding

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, 01 natural sciences, 0105 earth and related environmental sciences

WoS Q

N/A

Scopus Q

N/A

OpenCitations Citation Count

18

Source

8th IEEE International Conference on Big Data (Big Data) -- DEC 10-13, 2020 -- ELECTR NETWORK

Start Page

4681

End Page

4689

URI

https://doi.org/10.1109/BigData50022.2020.9378266
https://hdl.handle.net/20.500.12416/9691

Collections

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

PlumX Metrics

Citations

CrossRef : 2

Scopus : 21

Captures

Mendeley Readers : 63

Full item page

SCOPUS™ Citations

21

checked on May 30, 2026

Web of Science™ Citations

10

checked on May 30, 2026

Page Views

5

checked on May 30, 2026

Google Scholar™

Check

Multi-Label Classification of Text Documents Using Deep Learning

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

Green Open Access

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

BIP! Indicators

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

Description

ORCID

Keywords

Fields of Science

Citation

WoS Q

Scopus Q

OpenCitations Citation Count

Source

Volume

Issue

Start Page

End Page

URI

Collections

PlumX Metrics

Citations

Captures

SCOPUS™ Citations

21

Web of Science™ Citations

10

Page Views

5

Google Scholar™

OpenAlex FWCI

1.0605

Sustainable Development Goals

SDG data is not available