Multi-Label Classification of Text Documents Using Deep Learning

Mohammed, Hamza Haruna; Dogdu, Erdogan; Gorur, Abdul Kadir; Choupani, Roya

Multi-Label Classification of Text Documents Using Deep Learning

dc.contributor.author	Mohammed, Hamza Haruna
dc.contributor.author	Dogdu, Erdogan
dc.contributor.author	Gorur, Abdul Kadir
dc.contributor.author	Choupani, Roya
dc.date.accessioned	2025-05-11T18:35:41Z
dc.date.available	2025-05-11T18:35:41Z
dc.date.issued	2020
dc.description.abstract	Recently, studies in the field of Natural Language Processing and its related applications continue to mount up. Machine learning is proven to be predominantly data-driven in the sense that generic model building methods are used and then tailored to specific application domains. Needless to say, this has proven to be a very effective approach in modeling the complicated data dependencies we frequently experience in practice, making very few assumptions, and allowing the information to talk for themselves. Examples of these applications can be found in chemical process engineering, climate science, healthcare, and linguistic processing systems for natural languages, to name a few. Text classification is one of the important machine learning tasks that is used in many digital applications today; such as in document filtering, search engines, document management systems, and many more. Text classification is the process of categorizing of text documents into a given set of labels. Furthermore, multi-label text classification is the task of categorization of text documents into one or more labels simultaneously. Over the years, many methods for classifying text documents have been proposed, including the popularly known bag of words (BoW) method, support vector machine (SVM), tree induction, and label-vector embedding, to mention a few. These kinds of tools can be used in many digital applications, such as document filtering, search engines, document management systems, etc. Lately, deep learning-based approaches are getting more attention, especially in extreme multi-label text classification case. Deep learning has proven to be one of the major solutions to many machine learning applications, especially those involving high-dimensional and unstructured data. However, it is of paramount importance in many applications to be able to reason accurately about the uncertainties associated with the predictions of the models. In this paper, we explore and compare the recent deep learning-based methods for multi-label text classification. We investigate two scenarios. First, multi-label classification model with ordinary embedding layer, and second with Glove, word2vec, and FastText as pre-trained embedding corpus for the given models. We evaluated these different neural network model performances in terms of multi-label evaluation metrics for the two approaches, and compare the results with the previous studies.	en_US
dc.identifier.doi	10.1109/BigData50022.2020.9378266
dc.identifier.isbn	9781728162515
dc.identifier.issn	2639-1589
dc.identifier.scopus	2-s2.0-85103846854
dc.identifier.uri	https://doi.org/10.1109/BigData50022.2020.9378266
dc.identifier.uri	https://hdl.handle.net/20.500.12416/9691
dc.language.iso	en	en_US
dc.publisher	Ieee	en_US
dc.relation.ispartof	8th IEEE International Conference on Big Data (Big Data) -- DEC 10-13, 2020 -- ELECTR NETWORK	en_US
dc.relation.ispartofseries	IEEE International Conference on Big Data
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Multi-Label Text Classification	en_US
dc.subject	Machine Learning	en_US
dc.subject	Deep Learning	en_US
dc.subject	Natural Language Processing	en_US
dc.subject	Word Embedding	en_US
dc.title	Multi-Label Classification of Text Documents Using Deep Learning	en_US
dc.type	Conference Object	en_US
dspace.entity.type	Publication
gdc.author.scopusid	57222721182
gdc.author.scopusid	6603501593
gdc.author.scopusid	7006606908
gdc.author.scopusid	8662600400
gdc.author.wosid	Görür, Abdül Kadir/Aay-1590-2021
gdc.bip.impulseclass	C4
gdc.bip.influenceclass	C5
gdc.bip.popularityclass	C4
gdc.coar.access	metadata only access
gdc.coar.type	text::conference output
gdc.collaboration.industrial	false
gdc.description.department	Çankaya University	en_US
gdc.description.departmenttemp	[Mohammed, Hamza Haruna; Choupani, Roya] Cankaya Univ, Comp Engn Dept, Ankara, Turkey; [Dogdu, Erdogan] Angelo State Univ, Comp Sci Dept, San Angelo, TX 76909 USA; [Gorur, Abdul Kadir] Cankaya Univ, Software Engn Dept, Ankara, Turkey	en_US
gdc.description.endpage	4689	en_US
gdc.description.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
gdc.description.scopusquality	N/A
gdc.description.startpage	4681	en_US
gdc.description.woscitationindex	Conference Proceedings Citation Index - Science
gdc.description.wosquality	N/A
gdc.identifier.openalex	W3085469666
gdc.identifier.wos	WOS:000662554704094
gdc.index.type	WoS
gdc.index.type	Scopus
gdc.oaire.diamondjournal	false
gdc.oaire.impulse	5.0
gdc.oaire.influence	3.1713696E-9
gdc.oaire.isgreen	false
gdc.oaire.popularity	8.270026E-9
gdc.oaire.publicfunded	false
gdc.oaire.sciencefields	0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields	02 engineering and technology
gdc.oaire.sciencefields	01 natural sciences
gdc.oaire.sciencefields	0105 earth and related environmental sciences
gdc.openalex.collaboration	International
gdc.openalex.fwci	1.0605
gdc.openalex.normalizedpercentile	0.82
gdc.opencitations.count	18
gdc.plumx.crossrefcites	2
gdc.plumx.mendeley	63
gdc.plumx.scopuscites	21
gdc.scopus.citedcount	21
gdc.virtual.author	Doğdu, Erdoğan
gdc.virtual.author	Görür, Abdül Kadir
gdc.virtual.author	Choupanı, Roya
gdc.wos.citedcount	10
relation.isAuthorOfPublication	0d453674-7998-4d57-a06c-03e13bb1e314
relation.isAuthorOfPublication	49bf2018-5b02-4799-b134-4bcbdb35fa19
relation.isAuthorOfPublication	2f2010e6-da07-4e28-95f8-660c2ca696ff
relation.isAuthorOfPublication.latestForDiscovery	0d453674-7998-4d57-a06c-03e13bb1e314
relation.isOrgUnitOfPublication	12489df3-847d-4936-8339-f3d38607992f
relation.isOrgUnitOfPublication	43797d4e-4177-4b74-bd9b-38623b8aeefa
relation.isOrgUnitOfPublication	0b9123e4-4136-493b-9ffd-be856af2cdb1
relation.isOrgUnitOfPublication.latestForDiscovery	12489df3-847d-4936-8339-f3d38607992f

Collections

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Multi-Label Classification of Text Documents Using Deep Learning

Files

Collections