Çankaya GCRIS Standart veritabanının içerik oluşturulması ve kurulumu Research Ecosystems (https://www.researchecosystems.com) tarafından devam etmektedir. Bu süreçte gördüğünüz verilerde eksikler olabilir.
 

Multi-Label Classification of Text Documents Using Deep Learning

dc.authorscopusid 57222721182
dc.authorscopusid 6603501593
dc.authorscopusid 7006606908
dc.authorscopusid 8662600400
dc.authorwosid Görür, Abdül Kadir/Aay-1590-2021
dc.contributor.author Mohammed, Hamza Haruna
dc.contributor.author Dogdu, Erdogan
dc.contributor.author Gorur, Abdul Kadir
dc.contributor.author Choupani, Roya
dc.date.accessioned 2025-05-11T18:35:41Z
dc.date.available 2025-05-11T18:35:41Z
dc.date.issued 2020
dc.department Çankaya University en_US
dc.department-temp [Mohammed, Hamza Haruna; Choupani, Roya] Cankaya Univ, Comp Engn Dept, Ankara, Turkey; [Dogdu, Erdogan] Angelo State Univ, Comp Sci Dept, San Angelo, TX 76909 USA; [Gorur, Abdul Kadir] Cankaya Univ, Software Engn Dept, Ankara, Turkey en_US
dc.description.abstract Recently, studies in the field of Natural Language Processing and its related applications continue to mount up. Machine learning is proven to be predominantly data-driven in the sense that generic model building methods are used and then tailored to specific application domains. Needless to say, this has proven to be a very effective approach in modeling the complicated data dependencies we frequently experience in practice, making very few assumptions, and allowing the information to talk for themselves. Examples of these applications can be found in chemical process engineering, climate science, healthcare, and linguistic processing systems for natural languages, to name a few. Text classification is one of the important machine learning tasks that is used in many digital applications today; such as in document filtering, search engines, document management systems, and many more. Text classification is the process of categorizing of text documents into a given set of labels. Furthermore, multi-label text classification is the task of categorization of text documents into one or more labels simultaneously. Over the years, many methods for classifying text documents have been proposed, including the popularly known bag of words (BoW) method, support vector machine (SVM), tree induction, and label-vector embedding, to mention a few. These kinds of tools can be used in many digital applications, such as document filtering, search engines, document management systems, etc. Lately, deep learning-based approaches are getting more attention, especially in extreme multi-label text classification case. Deep learning has proven to be one of the major solutions to many machine learning applications, especially those involving high-dimensional and unstructured data. However, it is of paramount importance in many applications to be able to reason accurately about the uncertainties associated with the predictions of the models. In this paper, we explore and compare the recent deep learning-based methods for multi-label text classification. We investigate two scenarios. First, multi-label classification model with ordinary embedding layer, and second with Glove, word2vec, and FastText as pre-trained embedding corpus for the given models. We evaluated these different neural network model performances in terms of multi-label evaluation metrics for the two approaches, and compare the results with the previous studies. en_US
dc.description.woscitationindex Conference Proceedings Citation Index - Science
dc.identifier.doi 10.1109/BigData50022.2020.9378266
dc.identifier.endpage 4689 en_US
dc.identifier.isbn 9781728162515
dc.identifier.issn 2639-1589
dc.identifier.scopus 2-s2.0-85103846854
dc.identifier.scopusquality N/A
dc.identifier.startpage 4681 en_US
dc.identifier.uri https://doi.org/10.1109/BigData50022.2020.9378266
dc.identifier.uri https://hdl.handle.net/20.500.12416/9691
dc.identifier.wos WOS:000662554704094
dc.identifier.wosquality N/A
dc.language.iso en en_US
dc.publisher Ieee en_US
dc.relation.ispartof 8th IEEE International Conference on Big Data (Big Data) -- DEC 10-13, 2020 -- ELECTR NETWORK en_US
dc.relation.ispartofseries IEEE International Conference on Big Data
dc.relation.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.scopus.citedbyCount 13
dc.subject Multi-Label Text Classification en_US
dc.subject Machine Learning en_US
dc.subject Deep Learning en_US
dc.subject Natural Language Processing en_US
dc.subject Word Embedding en_US
dc.title Multi-Label Classification of Text Documents Using Deep Learning en_US
dc.type Conference Object en_US
dc.wos.citedbyCount 5
dspace.entity.type Publication

Files