Çankaya GCRIS Standart veritabanının içerik oluşturulması ve kurulumu Research Ecosystems (https://www.researchecosystems.com) tarafından devam etmektedir. Bu süreçte gördüğünüz verilerde eksikler olabilir.
 

Application of BiLSTM-CRF model with different embeddings for product name extraction in unstructured Turkish text

dc.contributor.authorArslan, Serdar
dc.contributor.authorID325411tr_TR
dc.date.accessioned2024-05-28T13:28:20Z
dc.date.available2024-05-28T13:28:20Z
dc.date.issued2024
dc.departmentÇankaya Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.description.abstractNamed entity recognition (NER) plays a pivotal role in Natural Language Processing by identifying and classifying entities within textual data. While NER methodologies have seen significant advancements, driven by pretrained word embeddings and deep neural networks, the majority of these studies have focused on text with well-defined grammar and structure. A significant research gap exists concerning NER in informal or unstructured text, where traditional grammar rules and sentence structure are absent. This research addresses this crucial gap by focusing on the detection of product names within unstructured Turkish text. To accomplish this, we propose a deep learning-based NER model which combines a Bidirectional Long Short-Term Memory (BiLSTM) architecture with a Conditional Random Field (CRF) layer, further enhanced by FastText embeddings. To comprehensively evaluate and compare our model’s performance, we explore different embedding approaches, including Word2Vec and Glove, in conjunction with the Bidirectional Long Short-Term Memory and Conditional Random Field (BiLSTM-CRF) model. Furthermore, we conduct comparisons against BERT to assess the efficacy of our approach. Our experimentation utilizes a Turkish e-commerce dataset gathered from the internet, where traditional grammatical and structural rules may not apply. The BiLSTM-CRF model with FastText embeddings achieved an F1 score value of 57.40%, a precision value of 55.78%, and a recall value of 59.12%. These results indicate promising performance in outperforming other baseline techniques. This research contributes to the field of NER by addressing the unique challenges posed by unstructured Turkish text and opens avenues for improved entity recognition in informal language settings, with potential applications across various domains.en_US
dc.description.publishedMonth5
dc.identifier.citationArslan, Serdar (2024). "Application of BiLSTM-CRF model with different embeddings for product name extraction in unstructured Turkish text", Neural Computing and Applications, Vol. 36, No. 15, pp. 8371-8382.en_US
dc.identifier.doi10.1007/s00521-024-09532-1
dc.identifier.endpage8382en_US
dc.identifier.issn0941-0643
dc.identifier.issue15en_US
dc.identifier.startpage8371en_US
dc.identifier.urihttp://hdl.handle.net/20.500.12416/8424
dc.identifier.volume36en_US
dc.language.isoenen_US
dc.relation.ispartofNeural Computing and Applicationsen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectBERTen_US
dc.subjectBilstm-CRFen_US
dc.subjectDeep Learningen_US
dc.subjectFasttexten_US
dc.subjectNamed Entity Recognitionen_US
dc.titleApplication of BiLSTM-CRF model with different embeddings for product name extraction in unstructured Turkish texttr_TR
dc.titleApplication of Bilstm-Crf Model With Different Embeddings for Product Name Extraction in Unstructured Turkish Texten_US
dc.typeArticleen_US
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Article.pdf
Size:
957.66 KB
Format:
Adobe Portable Document Format
Description:
Yayıncı sürümü

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: