Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12416/8651

Browse

Search Results

Now showing 1 - 4 of 4
  • Article
    Comprehensive Analysis of Data Augmentation Methods in Classification for an Imbalanced Epilepsy Dataset
    (Institute of Electrical and Electronics Engineers Inc., 2026) Calis, A.G.; Ergezer, H.
    Imbalanced class distribution reduces the generalizability of classifiers in EEG-based epilepsy detection. This study examines the impact of the synthetic minority oversampling technique (SMOTE) and its variants on imbalanced electroencephalography (EEG) data, utilizing an end-to-end data processing pipeline. Band-limited filtering is applied as pre-processing, and then the training data is gradually oversampled by 20% increments in four scenes. Experiments are conducted on coarse-k-nearest neighbor (Coarse-KNN), bagged trees, and artificial neural network (ANN) classifiers, and evaluation is performed using accuracy, precision, recall, F1 score, and Matthew’s correlation coefficient (MCC) metrics. In Scene #4, where the inter-class imbalance is eliminated, Borderline-SMOTE yielded the highest and most consistent results (F1 Score = 0.903–0.937, MCC = 0.830–0.894). Safe level-SMOTE (SL-SMOTE) and SMOTE/Geometric-SMOTE(G-SMOTE) produced second-ranked results. The findings demonstrate that appropriate variant selection provides consistent gains even across classifiers, making Borderline-SMOTE the recommended approach for imbalanced EEG classification. Furthermore, in the detailed analysis of ensemble sampling limits, SMOTE-based combined approaches (e.g., SL + G SMOTE) also produced consistent results. Basic descriptive statistics (mode, median, variance, and kurtosis) of the synthetic samples were found to be comparable to those of the real data, providing additional evidence of distributional consistency. © 2013 IEEE.
  • Conference Object
    Citation - WoS: 1
    Citation - Scopus: 1
    Dengesiz Epilepsi Veri Seti İçin Sınıflandırmada Farklı SMOTE Yöntemlerinin Etkileri
    (Institute of Electrical and Electronics Engineers Inc., 2025) Calis, Ahmet Gokay; Ergezer, Halit
    In this study, the effects of different SMOTE methods on machine learning algorithms for the imbalanced epilepsy dataset were investigated. After filtering, the imbalanced dataset was balanced with 5 different SMOTE methods and classified with various machine learning algorithms. Coarse-K-Nearest Neighbor, Bagged Trees, and Artificial Neural Networks models were evaluated in epilepsy detection. The performance of these different models was compared with Matthews Correlation Coefficient (MCC) and F1 Score metrics. The results showed that the Borderline-SMOTE algorithm had the highest F1 Score and MCC values among all machine learning algorithms. © 2025 Elsevier B.V., All rights reserved.
  • Conference Object
    Predicting Electric Vehicle Adoption in the Eu: Analyzing Classification Performance and Influencing Attributes Across Countries, Gender, and Education Level
    (Institute of Electrical and Electronics Engineers Inc., 2024) Kumbasar, M.; Tokdemir, G.; Labben, T.G.; Ertek, G.
    Electric vehicles (EVs) have been one of the trending technologies in recent decades, as they are expected to transform the current automotive technology and transportation systems. To this end, the scope of this study is analyzing survey data on European consumers' EV purchase decisions. The objective is comparing the predictive quality of various classification algorithms in predicting EV adoption, across country, gender and education level of the participants, as well as the analysis of the influencing attributes. Initially, the data is filtered for each value of the chosen categorical attribute (country, gender or education level) with the missing values being imputed. Then, several classification algorithms in the Python sklearn package are applied through 5-fold-cross validation and the performance of the algorithms are compared based on standard classification metrics. There are notable variations in classification performance and influencing attributes depending on the values of the selected categorical attributes. © 2024 IEEE.
  • Conference Object
    Citation - WoS: 10
    Citation - Scopus: 21
    Multi-Label Classification of Text Documents Using Deep Learning
    (Ieee, 2020) Mohammed, Hamza Haruna; Dogdu, Erdogan; Gorur, Abdul Kadir; Choupani, Roya
    Recently, studies in the field of Natural Language Processing and its related applications continue to mount up. Machine learning is proven to be predominantly data-driven in the sense that generic model building methods are used and then tailored to specific application domains. Needless to say, this has proven to be a very effective approach in modeling the complicated data dependencies we frequently experience in practice, making very few assumptions, and allowing the information to talk for themselves. Examples of these applications can be found in chemical process engineering, climate science, healthcare, and linguistic processing systems for natural languages, to name a few. Text classification is one of the important machine learning tasks that is used in many digital applications today; such as in document filtering, search engines, document management systems, and many more. Text classification is the process of categorizing of text documents into a given set of labels. Furthermore, multi-label text classification is the task of categorization of text documents into one or more labels simultaneously. Over the years, many methods for classifying text documents have been proposed, including the popularly known bag of words (BoW) method, support vector machine (SVM), tree induction, and label-vector embedding, to mention a few. These kinds of tools can be used in many digital applications, such as document filtering, search engines, document management systems, etc. Lately, deep learning-based approaches are getting more attention, especially in extreme multi-label text classification case. Deep learning has proven to be one of the major solutions to many machine learning applications, especially those involving high-dimensional and unstructured data. However, it is of paramount importance in many applications to be able to reason accurately about the uncertainties associated with the predictions of the models. In this paper, we explore and compare the recent deep learning-based methods for multi-label text classification. We investigate two scenarios. First, multi-label classification model with ordinary embedding layer, and second with Glove, word2vec, and FastText as pre-trained embedding corpus for the given models. We evaluated these different neural network model performances in terms of multi-label evaluation metrics for the two approaches, and compare the results with the previous studies.