Çankaya GCRIS Standart veritabanının içerik oluşturulması ve kurulumu Research Ecosystems (https://www.researchecosystems.com) tarafından devam etmektedir. Bu süreçte gördüğünüz verilerde eksikler olabilir.
 

Small and Unbalanced Data Set Problem in Classification

dc.contributor.authorPar, Öznur Esra
dc.contributor.authorSezer, Ebru Akçapınar
dc.contributor.authorSever, Hayri
dc.contributor.authorID11916tr_TR
dc.date.accessioned2023-01-04T08:28:53Z
dc.date.available2023-01-04T08:28:53Z
dc.date.issued2019
dc.departmentÇankaya Üniversitesi, Mühendislik Fakültesi, Yazılım Mühendisliği Bölümüen_US
dc.description.abstractClassification of data is difficult in case of small and unbalanced data set and this problem directly affects the classification performance. Small and / or the imbalance dataset has become a major problem in data mining. Classification algorithms are developed based on the assumption that the data sets are balanced and large enough. The most of the algorithms ignore or misclassify examples of the minority class, focus on the majority class. Small and unbalanced data set problem is frequently encountered in medical data mining due to some limitations. Within the scope of the study, the public accessible data set, hepatitis, was divided into small and imblanced data subsets, each of the data subsets were oversampled by distance based data generation methods. The oversampled data sets were classified by using four different machine learning algorithms (Artificial Neural Networks, Support Vector Machines, Naive Bayes and Decision Tree) and the classification scores were compared.en_US
dc.identifier.citationPar, Öznur Esra; Sezer, Ebru Akçapınar; Sever, Hayri (2019). "Small and Unbalanced Data Set Problem in Classification", 27th Signal Processing and Communications Applications Conference (SIU), Sivas Cumhuriyet Univ, Sivas, TURKEY, APR 24-26, 2019.en_US
dc.identifier.issn2165-0608
dc.identifier.urihttp://hdl.handle.net/20.500.12416/6019
dc.language.isoenen_US
dc.relation.ispartof2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU)en_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectMachine Learningen_US
dc.subjectSmall Data Seten_US
dc.subjectImbalanced Data Seten_US
dc.subjectOversampling Methodsen_US
dc.titleSmall and Unbalanced Data Set Problem in Classificationtr_TR
dc.titleSmall and Unbalanced Data Set Problem in Classificationen_US
dc.typeConference Objecten_US
dspace.entity.typePublication

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: