Small and Unbalanced Data Set Problem in Classification

Date

2019

Publisher

IEEE

Abstract

Classifying data is difficult when the data set is small and unbalanced, and this problem directly affects classification performance. Small and/or imbalanced data sets have become a major problem in data mining. Classification algorithms are developed on the assumption that data sets are balanced and large enough. Most algorithms focus on the majority class and ignore or misclassify examples of the minority class. The small and unbalanced data set problem is frequently encountered in medical data mining due to some limitations. Within the scope of this study, the publicly accessible hepatitis data set was divided into small and imbalanced data subsets, and each subset was oversampled using distance-based data generation methods. The oversampled data sets were classified using four different machine learning algorithms (Artificial Neural Networks, Support Vector Machines, Naive Bayes, and Decision Tree), and the classification scores were compared.
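
The abstract does not name the specific distance-based data generation methods, so the following is only a minimal sketch of one well-known member of that family: SMOTE-style interpolation between a minority sample and one of its nearest minority neighbours. The function name `smote_like_oversample`, the parameters `k` and `seed`, and the toy points are all assumptions made for illustration; they are not taken from the paper and the toy data is not the hepatitis set.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority points (SMOTE-style sketch).

    Each synthetic point lies on the line segment between a randomly
    chosen minority sample and one of its k nearest minority neighbours
    (Euclidean distance).
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than peers
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by distance to the base sample.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy data: 6 majority points, 3 minority points.
majority = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.3), (0.3, 0.2), (0.2, 0.4), (0.4, 0.1)]
minority = [(2.0, 2.1), (2.2, 1.9), (1.9, 2.3)]

# Generate just enough synthetic points to match the majority class size.
new_points = smote_like_oversample(minority, n_new=len(majority) - len(minority))
balanced_minority = minority + new_points
```

Because every synthetic point is a convex combination of two real minority samples, the generated data stays inside the region already occupied by the minority class. In a study like the one described, the balanced set would then be passed to each classifier and the scores compared on held-out data that contains no synthetic points.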

Keywords

Machine Learning, Small Data Set, Imbalanced Data Set, Oversampling Methods

Citation

Par, Öznur Esra; Sezer, Ebru Akçapınar; Sever, Hayri (2019). "Small and Unbalanced Data Set Problem in Classification", 27th Signal Processing and Communications Applications Conference (SIU), Sivas Cumhuriyet Univ, Sivas, TURKEY, APR 24-26, 2019.

OpenCitations Citation Count
8

Source

27th Signal Processing and Communications Applications Conference (SIU) -- APR 24-26, 2019 -- Sivas Cumhuriyet Univ, Sivas, TURKEY

PlumX Metrics
Citations

CrossRef : 6

Scopus : 13

Captures

Mendeley Readers : 22

OpenAlex FWCI
0.92170647
