DOI

10.3906/elk-1602-341

Abstract

The number and size of massive datasets increase day by day, which makes machine learning stages more complex because of the high computational cost. To reduce this cost, many methods have been proposed in the literature, such as data condensing, feature selection, and filtering. Although clustering methods are generally employed to divide samples into groups, another way of condensing data is to determine ideal exemplars (or prototypes), which can be used instead of the whole dataset. In this study, the efficiency of the traditional approach of data condensing by clustering was first confirmed according to the accuracies and condensing ratios obtained on 9 different synthetic or real batch datasets. This approach was then improved so that it can be employed on time-ordered datasets. To validate the proposed approach, 23 different real time-ordered datasets were used in experiments. The mean RMSEs achieved were 0.27 with the condensed datasets (mean condensing ratio of 97.17%) and 0.29 with the whole datasets. The results showed that higher accuracy rates and condensing ratios were achieved by the proposed approach.
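As a rough illustration of the general idea of condensing by clustering, the sketch below replaces a dataset with a handful of cluster centroids that act as exemplars. It is not the method of the paper: plain k-means is swapped in as a stand-in clustering step, the function names `kmeans_exemplars` and `condensing_ratio` are hypothetical, and the ratio is assumed to mean the percentage of samples removed.

```python
import random

def kmeans_exemplars(points, k, iters=20, seed=0):
    """Return k centroids used as stand-in exemplars (plain Lloyd's k-means)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # random initial centroids
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each sample to its nearest centroid (squared Euclidean).
            j = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            clusters[j].append(p)
        # Recompute each centroid; keep the old one if its cluster is empty.
        centroids = [
            tuple(sum(col) / len(cl) for col in zip(*cl)) if cl else centroids[i]
            for i, cl in enumerate(clusters)
        ]
    return centroids

def condensing_ratio(n_original, n_exemplars):
    """Percentage of samples removed (assumed definition, not from the paper)."""
    return 100.0 * (1 - n_exemplars / n_original)

# Toy data: two well-separated 2-D blobs of 100 samples each.
rng = random.Random(1)
data = [
    (rng.gauss(cx, 0.1), rng.gauss(cy, 0.1))
    for cx, cy in [(0, 0), (5, 5)]
    for _ in range(100)
]
exemplars = kmeans_exemplars(data, k=2)
ratio = condensing_ratio(len(data), len(exemplars))
```

Here 200 samples are reduced to 2 exemplars, so a model would be trained on the exemplars instead of the full set. How the number of exemplars is chosen, and how time ordering is handled, are left open in this sketch.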

Keywords

Data condensing, prototype extracting, clustering, massive datasets, time-ordered datasets

First Page

2614

Last Page

2634

Recommended Citation

ERTUĞRUL, ÖMER FARUK (2017) "A novel approach for extracting ideal exemplars by clustering for massive time-ordered datasets," Turkish Journal of Electrical Engineering and Computer Sciences: Vol. 25: No. 4, Article 6. https://doi.org/10.3906/elk-1602-341
Available at: https://journals.tubitak.gov.tr/elektrik/vol25/iss4/6

Included in

Computer Engineering Commons, Computer Sciences Commons, Electrical and Computer Engineering Commons

Turkish Journal of Electrical Engineering and Computer Sciences

A novel approach for extracting ideal exemplars by clustering for massive time-ordered datasets
