Parallel data reduction techniques for big datasets

Yildirim, A.A.; Özdoğan, Cem; Özdoğan, C.; Watson, D.

Parallel data reduction techniques for big datasets

Date

2013

Authors

Publisher

IGI Global

Organizational Units

Organizational Unit

Ortak Dersler Bölümü

Ortak Dersler Bölümü’nün amacı öğrencilerimizin analitik düşünme yeteneğini geliştirmek, bazı doğa kanunlarını anlayabilmelerini sağlamak, eğitim, bilim, sanat, tarih ve edebiyat gibi alanlarda öğrencilerimizin kendilerini geliştirmesine imkan sağlamaktır.

Abstract

Data reduction is perhaps the most critical component in retrieving information from big data (i.e., petascale-sized data) in many data-mining processes. The central issue of these data reduction techniques is to save time and bandwidth in enabling the user to deal with larger datasets even in minimal resource environments, such as in desktop or small cluster systems. In this chapter, the authors examine the motivations behind why these reduction techniques are important in the analysis of big datasets. Then they present several basic reduction techniques in detail, stressing the advantages and disadvantages of each. The authors also consider signal processing techniques for mining big data by the use of discrete wavelet transformation and server-side data reduction techniques. Lastly, they include a general discussion on parallel algorithms for data reduction, with special emphasis given to parallel waveletbased multi-resolution data reduction techniques on distributed memory systems using MPI and shared memory architectures on GPUs along with a demonstration of the improvement of performance and scalability for one case study. © 2014, IGI Global. All right reserved.

Citation

Yıldırım, Ahmet Artu; Özdoğan, Cem; Watson, Dan (2013). "Parallel data reduction techniques for big datasets", Big Data Management, Technologies, and Applications, pp. 72-93.

WoS Q

N/A

Scopus Q

N/A

Source

Big Data Management, Technologies, and Applications

Start Page

72

End Page

93

URI

https://doi.org/10.4018/978-1-4666-4699-5.ch004

Collections

Scopus İndeksli Yayınlar Koleksiyonu
Fizik Bilim Dalı Yayın Koleksiyonu

Full item page

Parallel data reduction techniques for big datasets

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Organizational Units

Journal Issue

Events

Abstract

Description

Keywords

Turkish CoHE Thesis Center URL

Fields of Science

Citation

WoS Q

Scopus Q

Source

Volume

Issue

Start Page

End Page

URI

Collections