Please use this identifier to cite or link to this item: http://hdl.handle.net/2381/39321
Title: Ensemble learning for data stream analysis: a survey
Authors: Krawczyk, Bartosz
Minku, Leandro L.
Gama, Joao
Stefanowski, Jerzy
Wozniak, Michal
First Published: 3-Feb-2017
Publisher: Elsevier
Citation: Information Fusion, 2017, 37, pp. 132–156
Abstract: In many applications of information systems learning algorithms have to act in dynamic environments where data are collected in the form of transient data streams. Compared to static data mining, processing streams imposes new computational requirements for algorithms to incrementally process incoming examples while using limited memory and time. Furthermore, due to the non-stationary characteristics of streaming data, prediction models are often also required to adapt to concept drifts. Out of several new proposed stream algorithms, ensembles play an important role, in particular for non-stationary environments. This paper surveys research on ensembles for data stream classification as well as regression tasks. Besides presenting a comprehensive spectrum of ensemble approaches for data streams, we also discuss advanced learning concepts such as imbalanced data streams, novelty detection, active and semi-supervised learning, complex data representations and structured outputs. The paper concludes with a discussion of open research problems and lines of future research.
DOI Link: 10.1016/j.inffus.2017.02.004
ISSN: 1566-2535
Links: http://www.sciencedirect.com/science/article/pii/S1566253516302329
http://hdl.handle.net/2381/39321
Embargo on file until: 3-Aug-2018
Version: Post-print
Status: Peer-reviewed
Type: Journal Article
Rights: Copyright © Elsevier, 2017. This article is distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/ ), which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
Description: The file associated with this record is embargoed until 18 months after the date of publication. The final published version may be available through the links above. Following the embargo period the above license applies.
Appears in Collections:Published Articles, Dept. of Computer Science

Files in This Item:
File Description SizeFormat 
IF_ENS_survey.pdfPost-review (final submitted author manuscript)462.18 kBAdobe PDFView/Open


Items in LRA are protected by copyright, with all rights reserved, unless otherwise indicated.