eRules: a modular adaptive classification rule learning algorithm for data streams

F. Stahl, M. Gaber, M. Salvador

Research output: Contribution to conference › Paper › peer-review

Abstract

Advances in hardware and software over the past decade make it possible to capture, record and process fast data streams at a large scale. The research area of data stream mining has emerged as a consequence of these advances in order to cope with the real-time analysis of potentially large and changing data streams. Examples of data streams include Google searches, credit card transactions, telemetric data and data from continuous chemical production processes. In some cases the data can be processed in batches by traditional data mining approaches. However, some applications require the data to be analysed in real time, as soon as it is captured, for example when the data stream is infinite, fast changing, or simply too large to be stored. One of the most important data mining techniques on data streams is classification. This involves training the classifier on the data stream in real time and adapting it to concept drift. Most data stream classifiers are based on decision trees. However, it is well known in the data mining community that there is no single optimal algorithm: an algorithm may work well on one or several datasets but badly on others. This paper introduces eRules, a new rule-based adaptive classifier for data streams, based on an evolving set of Rules. eRules induces a set of rules that is constantly evaluated and adapted to changes in the data stream by adding new rules and removing old ones. It differs from the more popular decision-tree-based classifiers in that it tends to leave data instances unclassified rather than forcing a classification that could be wrong. The ongoing development of eRules aims to improve its accuracy further through dynamic parameter setting, which will also address the problem of changing feature domain values.
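The behaviour described in the abstract (an evolving rule set that abstains when no rule applies, adds rules for uncovered instances, and removes rules that stop performing) can be illustrated with a minimal sketch. This is not the authors' implementation: the class names `Rule` and `EvolvingRuleSet`, the parameters `min_accuracy`, `min_hits` and `buffer_size`, and the trivial rule-induction step are all hypothetical and chosen only to make the idea concrete.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Rule:
    """IF all conditions match THEN predict `label` (hypothetical structure)."""
    conditions: Dict[str, str]
    label: str
    hits: int = 0      # how often the rule covered a labelled instance
    correct: int = 0   # how often its label matched the true label

    def covers(self, instance: Dict[str, str]) -> bool:
        return all(instance.get(k) == v for k, v in self.conditions.items())

    def accuracy(self) -> float:
        return self.correct / self.hits if self.hits else 1.0


@dataclass
class EvolvingRuleSet:
    min_accuracy: float = 0.6   # assumed removal threshold
    min_hits: int = 5           # assumed evidence needed before removal
    buffer_size: int = 3        # assumed size of the unclassified buffer
    rules: List[Rule] = field(default_factory=list)
    unclassified: List[Tuple[Dict[str, str], str]] = field(default_factory=list)

    def predict(self, instance: Dict[str, str]) -> Optional[str]:
        """Return the label of the first covering rule, or None to abstain."""
        for rule in self.rules:
            if rule.covers(instance):
                return rule.label
        return None

    def learn(self, instance: Dict[str, str], label: str) -> None:
        """Update rule statistics with one labelled instance from the stream."""
        rule = next((r for r in self.rules if r.covers(instance)), None)
        if rule is None:
            # No rule applies: buffer the instance and induce rules once full.
            self.unclassified.append((instance, label))
            if len(self.unclassified) >= self.buffer_size:
                self._induce_from_buffer()
            return
        rule.hits += 1
        rule.correct += int(rule.label == label)
        # Drop rules whose accuracy has degraded (e.g. after concept drift).
        if rule.hits >= self.min_hits and rule.accuracy() < self.min_accuracy:
            self.rules.remove(rule)

    def _induce_from_buffer(self) -> None:
        # Placeholder induction: one fully specific rule per uncovered instance.
        # A real rule learner would generalise over the buffered instances.
        for instance, label in self.unclassified:
            if not any(r.covers(instance) for r in self.rules):
                self.rules.append(Rule(dict(instance), label))
        self.unclassified.clear()
```

In this sketch `predict` returning `None` plays the role of leaving an instance unclassified, and a rule is only removed after it has been evaluated on at least `min_hits` labelled instances, so a single mistake does not discard it.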
Original language: English
Publication status: Published - 11 Dec 2012
Event: Proceedings of the Thirty-second SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence - Cambridge, United Kingdom
Duration: 11 Dec 2012 → 13 Dec 2012

Conference

Conference: Proceedings of the Thirty-second SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence
Country/Territory: United Kingdom
City: Cambridge
Period: 11/12/12 → 13/12/12