Multi-Source Data Stream Online Frequent Episode Mining

Tao You, Yamin Li, Bingkun Sun, Chenglie Du

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Online frequent episode mining is more complicated than the traditional static frequent episode mining due to the continuous, unbounded and time-varying data stream. Especially in the multiple data streams, online frequent episode mining is more difficult than the single-source stream, due to the concurrency, global clock loss, and uncertainty of delay caused by the distributed environment. To cope with these problems, we propose a new algorithm. Firstly, the data stream with 'happen-before' relationship among multiple sources is combined on the global data lattice. Next, the traversal on global data lattice generates effective parallel and serial candidate data streams, which guarantee the accuracy of subsequent mining and reduce the number of global sequences during searching process. Then, we use the frequent episode tree to detect the expanding online serial episodes and parallel episodes. Finally, we verify the effectiveness and efficiency of the proposed methods through extensive experiments.

Original languageEnglish
Article number9099550
Pages (from-to)107465-107478
Number of pages14
JournalIEEE Access
Volume8
DOIs
StatePublished - 2020

Keywords

  • Episode
  • global data lattice
  • multi-source data stream

Fingerprint

Dive into the research topics of 'Multi-Source Data Stream Online Frequent Episode Mining'. Together they form a unique fingerprint.

Cite this