TDMCS: An Efficient Method for Mining Closed
In some data stream applications, the information embedded in recently arrived data is more important than that in
historical transactions. Because a data stream changes over time, the concept drift problem may appear in data stream
mining. Frequent pattern mining methods tend to generate useless and redundant patterns, so closed patterns are needed to
obtain a losslessly compressed result set. This paper proposes a novel method for efficiently mining closed frequent patterns
on data streams. The main contributions are: (1) distinguishing the importance of recent transactions from that of historical
transactions based on a time decay model and a sliding window model; (2) designing the framework of minimum support
count, maximal support error rate, and decay factor (θ-ε-f) to avoid concept drift; (3) using a closure operator to improve the
efficiency of the algorithm; and (4) designing a novel way to set the decay factor, the average decay factor f_average, in order
to balance the high recall and high precision of the algorithm. The performance of the proposed method is evaluated via
experiments, and the results show that it is efficient and stable. It is applicable to mining data streams with high density and
long patterns, is suitable for sliding windows of different sizes, and is superior to other analogous algorithms.
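
To make the combination of time decay, sliding window, and closure operator concrete, the following is a minimal brute-force sketch and not the TDMCS algorithm itself. It assumes one common form of time decay, in which a transaction of age k in the window contributes f^k to the support count, with the newest transaction at age 0. The function names (`decayed_support`, `closure`, `closed_frequent_patterns`) are hypothetical, and the maximal support error rate ε is not modeled.

```python
from itertools import combinations


def decayed_support(window, pattern, f):
    """Decayed support count of `pattern` in `window` (oldest transaction first).

    Assumption: a transaction of age k contributes f**k; the newest has age 0.
    """
    pattern = frozenset(pattern)
    n = len(window)
    return sum(f ** (n - 1 - i)
               for i, t in enumerate(window) if pattern <= frozenset(t))


def closure(window, pattern):
    """Closure operator: intersection of all transactions that contain `pattern`."""
    pattern = frozenset(pattern)
    covering = [frozenset(t) for t in window if pattern <= frozenset(t)]
    return frozenset.intersection(*covering) if covering else pattern


def closed_frequent_patterns(window, f, theta):
    """Closed patterns whose decayed support is at least `theta`.

    Brute force: closed itemsets are exactly the non-empty intersections of
    transactions, so intersect pairwise until no new itemset appears.
    """
    closed = {frozenset(t) for t in window}
    changed = True
    while changed:
        changed = False
        for a, b in combinations(list(closed), 2):
            c = a & b
            if c and c not in closed:
                closed.add(c)
                changed = True
    supports = {p: decayed_support(window, p, f) for p in closed}
    return {p: s for p, s in supports.items() if s >= theta}


# Sliding window of three transactions, oldest first (illustrative data only).
window = [{"a", "b", "c"}, {"a", "b"}, {"a", "b", "c"}]
result = closed_frequent_patterns(window, f=0.5, theta=1.5)
```

In this toy window, {a} is not closed because every transaction containing a also contains b, so its closure is {a, b}. Only closed patterns need to be stored, and the support of any other frequent pattern can be recovered from its closure.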
[26] Yu J., Chong Z., Lu H., and Zhou A., False Positive or False Negative: Mining Frequent Itemsets from High Speed Transactional Data Streams, in Proceeding of the 30th International Conference on Very Large Data Bases, Toronto, pp. 204-215, 2004.

Han Meng, born in 1982, Ph.D. candidate, associate professor. Her research interests include data mining and machine learning.

Jian Ding, born in 1977, M.S., associate professor. His research interests include machine learning and data mining.

Juan Li, born in 1975, M.S., associate professor. Her research interests include information security and cloud computing.