You are in:Home/Publications/Efficiently using Prime-Encoding for Frequent Itemsets Mining in Sparse Data.

Dr. Mosab abd el-hameed mohamed hassaan :: Publications:

Title:
Efficiently using Prime-Encoding for Frequent Itemsets Mining in Sparse Data.
Authors: Karam Gouda and Mosab hassaan
Year: 2013
Keywords: Mining frequent itemsets, prime-block encoding, sparse data
Journal: Computing and Informatics
Volume: 32
Issue: Not Available
Pages: 1079–1099
Publisher: Not Available
Local/International: International
Paper Link:
Full paper Mosab abd el-hameed mohamed hassaan_karam-mosab.pdf
Supplementary materials Not Available
Abstract:

In the data mining field, data representation turns out to be one of the major factors affecting mining algorithm scalability. Mining Frequent Itemsets (MFI) is a data mining problem that is heavily affected by this fact. The vertical approach is one of the successful data representations adopted for MFI problem. The main advantage of this approach is support for fast frequency counting via joining operations. Recently, an encoding method called prime-encoding is proposed as an enhancement for the vertical approach [10]. The performance study introduced in [10] confirmed the high quality of prime-encoding based vertical mining of frequent sequence over other vertical and horizontal ones in terms of space and time. Though sequence mining is more general than itemset mining, this paper presents a prime-encoding based vertical mining of frequent itemsets with new optimizations and a new re-encoding method that further enhance memory and speed. The experimental results show that prime encoding based vertical itemset mining is suitable for high-dimensional sparse data.

Google ScholarAcdemia.eduResearch GateLinkedinFacebookTwitterGoogle PlusYoutubeWordpressInstagramMendeleyZoteroEvernoteORCIDScopus