Dynamic Itemset Counting and Implication Rules for Market Basket Data.
Sergey Brin, Rajeev Motwani, Jeffrey D. Ullman, Shalom Tsur:
Dynamic Itemset Counting and Implication Rules for Market Basket Data.
SIGMOD Conference 1997: 255-264@inproceedings{DBLP:conf/sigmod/BrinMUT97,
author = {Sergey Brin and
Rajeev Motwani and
Jeffrey D. Ullman and
Shalom Tsur},
editor = {Joan Peckham},
title = {Dynamic Itemset Counting and Implication Rules for Market Basket
Data},
booktitle = {SIGMOD 1997, Proceedings ACM SIGMOD International Conference
on Management of Data, May 13-15, 1997, Tucson, Arizona, USA},
publisher = {ACM Press},
year = {1997},
pages = {255-264},
ee = {http://doi.acm.org/10.1145/253260.253325, db/conf/sigmod/BrinMUT97.html},
crossref = {DBLP:conf/sigmod/97},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
We consider the problem of analyzing market-basket data
and present several important contributions. First, we present
a new algorithm for finding large itemsets which uses fewer
passes over the data than classic algorithms, and yet uses
fewer candidate itemsets than methods based on sampling.
We investigate the idea of item reordering, which can improve
the low-level efficiency of the algorithm. Second, we
present a new way of generating "implication rules," which
are normalized based on both the antecedent and the consequent
and are truly implications (not simply a measure
of co-occurrence), and we show how they produce more intuitive
results than other methods. Finally, we show how
different characteristics of real data, as opposed to synthetic
data, can dramatically affect the performance of the system
and the form of the results.
Copyright © 1997 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
Online Version (ACM WWW Account required): Full Text in PDF Format
CDROM Version: Load the CDROM "Volume 1 Issue 1, SIGMOD '93-'97" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
Printed Edition
Joan Peckham (Ed.):
SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, May 13-15, 1997, Tucson, Arizona, USA.
ACM Press 1997
,
SIGMOD Record 26(2),
June 1997
Contents
[Index Terms]
[Full Text in PDF Format, 1164 KB]
References
- [AIS93a]
- Rakesh Agrawal, Tomasz Imielinski, Arun N. Swami:
Database Mining: A Performance Perspective.
IEEE Trans. Knowl. Data Eng. 5(6): 914-925(1993)

- [AIS93b]
- Rakesh Agrawal, Tomasz Imielinski, Arun N. Swami:
Mining Association Rules between Sets of Items in Large Databases.
SIGMOD Conference 1993: 207-216

- [ALSS95]
- Rakesh Agrawal, King-Ip Lin, Harpreet S. Sawhney, Kyuseok Shim:
Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases.
VLDB 1995: 490-501

- [AS94]
- Rakesh Agrawal, Ramakrishnan Srikant:
Fast Algorithms for Mining Association Rules in Large Databases.
VLDB 1994: 487-499

- [AS95]
- Rakesh Agrawal, Ramakrishnan Srikant:
Mining Sequential Patterns.
ICDE 1995: 3-14

- [MAR96]
- Manish Mehta, Rakesh Agrawal, Jorma Rissanen:
SLIQ: A Fast Scalable Classifier for Data Mining.
EDBT 1996: 18-32

- [SA95]
- Ramakrishnan Srikant, Rakesh Agrawal:
Mining Generalized Association Rules.
VLDB 1995: 407-419

- [Toi96]
- Hannu Toivonen:
Sampling Large Databases for Association Rules.
VLDB 1996: 134-145

Copyright © Sun Nov 15 05:12:01 2009
by Michael Ley (ley@uni-trier.de)