Abstract
Retail stores have become an integral part of our daily lives and play a vital economic role in a society. It has been found that retailers can increase sales through proficient shelf-space management. So,
developing approaches for efficient product placement in retail stores is one of the key research issues.
Traditionally, approaches have been proposed based on dynamic programming models. These modeling approaches have limitations due to the inherent complexity of handling a large number of variables
with approximation. Since 1990, data analysis methods have provided an opportunity to retailers to
understand the buying patterns of their customers by analyzing customer purchase data. Moreover, after the emergence of data mining approaches, several research works have been proposed by extending
pattern mining techniques for improved product placement in retail. Especially, Frequent Pattern Mining, which is one of the task of data mining, can be used to extract patterns (or itemsets) from a given
set of transactions. Furthermore, several kinds of utility mining techniques have been proposed in the
literature for extracting the knowledge of high-utility itemsets by incorporating the notion of utility of
the item. On the other hand, in the field of pattern mining, approaches have been proposed to extract
the knowledge of generalized itemsets from the given transcational dataset and a taxonomy among the
items. In this thesis, we propose an improved product placement approach by extending a generalized
itemset mining framework to the existing utility mining approach, which has been proposed for product
placement.
Generalized high-utility itemset mining entails the extraction of high-utility itemsets by exploring the
association among categories/generalized items at higher-level concepts in a given product taxonomy.
We exploit the fact that the knowledge of generalized high-utility itemsets extracted from a user purchase
transactional database in conjunction with a product taxonomy can provide new insights about customer
purchase behaviour. Given a user purchase transactions database and a product taxonomy, we propose
a utility-based indexing scheme to extract high-utility (revenue) generalized itemsets. For a given level
of the taxonomy, we build an index, designated as the Generalized Utility Itemset (GUI) index, to
identify and store high-revenue generalized itemsets. Intrinsically, generalized itemsets are virtual in
the sense that they cannot be directly placed in the slots of the given retail store. Here, the issue is also
to identify the specific actual items pertaining to a given generalized pattern to be placed in the slots of
the retail store for improving the revenue of the retailer. The proposed product placement framework
exploits the proposed GUI index for placing items in the shelf space of the retail store to maximize
retailer revenue. Our performance evaluation on three real datasets, namely the instacart retail dataset (containing 49,688 items and 3,214874 transactions), the classical R ”groceries” dataset (containing 169
items and 9,835 transactions) and the fruithut dataset (containing 1,265 items and 181,979 transactions),
demonstrates the effectiveness of the proposed scheme in terms of total revenue and execution time as
compared with the three existing schemes.