The impact of big data market segmentation using data mining and clustering techniques

Fahed Yoseph, Ahamed Hassain Malim, Nurul Hashimah, Markku Heikkilä, Brezulianu Adrian, Geman Oana, Paskhal Rostam Nur Aqilah

    Research output: Contribution to journalArticleScientificpeer-review

    15 Citations (Scopus)
    119 Downloads (Pure)


    Targeted marketing strategy is a prominent topic that has received substantial attention from both industries and academia. Market segmentation is a widely used approach in investigating the heterogeneity of customer buying behavior and profitability. It is important to note that conventional market segmentation models in the retail industry are predominantly descriptive methods, lack sufficient market insights, and often fail to identify sufficiently small segments. This study also takes advantage of the dynamics involved in the Hadoop distributed file system for its ability to process vast dataset. Three different market segmentation experiments using modified best fit regression, i.e., Expectation-Maximization (EM) and K-Means++ clustering algorithms were conducted and subsequently assessed using cluster quality assessment. The results of this research are twofold: i) The insight on customer purchase behavior revealed for each Customer Lifetime Value (CLTV) segment; ii) performance of the clustering algorithm for producing accurate market segments. The analysis indicated that the average lifetime of the customer was only two years, and the churn rate was 52%. Consequently, a marketing strategy was devised based on these results and implemented on the departmental store sales. It was revealed in the marketing record that the sales growth rate up increased from 5% to 9%.

    Original languageEnglish
    Pages (from-to)6159–6173
    Number of pages15
    JournalJournal of Intelligent and Fuzzy Systems
    Issue number5
    Publication statusPublished - 2020
    MoE publication typeA1 Journal article-refereed


    Dive into the research topics of 'The impact of big data market segmentation using data mining and clustering techniques'. Together they form a unique fingerprint.

    Cite this