Clustering consumers based on product discrimination in check-all-that-apply (CATA) data
Tidsskrift : Food Quality and Preference , vol. 99 , p. 1–15–14 , 2022
Utgiver : Elsevier
Trykt : 0950-3293
Elektronisk : 1873-6343
Publikasjonstype : Vitenskapelig artikkel
DOI : doi.org/10.1016/j.foodqual.202...
Kvalitet og målemetoder
Har du spørsmål om noe vedrørende publikasjonen, kan du kontakte Nofimas bibliotekleder.
Consumers can be clustered based on their product-related check-all-that-apply (CATA) responses. We identify two paradoxes that can occur if these clusters are derived from conventional similarity coefficients. The first paradox is that clustering similar consumers can nullify within-cluster sensory differentiation of products. The second paradox is that consumers who check many attributes yet disagree can be clustered together, whereas consumers who check fewer attributes without disagreement can be split into different clusters. After illustrating these paradoxes with toy data sets, we propose “b-cluster analysis”, in which consumers are clustered according to how they differentiate products. We define performance metrics to compare cluster analysis solutions. By design, b-cluster analysis is expected to give different results than CLUSCATA, since the objective of CLUSCATA is to cluster consumers who characterize products similarly, not according to how they differentiate products. We apply b-cluster analysis to the same toy data sets and show that the identified paradoxes do not occur. Then we apply both b-cluster analysis and CLUSCATA to a real consumer data set. We find that the b-cluster analysis solutions have better within-cluster sensory differentiation, better sensory discrimination, and less redundant clusters than CLUSCATA solutions. To investigate the sensitivity of b-cluster analysis to the initial (random) cluster membership allocations, we obtained 10,000 two-cluster solutions, each initialized with a different random partitioning of consumers. The best solution, which retains the most sensory differentiation, was observed in 21.4% of the runs. As a best practice, we recommend running b-cluster analysis several times and choosing the best solution. The proposed b-cluster analysis approach can be extended to other types of sensometric data and may have applications in other fields.