Automatically understanding the contents of an image is a highly relevant problem in practice. In e-commerce and social media settings, for example, a common problem is to automatically categorize user-provided pictures. Nowadays, a standard approach is to fine-tune pre-trained image models with application-specific data. Besides images, organizations however often also collect collaborative signals in the context of their application, in particular how users interacted with the provided online content, e.g., in forms of viewing, rating, or tagging. Such signals are commonly used for item recommendation, typically by deriving latent user and item representations from the data. In this work, we show that such collaborative information can be leveraged to improve the classification process of new images. Specifically, we propose a multitask learning framework, where the auxiliary task is to reconstruct collaborative latent item representations. A series of experiments on datasets from e-commerce and social media demonstrates that considering collaborative signals helps to significantly improve the performance of the main task of image classification by up to 9.1%.
|Title of host publication||CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management|
|Publisher||Association for Computing Machinery|
|Number of pages||11|
|State||Published - 17 Oct 2022|
|Event||31st ACM International Conference on Information and Knowledge Management, CIKM 2022 - Atlanta, United States|
Duration: 17 Oct 2022 → 21 Oct 2022
|Name||International Conference on Information and Knowledge Management, Proceedings|
|Conference||31st ACM International Conference on Information and Knowledge Management, CIKM 2022|
|Period||17/10/22 → 21/10/22|
Bibliographical notePublisher Copyright:
© 2022 ACM.
- image classification
- multi-task learning
- recommendation system