M5Product-Mainpage

About

The M5Product dataset is a large-scale multi-modal pre-training dataset with coarse and fine-grained annotations for E-products.

• 6 Million multi-modal samples, 5k properties with 24 Million values

• 5 modalities-image text table video audio

• 6 Million category annotations with 6k classes

• Wide data source (1 Million merchants provide)

Sampler

The data acquisition page is shown as follows.

Examples

Citation

If you find our dataset useful in your research, please consider citing:

@ARTICLE{2021arXiv210904275D
title={M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining},
author={Xiao Dong and Xunlin Zhan and Yangxin Wu and Yunchao Wei and Michael C. Kampffmeyer and Xiaoyong Wei and Minlong Lu and Yaowei Wang and Xiaodan Liang},
year={2021},
eprint={2109.04275},
journal = {arXiv e-prints},
year={2021},
}

Annoucement

2021/09/2 Initial release.

2022/03/12 Update.

Organization