TY - GEN
T1 - Movie Box Office Prediction With Self-Supervised and Visually Grounded Pretraining
AU - Chao, Qin
AU - Kim, Eunsoo
AU - Li, Boyang
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Investments in movie production are associated with a high level of risk as movie revenues have long-tailed and bimodal distributions [1]. Accurate prediction of box-office revenue may mitigate the uncertainty and encourage investment. However, learning effective representations for actors, directors, and user-generated content-related keywords remains a challenging open problem. In this work, we investigate the effects of self-supervised pretraining and propose visual grounding of content keywords in objects from movie posters as a pretraining objective. Experiments on a large dataset of 35,794 movies demonstrate significant benefits of self-supervised training and visual grounding. In particular, visual grounding pretraining substantially improves learning on movies with content keywords and achieves 14.5% relative performance gains compared to a finetuned BERT model with identical architecture.
AB - Investments in movie production are associated with a high level of risk as movie revenues have long-tailed and bimodal distributions [1]. Accurate prediction of box-office revenue may mitigate the uncertainty and encourage investment. However, learning effective representations for actors, directors, and user-generated content-related keywords remains a challenging open problem. In this work, we investigate the effects of self-supervised pretraining and propose visual grounding of content keywords in objects from movie posters as a pretraining objective. Experiments on a large dataset of 35,794 movies demonstrate significant benefits of self-supervised training and visual grounding. In particular, visual grounding pretraining substantially improves learning on movies with content keywords and achieves 14.5% relative performance gains compared to a finetuned BERT model with identical architecture.
KW - Box Office Prediction
KW - Movie Revenue Prediction
KW - Multimodal Learning
KW - Self-supervised Learning
KW - Visual Grounding
UR - http://www.scopus.com/inward/record.url?scp=85171137360&partnerID=8YFLogxK
U2 - 10.1109/ICME55011.2023.00265
DO - 10.1109/ICME55011.2023.00265
M3 - Conference contribution
AN - SCOPUS:85171137360
T3 - Proceedings - IEEE International Conference on Multimedia and Expo
SP - 1535
EP - 1540
BT - Proceedings - 2023 IEEE International Conference on Multimedia and Expo, ICME 2023
PB - IEEE Computer Society
T2 - 2023 IEEE International Conference on Multimedia and Expo, ICME 2023
Y2 - 10 July 2023 through 14 July 2023
ER -