
Early Success Prediction of Indian Movies Using Subtitles: A Document Vector Approach.

Authors :
Rahul, Vaddadi Sai
Tejas, M.
Prasanth, N. Narayanan
Raja, S. P.
Source :
International Journal of Image & Graphics. Jul 2023, Vol. 23 Issue 4, p1-27. 27p.
Publication Year :
2023

Abstract

Scientific studies of the factors that influence the box office performance of Indian films have generally concentrated on post-production factors, that is, those known only after a film has been completed or released, and mostly on Bollywood films. Fewer studies have looked at regional film industries and pre-production factors, which are known before the decision to greenlight a film is made. This study used natural language processing and machine learning approaches to predict, at the pre-production stage, whether Indian films would be profitable. We extract movie data and English subtitles (as an approximation of the screenplay) for the top five Indian regional film industries: Bollywood, Kollywood, Tollywood, Mollywood, and Sandalwood, which together make up a major portion of the Indian film industry's revenue. Subtitle Vector (Sub2Vec), a Paragraph Vector model trained on English subtitles, was used to embed subtitle text into 50 and 100 dimensions. The proposed approach followed a two-stage pipeline. In the first stage, Return on Investment (ROI) was calculated using aggregated subtitle embeddings and associated movie data. In the second stage, classification models used the ROI from the first stage to predict a film's verdict. The optimal regressor–classifier pair was determined by evaluating the classification models with F1-score and Cohen's Kappa across various hyperparameter settings. Compared with benchmark methods, the proposed methodology forecasts box office success more accurately. [ABSTRACT FROM AUTHOR]
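The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration, not the authors' implementation: the abstract does not name the regressors or classifiers used, so a k-nearest-neighbour regressor and a fixed ROI threshold stand in for them; averaging is assumed as the aggregation step; the verdict is reduced to a binary hit/flop label; and the 2-dimensional vectors are synthetic toy values, not Sub2Vec embeddings (which have 50 or 100 dimensions) or data from the paper.

```python
from statistics import mean

def aggregate(chunk_vectors):
    """Average per-chunk embeddings into one movie vector (assumed aggregation)."""
    return [mean(dim) for dim in zip(*chunk_vectors)]

def knn_regress(train_X, train_y, x, k=2):
    """Stage 1 stand-in: predict ROI as the mean ROI of the k nearest movies."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), y)
        for row, y in zip(train_X, train_y)
    )
    return mean(y for _, y in dists[:k])

def verdict(roi, threshold=0.0):
    """Stage 2 stand-in: 1 (hit) if predicted ROI exceeds the threshold, else 0."""
    return 1 if roi > threshold else 0

def f1_score(y_true, y_pred):
    """F1 for the positive class: 2*TP / (2*TP + FP + FN)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0

def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: (observed - chance agreement) / (1 - chance agreement)."""
    n = len(y_true)
    po = sum(t == p for t, p in zip(y_true, y_pred)) / n
    labels = set(y_true) | set(y_pred)
    pe = sum((y_true.count(l) / n) * (y_pred.count(l) / n) for l in labels)
    # Degenerate case: both label lists contain a single identical label.
    return (po - pe) / (1 - pe) if pe != 1 else 1.0

# Synthetic training set: toy movie vectors with made-up ROI values.
train_X = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
train_roi = [1.5, 0.5, -0.5, -0.3]

# Synthetic test movies, each given as a list of per-chunk vectors.
test_chunks = [
    [[1.0, 0.0], [0.8, 0.2]],
    [[0.0, 1.0], [0.2, 0.8]],
    [[0.7, 0.3], [0.5, 0.5]],
    [[0.3, 0.7], [0.1, 0.9]],
]
y_true = [1, 0, 0, 0]  # made-up ground-truth verdicts

# Stage 1: aggregate embeddings and predict ROI. Stage 2: classify the verdict.
pred_roi = [knn_regress(train_X, train_roi, aggregate(c)) for c in test_chunks]
y_pred = [verdict(r) for r in pred_roi]

f1 = f1_score(y_true, y_pred)
kappa = cohen_kappa(y_true, y_pred)
print(y_pred, round(f1, 3), round(kappa, 3))
```

Reporting Cohen's Kappa alongside F1 is useful when verdict classes are imbalanced, because kappa discounts the agreement a classifier would reach by chance.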

Details

Language :
English
ISSN :
02194678
Volume :
23
Issue :
4
Database :
Academic Search Index
Journal :
International Journal of Image & Graphics
Publication Type :
Academic Journal
Accession number :
169782937
Full Text :
https://doi.org/10.1142/S0219467823500304