Review 2

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 21

FORECASTING FILM

FINANCIAL SUCCESS WITH


MACHINE LEARNING
Guided By:
Mrs. V.Rajeswari,M.E.,
Dept of Information Technology
Panimalar Institute Of Technology

Team Members: (BATCH N0:13)


Gokula Krishnan.A (211520205043)
Gokulnath.K (211520205044)
Ram Vilas.H (211520205112)
ABSTRACT
• The project aims to develop a robust predictive model for estimating box
office revenue using machine learning techniques.

• In the dynamic and highly competitive film industry, accurate box office
revenue predictions play a pivotal role in strategic decision-making for
filmmakers, distributors, and investors.

• Traditional methods of forecasting often fall short in capturing the


intricate patterns and factors influencing a movie's success.

• Leveraging machine learning algorithms, this project seeks to harness the


power of data-driven insights to enhance the accuracy of box office
revenue predictions.
INTRODUCTION
• The Box Office Revenue Prediction project aims to leverage machine
learning techniques to forecast the financial success of upcoming movies.

• By analyzing historical data, features such as genre, cast, budget, and


release date will be utilized to build a predictive model.

• This project not only serves as a practical application of machine learning


in the entertainment industry but also contributes to informed decision-
making for film producers and distributors.

• Let's delve into the methodology and key components of this predictive
model.
LITERATURE SURVEY
S.NO YEAR TITLE AUTHOR MERITS DEMERITS

1. MOVIE BOX Dawei Li 1 and The paper proposes a The study's


2018 logistic regression exclusive focus on
OFFICE GROSS Zhi-Ping Liu based method for economic factors
PREDICTION 2,*ORCID predicting box office and limited
USING MACHINE revenues, comparison to only
demonstrating its two countries may
LEARNING effectiveness through limit the accuracy
comparative analysis in and generalizability
the US and China, and of the prediction
considers economic model.
factors for prediction.
S.NO YEAR TITLE AUTHOR MERITS DEMERITS

2. 2019 MOVIE BOX Samrudhi Abbad The paper compares


the accuracy of linear The study may not
OFFICE GROSS , ParyulJain4 regression and account for other
PREDICTION , Prof. Pooja Mishra1 polynomial regression
USING to predict movie factors that could
MACHINE revenue in the film
industry. impact a theatre
LEARNING owner's decision-
making process for
which movie to show.
EXISTING SYSTEM
● Given two datasets containing various attributes, use the features available in
the dataset and define a supervised classification algorithm which can identify
whether the movie gross getting correct predicted values or not.

● This data set contains many movie gross records. The data set records were
collected all over the world.

● Techniques used in Paper:


• Logistic Regression
• Supervised learning algorithms
PROPOSED SYSTEM
● This project aims to FILM FINANCIAL SUCCESS gross using Linear Regression.

● The study found that Linear Regression had the highest accuracy and can be
used to recommend the best prediction technique. The project used Google
Colab and evaluated essential attributes for movie gross prediction.

● The final results can be displayed using a Flask app with an HTML user
interface for movie box office gross prediction.
ARCHITECTURE DIAGRAM
HARDWARE REQUIREMENTS
• Windows 8.1 or 10

• internet connection to download and activate

• Minimum 10GB free disk space

• Administrator access to install and run Anaconda Navigator


SOFTWARE REQUIREMENTS
• Google colaboratory Notebook or jupyter Notebook
• Spyder and Pycharm Community
• Microsoft Excel 2013
MODULES
The project was completed in three phases:
1. Data Extraction

2. Data Processing and Analysis

3. Model Building
1.Data Extraction

A significant part of the dataset required for the


project was extracted from the global TMDB
dataset using its APIs. Following this, the
OMDB API was used to extract the MPAA
ratings and IMDb ratings and votes of each film
in the dataset. The final dataset has the features-
Genres, ID, Original Language, Original Title,
Overview, Popularity Rating, Release Date,
Title, TMDB Rating, TMDB Vote Count, IMDb
ID, Budget, Revenue, Production Companies,
Cast, Crew, Production Countries, Spoken
Languages, Runtime, Tagline, MPAA Rating,
IMDb Rating, IMDb Vote Count and Star Power.
2.Data Processing and Analysis

● Identifying Inconsistencies, Errors, and Missing Values: This involves


thoroughly examining the dataset to spot any discrepancies, such as outliers,
incorrect data entries, or missing values. For instance, missing values might
occur due to data collection errors or incomplete records.

● Rectifying Data Integrity Issues:


Once identified, inconsistencies and errors
need to be addressed. This may involve
imputing missing values, removing outliers,
or correcting erroneous entries. The goal is
to ensure that the data is accurate and
reliable for analysis.
Model Building
The preprocessed data was passed through different regression
models .Specific functions were written to calculate the model’s performance,
i.e., the train Mean RMSE and test mean RMSE for the final model. The 4
most important features according to the results of each regression model used
were found.
OUTPUT SCREENSHOTS
ADVANTAGES:

● Efficiency in workflow: One of the first desires that probably comes to mind is
efficiency. When building your website, you want to be able to reach as many people as
you can. The system predicts an approximate success rate of a film based on its
profitability by analyzing data.

● Reduce costs: You don't need a large time to wait for the results with in a second the
amount will be predicted.
● Using machine learning algorithms to predict film box office gross in data sets. Various
kinds of data sets, have to use this to train classifier algorithms to predict film gross with
good accuracy.
Disadvantages:
● Any single error in data set can change the entire data.
● Correct accuracy must be needed while doing the project using supervised machine
learning algorithms.
● Python code should be correct without any error.

You might also like