AIML Synopsis
AIML Synopsis
Synopsis
Predicting stock price by creating a Machine Learning Model
1. Problem Domain
The financial market is a dynamic and composite system where people can buy and sell
currencies, stocks, equities and derivatives over virtual platforms supported by brokers.
Stock markets are affected by many factors causing the uncertainty and high volatility in the
market. Although humans can take orders and submit them to the market, automated trading
systems (ATS) that are operated by the implementation of computer programs can perform
better and with higher momentum in submitting orders than any human. Since most of the
dealings in the markets are done by automated systems, it has now been well established that
training the past data can help us in finding patterns in the movement of the markets which
can be used to predict the future prices. If implemented successfully with a higher accuracy
than existing systems, it could turn into a financial support system with minimal amount of
risk.
With the innovation in technology and their application in the stock market, the system
has become increasingly complex and volatile which in turn has made human
predictions highly inaccurate, but using Machine Learning to find out the patterns in
the system using historical data can help us predict the future prices more accurately.
4.Objectives
We want to maximize our true positives - days when the algorithm predicts that the price will
go up, and it actually goes up. Therefore, we'll be using precision as our error metric
for our algorithm, which is true positives / (false positives + true positives). This will
ensure that we minimize how much money we lose with false positives (days when we
buy the stock, but the price actually goes down). This means that we will have to accept
a lot of false negatives - days when we predict that the price will go down, but it
actually goes up. This is okay, since we'd rather minimize our potential losses than
maximize our potential gains.
5. Dataset Details
The dataset will be imported through kaggle easily. For training the model we would just
need prices of the required stocks over certain periods of time. The parameters that the
data will be trained on are OHLC ( open, high, low, close) and some additional
parameters(indicators) to achieve even higher accuracy that will comprise the innovative
component of the proposed system.
6. Methodology
We will be using a Random Forest Classification algorithm as the dataset that we train is
completely discrete and we will be using several indicators to calculate the data on which
the training will be performed.
7. Expected Outcome
We aim to create a Machine Learning model to predict the future movements in the prices
of stocks. This will help us in gaining an edge over the traditional investors and can lead to
monetary advantages.