A REVIEW: ON STOCK MARKET PREDICTION USING MACHINE LEARNING ALGORITHMS

ISSN: 1004-9037
https://sjcjycl.cn/
DOI: 10.5281/zenodo.98549673

A Review on Stock Market Prediction Using Machine Learning Algorithms

Harish G N,
Department of CSE
Research scholar at Presidency University, Bangalore

Dr. Murali Parameswaran,
Department of CSE
Professor at Presidency University, Bangalore

Stock market forecasting is a long-standing problem at the intersection of finance and computer
science. The Efficient Market Hypothesis (EMH) takes a pessimistic view of it, positing that financial
markets are efficient [Fama, 1965], so that no form of analysis, technical or fundamental, can generate
reliable excess returns for investors. Scholars nevertheless disagree about the validity of the EMH
[Malkiel, 2003]. Studies are under way to measure how efficiency varies between developed and
developing markets, and efforts continue to build robust predictive models for stock markets, which is
also the subject of this review. We begin with fundamental and technical analysis. Fundamental analysis
assesses a stock on the basis of its intrinsic value, commonly called fair value, whereas technical
analysis relies solely on the interpretation of charts and trends. Technical indicators, derived from
practitioners' experience, can serve as hand-crafted input features for both machine learning and deep
learning models. We then introduce linear models that have been applied to stock market forecasting,
including the autoregressive integrated moving average (ARIMA) [Hyndman & Athanasopoulos, 2018] and
generalized autoregressive conditional heteroskedasticity (GARCH) [Bollerslev, 1986] models. Machine
learning models such as logistic regression and support vector machines [Alpaydin, 2014] have also been
applied to the task. The core of our survey covers recent advances in deep learning, in particular the
range of deep neural network architectures described by Goodfellow et al. (2016). The success of deep
learning in recent years can be attributed to the large volumes of data available from the Internet, the
parallel processing capabilities of graphics processing units (GPUs), and the convolutional neural
network family. These have enabled deep learning to excel in domains such as image classification
[Rawat & Wang, 2017; Jiang & Zhang, 2020], object detection [Zhao et al., 2019], and time series
prediction [Brownlee, 2018; Jiang & Zhang, 2018]. Deep learning models have been reported to outperform
both linear and traditional machine learning models in stock market prediction, owing to their ability to
handle large datasets and to capture the intricate, nonlinear relationships between input features and
prediction targets.

Journal of Data Acquisition and Processing Vol. 38 (3) 2023 2813

1. Introduction
The stock market is noisy [Fischer et al. 2018] and is widely regarded as efficient in the semi-strong
form [Malkiel BG et al. 1970], which makes it challenging to analyze and predict. Even a moderately
accurate forecast can increase the likelihood of profitable outcomes and reduce exposure to market
uncertainty. Nevertheless, financial economists frequently question whether opportunities for profitable
prediction exist at all [Zhou F et al. 2019].
Artificial intelligence has been applied to time series that exhibit chaotic and random behavior
[Yan D et al. 2017, Wang J-J et al. 2012]. The widespread use of intelligent predictive models has
conventionally been studied within machine learning [Henrique BM et al. 2019]. Compared with
conventional models, machine learning models are more flexible [Zhang Y et al. 2009], do not require
distributional assumptions, and allow individual classifiers to be combined easily to reduce variance
[Kotecha K et al. 2015]. Many automated methods have been applied to stock market prediction
[Kotecha K et al. 2015], including logistic regression (LR), neural networks (NNs) [Frances et al. 2005,
Chen A-S et al. 2003, Moghaddam AH et al. 2016], deep neural networks (DNNs), and decision trees (DTs)
[Krauss C et al. 2017]. Researchers have also used support vector machines (SVMs), support vector
regression (SVR), k-nearest neighbors (KNN), random forests (RFs), long short-term memory networks
(LSTMs), and restricted Boltzmann machines (RBMs) to forecast stock market movements, as in the studies
by Wu M-C et al. (2006), Lee M-C et al. (2009), Pai P-F et al. (2005), Kim K-j et al. (2003), Khalid
Alkhatib et al. (2013), Zhang N et al., Krauss C et al. (2017), Bao W et al. (2017, 2019), Qiu J et al.
(2020), and Liang Q et al. (2017). Fischer and Krauss (2018) applied LSTM networks to forecast the
directional movements of the constituent stocks of the S&P 500 from 1992 to 2015 and compared them with
other machine learning techniques; the LSTM networks outperformed RF, DNN, and LR. Kotecha et al. (2015)
compared four models, Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF),
and Naïve Bayes, on the CNX Nifty and S&P BSE Sensex indices and on Infosys Ltd. and Reliance Industries
stocks from the Indian stock market. Goo YJ et al. (2007) used a neural network model to forecast the
daily closing prices of the FTSE 100 Share Index in the United Kingdom over five-day and twenty-five-day
periods, and used multiple linear regression to compare the predictions of the two models. Chen et al.
(2016) implemented deep neural networks (DNNs) and analyzed their effectiveness over a one-day horizon.
Artificial intelligence encompasses the capacity of a system to learn from prior experience and improve
its performance without frequent reprogramming. According to Cheng, Li-Chen et al. (2018), fluctuations
in long-term supply rates typically follow a linear pattern. Investors choose to put their money into
equities that are expected to rise in value, yet many are reluctant to buy stocks because stock prices
are volatile. It is therefore important to produce accurate stock price forecasts that hold up in
real-world scenarios. This work draws on predictive methods including linear regression, long short-term
memory, Facebook Prophet, and k-nearest neighbors. The notable success of machine learning (ML) across
various sectors has spurred interest and continued research into its applications in finance [Nguyen et
al., 2015; Kim and Kang, 2019]. The present study therefore examines the use of machine learning in
financial methods and algorithms, with a specific focus on the prediction of stock prices.

The need for stock market forecasting:


By investing in the stock market, an investor seeks to earn profitable returns. Forecasting the market
has attracted investors' interest because anticipating its future movements can yield profits. The
accuracy of forecasts of stock market fluctuations depends on prior knowledge. Stock market prediction
systems [D. Enke et al., 2011] support the monitoring and control of the market, enabling users to make
sound decisions. For the stock market to operate effectively, comprehensive data on industrial stocks
across the entire financial sector is required [H. Chung et al., 2018].
These decisions are adjusted to investors' prevailing business circumstances [X. Li et al., 2016;
E. Chong et al., 2016; X. Pang et al., 2020], taking both purchases and sales into account. Projections
of future income, earnings announcements, changes in management, and other events all influence the
market's standing. Accurate stock market prediction matters because it supports sound decision-making by
investors. By using machine learning methods, investors may be able to increase their profits without
taking on excessive risk. Figure 1 depicts the process of stock market prediction.
In Figure 1, the first step of data collection is to acquire real-time data from a range of websites and
historical databases, such as NASDAQ [K. Zhang et al., 2019], depending on their price indices.
Precisely determining the direction of the stock market (SM) is a formidable challenge because the
available historical data are non-linear. Stock price movement is commonly interpreted as a directional
indicator and used for prediction. Assessing the direction of future stock price movements is of great
importance to investors in gauging market risk. Modeling the direction of stock price movement has long
been regarded as difficult because of the high volatility, anomalies, and noisy signals present in
securities markets. In recent decades the subject has attracted scholars from various disciplines,
particularly artificial intelligence [Fatih Ecer et al., 2020]. If the software indicates a surplus
[J. Li et al., 2017], the shareholder may use the equity for profitable trades. Conversely, when the
price index is unfavorable [E. Guresen et al., 2011], emphasis is placed on improving the application to
support better decision-making.
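
As a minimal sketch of how stock price movement can be turned into a directional target, the Python
snippet below labels each day as "up" (1) or "not up" (0) according to the next day's close. The function
names and the price list are illustrative assumptions and are not taken from any of the reviewed studies.

```python
from typing import List


def direction_labels(closes: List[float]) -> List[int]:
    """Return 1 where the next close is strictly higher than the current one, else 0."""
    return [1 if nxt > cur else 0 for cur, nxt in zip(closes, closes[1:])]


def simple_returns(closes: List[float]) -> List[float]:
    """Return the one-step simple returns (next - current) / current."""
    return [(nxt - cur) / cur for cur, nxt in zip(closes, closes[1:])]


# Hypothetical closing prices, used only to illustrate the labelling.
closes = [100.0, 101.5, 101.0, 103.2, 103.2]
labels = direction_labels(closes)   # [1, 0, 1, 0]
returns = simple_returns(closes)
print(labels)
```

A series of n prices yields n - 1 labels, because the last day has no following close to compare with.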

[Figure 1: flowchart. Box labels: Extracting data from various sources; Inputs of various stocks;
Processing of the Data; Extraction and selection of features; ML Techniques; Detailed Analysis;
Prediction; Updates & Notifications; Decision Making]

Fig.1 Process of stock market prediction

Classification of the reviewed articles about financial stock market prediction using computational
techniques and machine learning techniques

Reviewed Articles About Stock Market Prediction Using Computational Techniques

| Sl.No | Author & Article name | Technique Name | Algorithm Name | Metric Name |
|---|---|---|---|---|
| 1. | Fischer T, Krauss C. Deep learning with long short-term memory networks for financial market predictions. Eur J Oper Res. 2018;270(2):654–69. | Long short-term memory (LSTM) | Random forest (RAF), a deep neural net (DNN), and a logistic regression classifier | The authors find one common pattern among the stocks selected for trading: high volatility and a short-term reversal return profile. |
| 2. | Zhou F, Zhang Q, Sornette D, Jiang L. Cascading logistic regression onto gradient boosted decision trees for forecasting and trading stock indices. Applied Soft Computing. 2019;84:105747. | Cascading the logistic regression (LR) model onto the gradient boosted decision trees (GBDT) model | Logistic regression algorithm, Gradient-boosted decision trees, Support vector machine algorithm | 1. Raw price data and twelve technical indicators are employed for extracting the information contained in the stock indices. 2. Consideration of transaction cost and buy–sell thresholds, contributing to exploit short-term strategies for more stock indices data |
| 3. | Yan D, Zhou Q, Wang J, Zhang N. Bayesian regularisation neural network based on artificial intelligence optimisation. Int J Prod Res. 2017;55(8):2266–87. | Bayesian-regularised artificial neural networks (BR-ANN) | Particle swarm optimisation (PSO) algorithm | Daily market prices and financial technical indicators are utilised as inputs to predict the one day future closing price of the Shanghai (in China) composite index. |
| 4. | Wang J-J, Wang J-Z, Zhang Z-G, Guo S-P. Stock index forecasting based on a hybrid model. Omega. 2012;40(6):758–66. | Hybrid approach combining exponential smoothing model (ESM), autoregressive integrated moving average model (ARIMA), and the back propagation neural network (BPNN) | Genetic algorithm | The closing of the Shenzhen Integrated Index (SZII) and opening of the Dow Jones Industrial Average Index (DJIAI) |
| 5. | D. Enke, M. Grauer, N. Mehdiyev, Stock market prediction with multiple regression, fuzzy type-2 clustering, and neural networks, Procedia Comput. Sci. 1 (6) (2011) 201–206. | Three-stage stock market prediction system: Multiple Regression Analysis, Differential Evolution-based type-2 Fuzzy Clustering, a Fuzzy type-2 Neural Network | Multiple Regression Analysis | 3-month Certificate of Deposit (CDR3) rate, past S&P 500 (SP500) Index level, past Money Supply (M1) level, recent Industrial Production (IP) reading, and the recent Producer Price Index (PPI) |
| 6. | H. Chung, K.S. Shin, Genetic algorithm-optimized long short-term memory network for stock market prediction, Sustainability 10 (10) (2018) 3765. | Deep learning technique | Long short-term memory (LSTM) network and genetic algorithm (GA) | Korea Stock Price Index (KOSPI) data: high price, low price, opening price, closing price, and trading volume for 10 days |
| 7. | K.J. Kim, W.B. Lee, Stock market prediction using artificial NN with optimal feature transformation, Neural Comput. Appl. 13 (3) (2004) 255–260. | Artificial neural networks with GA | Genetic Algorithm (GA) | Technical indicators and the direction of change in the daily KOSPI: 2,348 trading days data |

Reviewed Articles About Stock Market Prediction Using Machine Learning Techniques

| Sl.No | Author & Article name | Technique Name | Algorithm Name | Metric Name |
|---|---|---|---|---|
| 1. | Henrique BM, Sobreiro VA, Kimura H. Literature review: machine learning techniques applied to financial market prediction. Expert Syst Appl. 2019;124:226–51. | Bibliographic survey techniques | Direct citation network construction algorithm, search path counting algorithm | Most cited ML technique papers are considered for market prediction |
| 2. | Kotecha K. Predicting stock market index using fusion of machine learning techniques. Expert Syst Appl. 2015;42(4):2162–72. | Fusion of SVR and ANN | SVR, ANN, RF | Ten years of historical data: the ten technical indicators used are calculated from close, high, low and opening prices of these indices |
| 3. | Kim K-j. Financial time series forecasting using support vector machines. Neurocomputing. 2003;55(1–2):307–19. | SVM in contrast with back-propagation neural networks | SVM | Technical indicators such as Commodity channel index, Relative strength index, Accumulation/distribution oscillator etc. |
| 4. | X. Li, H. Xie, R. Wang, Y. Cai, J. Cao, F. Wang, X. Deng, Empirical analysis: stock market prediction via extreme learning machine, Neural Comput. Appl. 27 (1) (2016) 67–78. | Extreme learning machine model | Support vector machine (SVM) and back-propagation neural network (BP-NN) | Intra-day tick-by-tick data of the H-share market and contemporaneous news archives |
| 5. | E. Chong, C. Han, F.C. Park, Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies, Expert Syst. Appl. 83 (2017) 187–205. | Deep neural network model | DNN | High-frequency intraday stock returns |
| 6. | K. Zhang, G. Zhong, J. Dong, S. Wang, Y. Wang, Stock market prediction based on the generative adversarial network, Procedia Comput. Sci. 147 (2019) 400–406. | Generative Adversarial Network (GAN) with the Multi-Layer Perceptron (MLP) | Long Short-Term Memory (LSTM) | Open Price, Highest Price, Lowest Price, Close Price, Turnover Volume, Turnover Rate: 5000 pieces of data from each stock |
| 7. | J. Li, H. Bu, J. Wu. (2017, June). Sentiment-aware stock market prediction: A deep learning method. In 2017 international conference on service systems and service management (pp. 1-6). IEEE. | Long Short-Term Memory (LSTM) neural network model | Naïve Bayes classification algorithm and LSTM | Investor sentiment messages from popular discussion boards using web crawler. |
| 8. | Cheng, Li-Chen, Yu-Hsiang Huang, and Mu-En Wu. "Applied attention-based LSTM neural networks in stock prediction." 2018 IEEE International Conference on Big Data (Big Data). IEEE, 2018. | Historical stock data and technical indicators to predict future stock price movement by using an attention-based long short-term memory model | Attention-based LSTM | Stock price data (Open, Close, Low, High, Volume) from the Taiwan Stock Exchange Corporation (TWSE), and calculating the technical indicators (KD, MA, RSV, etc.) |
| 9. | Fatih Ecer, Sina Ardabili, Shahab S. Band, Amir Mosavi, Training Multilayer Perceptron with Genetic Algorithms and Particle Swarm Optimization for Modeling Stock Price Index Prediction. Entropy 2020, 22(11), 1239. | Hybrid model of MLP and GA | Multilayer perceptron–genetic algorithms (MLP–GA) and Multilayer perceptron–particle swarm optimization algorithm | Nine technical indicators for each trading day were utilized, such as Momentum, Relative Strength Index, Moving Average Convergence |
| 10. | E. Guresen, G. Kayakutlu, T.U. Daim, Using artificial neural network models in stock market index prediction, Expert Syst. Appl. 38 (8) (2011) 10389–10397. | Multi-layer perceptron (MLP), dynamic artificial neural network (DAN2) model | MLP, DAN2 | Daily stock exchange rates of NASDAQ |
| 11. | Yanjie. Hu, Juanjuan. Pang, "Financial crisis early warning based on support vector machine," In: International Joint Conference on Neural Networks, pp. 2435-2440, 2008. | Contrastive analysis is made between SVM model and the Logistic model | SVM | Financial indexes are grouped into eight categories viz liquidity, asset management, ability to pay back the long term liability, financial structure, develop ability and profitability and so on. |
| 12. | X. Pang, Y. Zhou, P. Wang, W. Lin, V. Chang, An innovative neural network approach for stock market prediction, J. Supercomput. 76 (3) (2020) 2098–2118. | Long short-term memory neural network model using Stock vector, StockCrawler | LSTM and ELSTM | Stock data, stock news, capital stock and shareholders, and financial analysis, etc. using web crawler |

TABLE FOR ERROR VALUE

| Sl No | Technique Name | Least error value obtained | Paper reference |
|---|---|---|---|
| 1 | Long short-term memory (LSTM). Algorithms used: Random forest (RAF), a deep neural net (DNN), and a logistic regression classifier | Long term reversal. LSTM: (0.0683). RAF: (0.0663). DNN: (0.0785) | Fischer T, Krauss C. Deep learning with long short-term memory networks for financial market predictions. Eur J Oper Res. 2018;270(2):654–69. |
| 2 | Cascading the logistic regression (LR) model onto the gradient boosted decision trees (GBDT) model. Algorithms used: Logistic regression algorithm, Gradient-boosted decision trees, Support vector machine algorithm | LR: 0.66. GBDT: 0.51. SVM: 0.66 | Zhou F, Zhang Q, Sornette D, Jiang L. Cascading logistic regression onto gradient boosted decision trees for forecasting and trading stock indices. Applied Soft Computing. 2019;84:105747. |
| 4 | Bayesian-regularised artificial neural networks (BR-ANN). Algorithms used: Particle swarm optimisation (PSO) algorithm | Bayesian-regularised ANN: 0.85%. Fusion model (HMM, ANN, GA): 0.8487%. ARIMA model: 0.9723% | Yan D, Zhou Q, Wang J, Zhang N. Bayesian regularisation neural network based on artificial intelligence optimisation. Int J Prod Res. 2017;55(8):2266–87. |
| 5 | Hybrid approach combining exponential smoothing model (ESM), autoregressive integrated moving average model (ARIMA), and the back propagation neural network (BPNN) | BPNN forecasting model: training error 0.016087, testing error value 0.013231 | Wang J-J, Wang J-Z, Zhang Z-G, Guo S-P. Stock index forecasting based on a hybrid model. Omega. 2012;40(6):758–66. |
| 6 | Deep learning technique: Long short-term memory (LSTM) network and genetic algorithm (GA) | GA–LSTM hybrid model: 0.91%. The predicted MAPE of the benchmark, which expresses accuracy as a percentage of error, is 1.10% | H. Chung, K.S. Shin, Genetic algorithm-optimized long short-term memory network for stock market prediction, Sustainability 10 (10) (2018) 3765. |
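
Because several entries in the error table are reported as MAPE (mean absolute percentage error), a
short Python sketch of the standard MAPE formula may help in reading them. The function name and the
sample values are illustrative assumptions and do not reproduce any result from the reviewed papers.

```python
from typing import Sequence


def mape(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute percentage error, in percent.

    MAPE = 100 / n * sum(|a_t - p_t| / |a_t|); undefined when any actual value is zero.
    """
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("actual and predicted must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


# Hypothetical index levels and forecasts: errors of 10% and 5% average to about 7.5%.
print(mape([100.0, 200.0], [110.0, 190.0]))
```

A lower MAPE means the forecasts are, on average, closer to the actual values in relative terms.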

Table based on Mathematical Model.

| Sl No | Technique Name | Mathematical Model | Paper reference |
|---|---|---|---|
| 1 | Benchmark three ensemble methods against four single classifiers | Multiple Regression Analysis | Ballings, M., den Poel, D. V., Hespeels, N., & Gryp, R. (2015). Evaluating multiple classifiers for stock price direction prediction. Expert Systems with Applications, 42(20), 7046–7056. |
| 2 | Bayesian-regularised artificial neural networks (BR-ANN) | Particle swarm optimisation (PSO) algorithm | Yan D, Zhou Q, Wang J, Zhang N. Bayesian regularisation neural network based on artificial intelligence optimisation. Int J Prod Res. 2017;55(8):2266–87. |
| 3 | Hybrid approach combining exponential smoothing model (ESM), autoregressive integrated moving average model (ARIMA), and the back propagation neural network (BPNN) | Genetic algorithm | Wang J-J, Wang J-Z, Zhang Z-G, Guo S-P. Stock index forecasting based on a hybrid model. Omega. 2012;40(6):758–66. |
| 4 | Generative Adversarial Network (GAN) with the Multi-Layer Perceptron (MLP) | Long Short-Term Memory (LSTM) Implementation | K. Zhang, G. Zhong, J. Dong, S. Wang, Y. Wang, Stock market prediction based on the generative adversarial network, Procedia Comput. Sci. 147 (2019) 400–406 |
| 5 | Neural networks based on neural networks | Correlation Model | Mo, H., & Wang, J. (2017). Return scaling cross-correlation forecasting by stochastic time strength neural network in financial market dynamics. Soft Computing, 1(1), 1–13. |
| 6 | Neural networks, SVM, RF | Return, volume, volatility prediction model | Oliveira, N., Cortez, P., & Areal, N. (2017). The impact of microblogging data for stock market prediction: Using Twitter to predict returns, volatility, trading volume and survey sentiment indices. Expert Systems with Applications, 73(1), 125–144. |

Classification table based on duration/time interval

| Sl No | Type of Market (Equity/index) | Time period | Market | Paper reference |
|---|---|---|---|---|
| 1 | Stocks | 5 years of data | Europe | Ballings, M., den Poel, D. V., Hespeels, N., & Gryp, R. (2015). Evaluating multiple classifiers for stock price direction prediction. Expert Systems with Applications, 42(20), 7046–7056. |
| 2 | Stocks | 6 years | Taiwan | Chang, P.-C., Liu, C.-H., Lin, J.-L., Fan, C.-Y., & Ng, C. S. (2009). A neural network with a case based dynamic window for stock trading prediction. Expert Systems with Applications, 36(3, Part 2), 6889–6898. |
| 3 | Indices | 4 years | China | Chen, H., Xiao, K., Sun, J., & Wu, S. (2017). A double-layer neural network framework for high-frequency forecasting. ACM Transactions on Management Information Systems (TMIS), 7(4), 11:2–11:17. |
| 4 | Stocks | 5 years | USA | Gorenc Novak, M., & Velušček, D. (2016). Prediction of stock price movement based on daily high prices. Quantitative Finance, 16(5), 793–826. |
| 5 | Index | 3 years | Taiwan | Huang, C.-L., & Tsai, C.-Y. (2009). A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting. Expert Systems with Applications, 36(2), 1529–1539. |
| 6 | Index | 4 years | USA | Pan, Y., Xiao, Z., Wang, X., & Yang, D. (2017). A multiple support vector machine approach to stock index forecasting with mixed frequency sampling. Knowledge-Based Systems, 122(1), 90–102. |

Research approach
The primary objective of this review is to collate empirical evidence on the application of machine
learning models to stock market forecasting. One research question (Q4) is addressed with the
vote-counting method and five research questions (Q1, Q2, Q3, Q4, Q5) with the narrative synthesis
method. The research questions guide the extraction of information from the selected studies and are as
follows:
Q1. What are the diverse statistical tools utilized in analyzing the stock market?
Q2. What types of machine learning (ML) algorithms are utilized for predicting the stock market?
Q3. What are the various datasets employed in predicting the stock market?
Q4. Has a hybrid method of ML models been used to predict the stock market?
Q5. What are the different performance metrics employed in stock market forecasting?

4. Results and discussions

We systematically compiled a selection of scholarly articles that match our research questions. In this
section we discuss each of the research questions set out earlier.

Q1. What are the diverse statistical tools utilized in analyzing the stock market?
After a rigorous selection process, we analyzed the selected studies and extracted the relevant
information. The statistical methods used in these studies provide a descriptive basis for understanding
the stock market. Some studies use ARIMA (Autoregressive Integrated Moving Average), regression, and
clustering to predict the stock market.

Each method is explained below:

ARIMA [K.J. Kim et al., 2004] is a statistical method for analyzing time series data. Its primary
function is to forecast future trends, which also improves understanding of the dataset.
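
To make the ARIMA idea concrete, the following is a minimal sketch, in plain Python, of an ARIMA(1,1,0)
one-step forecast: the series is differenced once (the "I" part), an AR(1) model is fitted to the
differences by ordinary least squares, and the predicted difference is added back to the last
observation. This is a simplification under stated assumptions: it has no moving-average term, it uses
least squares rather than the maximum-likelihood estimation typical of statistical packages, and the
function names and data are hypothetical.

```python
from typing import List, Tuple


def difference(series: List[float]) -> List[float]:
    """First difference: d[t] = y[t+1] - y[t]."""
    return [b - a for a, b in zip(series, series[1:])]


def fit_ar1(x: List[float]) -> Tuple[float, float]:
    """Least-squares fit of x[t] = c + phi * x[t-1]; returns (c, phi)."""
    prev, curr = x[:-1], x[1:]
    n = len(prev)
    mean_prev = sum(prev) / n
    mean_curr = sum(curr) / n
    var_prev = sum((p - mean_prev) ** 2 for p in prev)
    if var_prev == 0:
        # Lagged values are constant, so phi is not identifiable: fall back to the mean.
        return mean_curr, 0.0
    phi = sum((p - mean_prev) * (c - mean_curr) for p, c in zip(prev, curr)) / var_prev
    return mean_curr - phi * mean_prev, phi


def forecast_next(series: List[float]) -> float:
    """One-step-ahead ARIMA(1,1,0)-style forecast of the next level of the series."""
    if len(series) < 3:
        raise ValueError("need at least three observations")
    d = difference(series)
    c, phi = fit_ar1(d)
    return series[-1] + c + phi * d[-1]


# Hypothetical closing prices with a steady upward drift of 1 per day.
print(forecast_next([1.0, 2.0, 3.0, 4.0, 5.0]))
```

For real data, an established implementation with proper order selection and diagnostics would be
preferable to this hand-rolled fit.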
Clustering groups together entities that have similar attributes. Stocks that are highly correlated are
grouped together, while those with lower correlation are placed in a separate group. This iterative
procedure continues until every stock has been assigned to a group. Table 1 lists the selected studies
by the statistical technique they use to forecast the stock market. ARIMA accounts for only 18% of the
selected studies. Stock market prediction is a frequently discussed topic, but only a single study
applied it in the context of corporate mergers.
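
The correlation-based grouping described above can be sketched as follows. This is a simple greedy
threshold scheme, chosen for illustration only; the reviewed studies may use different clustering
algorithms. The ticker names, return series, and the 0.8 threshold are all hypothetical.

```python
from math import sqrt
from typing import Dict, List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient of two equal-length series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / (sd_x * sd_y)


def group_by_correlation(returns: Dict[str, List[float]], threshold: float = 0.8) -> List[List[str]]:
    """Assign each stock to the first group whose members all correlate with it at or above
    the threshold; otherwise start a new group. Stops when every stock has a group."""
    groups: List[List[str]] = []
    for name, series in returns.items():
        for group in groups:
            if all(pearson(series, returns[member]) >= threshold for member in group):
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


# Hypothetical daily returns: B moves with A, C moves against both.
returns = {
    "A": [0.01, -0.02, 0.03, 0.00],
    "B": [0.02, -0.04, 0.06, 0.00],
    "C": [-0.01, 0.02, -0.03, 0.00],
}
print(group_by_correlation(returns))  # [['A', 'B'], ['C']]
```

The result of a greedy scheme like this depends on the order in which stocks are visited, which is one
reason hierarchical or k-means style clustering is often preferred in practice.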

Table 1 Statistical techniques used by selected studies

| Tools | Selected Studies | Percentage |
|---|---|---|
| ARIMA | S8,S3,S8,S13,S14,S16 | 18 |
| Clustering | S17 | 3 |

Inquiry number two pertains to the specific machine learning (ML) algorithms utilized for the purpose of
predicting the stock market.

The preponderance of the chosen subjects employ machine learning or deep learning techniques in
order to prognosticate the stock market. A pair of scholarly investigations have been chosen that employ
a merged methodology to enhance precision in prognosticating stock market trends. The primary focus
of this section pertains to the various techniques employed in the prediction of stock market trends. The
prevailing methods utilized for forecasting are explicated as follows: Support Vector Machines (SVM) is

Journal of Data Acquisition and Processing Vol. 38 (3) 2023 2813



a machine learning algorithm that is widely used for classification and regression analysis. It is
based on finding the optimal separating hyperplane and is widely regarded as an effective approach
to time series prediction (Schumacher and Chen, 2009). An SVM can classify the future direction of a
stock price as either rising or falling. The algorithm represents each data item as a point in an
n-dimensional space, where each coordinate is one of the stock market features. SVM is widely
regarded as one of the most effective predictive tools for financial markets.
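
As a rough illustration of how such a classifier separates "up" days from "down" days, the sketch below trains a linear SVM by sub-gradient descent on the regularised hinge loss. It is a toy under stated assumptions: the two features (for example, previous-day return and a momentum value, both in percent), the hyperparameters `lam`, `lr`, and `epochs`, and the function names are all hypothetical, and the surveyed studies typically rely on kernel SVMs from established libraries.

```python
def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Fit weights w and bias b; labels are +1 (price rises) or -1 (price falls)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            if yi * score < 1:
                # Inside the margin or misclassified: the hinge term is active.
                w = [wj - lr * (lam * wj - yi * xj) for wj, xj in zip(w, xi)]
                b += lr * yi
            else:
                # Correct with enough margin: only the regulariser shrinks w.
                w = [wj - lr * lam * wj for wj in w]
    return w, b


def predict_direction(w, b, x):
    """Return +1 (up) or -1 (down) according to the side of the hyperplane."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


X = [[2.0, 1.0], [3.0, 2.0], [-2.0, -1.0], [-3.0, -1.5]]  # hypothetical features
y = [1, 1, -1, -1]
w, b = train_linear_svm(X, y)
print(predict_direction(w, b, [5.0, 3.0]), predict_direction(w, b, [-5.0, -3.0]))
```
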
The Support Vector Regression (SVR) model (Yanjie Hu et al., 2008) is based on the same principles
as the Support Vector Machine (SVM) model. The two models are similar but differ in some details:
SVR is commonly used to predict stock prices, whereas SVM is typically used to forecast stock market
trends from their time series. Predicting stock market indices is an important research area for
investment and practical applications, because effective trading strategies can yield higher profits
and returns while reducing risk. Yingjun Chen et al. found that the Feature Weighted Support Vector
Machine (FWSVM) is more accurate than the conventional SVM when predicting binary labels (profit or
loss) over short, medium, and long horizons. FWSVM outperforms SVM by 3.4% for 1-day-ahead
prediction, 3.2% for 5 days, 2.6% for 10 days, 1.6% for 15 days, 1.4% for 20 days, and 1.0% for 30 days.

The Generative Adversarial Network (GAN) is a neural network architecture in which two models
work against each other to generate new data. The GAN framework (Zhang et al., 2019) sets these two
models against each other in a two-player adversarial game. Within the adversarial cycle, the model
that generates data closely resembling real data can be thought of as a "forger" (the generator),
while the model that acts as a "judge", distinguishing real data from generated data, is called the
discriminator. Xingyu Zhou et al. proposed a simple yet effective model for predicting stock market
trends, named GAN-FD, which is intended to help people without financial expertise and everyday
investors make sound investment decisions. GAN-FD takes a concise set of 13 technical indices as
input, which avoids complicated pre-processing of the input data. He et al. used a hybrid sequential
GAN framework to forecast stock index movements. Their experiments showed that hybrid sequential
GANs predict stocks better than earlier work that relied on a single algorithm. In particular, the
Gated Long Short-Term Memory (G-LSTM) model augmented with Deep Long Short-Term Memory
(D-LSTM) and the G-LSTM model augmented with Deep Gated Recurrent Unit (D-GRU) outperformed
the other models.
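
The adversarial game between the two models is usually written as a minimax objective. The standard GAN formulation is shown here for orientation only: the symbols (generator G, discriminator D, data distribution p_data, noise prior p_z) are not defined in the surveyed text, and the cited stock prediction models may modify this loss.

```latex
\min_{G} \max_{D} \; V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_{z}(z)}\big[\log\big(1 - D(G(z))\big)\big]
```

The discriminator D tries to maximise V by assigning high probability to real samples x and low probability to generated samples G(z), while the generator G tries to minimise V by producing samples that D accepts as real.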

The Naïve Bayes algorithm (Li et al., 2017) is a classification technique based on Bayes' theorem.
Its underlying assumption is that each feature in the dataset is independent of the other features,
given the class. The algorithm is simple and performs well out of the box, even on large datasets.
According to Ernest Kwame Ampomah et al., the combination of the Gaussian Naïve Bayes (GNB)
algorithm with Linear Discriminant Analysis, known as GNB_LDA, outperformed all other GNB models
on three of four evaluation metrics: accuracy, F1-score, and AUC. The GNB model combined with
Min-Max scaling and Principal Component Analysis ranked best on specificity. GNB also performed
better with Min-Max scaling than with standardization. Chia-Cheng Chen et al. compared the
investment performance of four machine learning models, namely ANN, SVM, random forest, and Naïve
Bayes, on a five-year historical dataset (2014-2018) of the Taiwan Stock Exchange (TWSE) Index.
Their findings suggest that the machine learning models outperform the benchmark index in
investment performance. Among the models, artificial neural networks (ANN) and support vector
machines (SVM) performed best, random forest ranked third, and Naïve Bayes ranked last.
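
The combination of Min-Max scaling and Gaussian Naïve Bayes mentioned above can be sketched in a few lines. This is a minimal illustration, not the pipeline of the cited study: the function names, the variance floor `var_floor`, and the sample data are assumptions, and new observations would have to be scaled with the minimum and maximum of the training data.

```python
import math


def min_max_scale(rows):
    """Rescale every feature column to the range [0, 1]."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    highs = [max(c) for c in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0 for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


def fit_gnb(X, y, var_floor=1e-9):
    """Estimate a log-prior and per-feature mean and variance for each class."""
    model = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        n = len(rows)
        means = [sum(c) / n for c in zip(*rows)]
        variances = [
            sum((v - m) ** 2 for v in c) / n + var_floor
            for c, m in zip(zip(*rows), means)
        ]
        model[label] = (math.log(n / len(y)), means, variances)
    return model


def predict_gnb(model, x):
    """Pick the class with the highest log-posterior, treating features as independent."""
    def log_posterior(params):
        log_prior, means, variances = params
        return log_prior + sum(
            -0.5 * math.log(2 * math.pi * v) - (xi - m) ** 2 / (2 * v)
            for xi, m, v in zip(x, means, variances)
        )
    return max(model, key=lambda label: log_posterior(model[label]))


raw = [[1.0, 10.0], [2.0, 12.0], [8.0, 30.0], [9.0, 33.0]]  # hypothetical features
labels = ["down", "down", "up", "up"]
scaled = min_max_scale(raw)
model = fit_gnb(scaled, labels)
print([predict_gnb(model, row) for row in scaled])  # ['down', 'down', 'up', 'up']
```
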

Some of the selected studies use either machine learning or deep learning methods to predict stock
market trends. The algorithms were applied to real-time datasets with various features and then
evaluated on their performance parameters. Table 2 lists the machine learning technique used by
each selected study. Table 2 shows that a significant proportion of the selected studies use neural
network methods. Figure 3 shows the share of each technique.

Brownian motion, also known as the Wiener process, has been studied extensively in physics. The
stochastic model of Brownian motion, originally intended to describe the movement of small particles
in a liquid, has also found applications in option pricing theory. These processes are supported by
rigorous mathematical analysis.


Brownian motion is the random, erratic movement of microscopic particles suspended in a fluid,
caused by constant bombardment by the molecules of the fluid. A real-valued stochastic process
$\{W_t : t \ge 0\}$ is a Brownian motion (Wiener process) under a probability measure $P$ if:
1. For any $t \ge 0$ and $s > 0$, the increment $W_{t+s} - W_t$ is normally distributed with mean
zero and variance $s$.
2. For any $n$ and any $0 \le t_0 \le t_1 \le \dots \le t_n$, the increments
$\{W_{t_r} - W_{t_{r-1}}\}$ are independent.
3. $W_0 = 0$. This is a convention; any other starting point could be chosen.
4. $W_t$ is continuous for all $t \ge 0$.
In essence, this extends the discrete simple random walk to continuous time. The change
$W_{t+dt} - W_t$ over an infinitesimal time interval $dt$ is denoted $dW$ and has mean zero and
variance $dt$. The paths of a Brownian motion are visibly erratic, and the expected length of the
path traced by $W$ over any interval is infinite. This property makes it difficult to apply ordinary
calculus to Brownian motion.
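
The defining properties above translate directly into a simulation: start at zero and add independent normal increments with variance equal to the time step. The sketch below is a minimal illustration; the function name `simulate_wiener`, the step count, and the seed are hypothetical choices.

```python
import math
import random


def simulate_wiener(n_steps, dt, seed=None):
    """Return a discretised Brownian path [W_0, W_dt, W_2dt, ...]."""
    rng = random.Random(seed)
    path = [0.0]  # property 3: W_0 = 0
    for _ in range(n_steps):
        # Properties 1 and 2: independent increments drawn from N(0, dt).
        path.append(path[-1] + rng.gauss(0.0, math.sqrt(dt)))
    return path


path = simulate_wiener(n_steps=10_000, dt=0.01, seed=42)
increments = [b - a for a, b in zip(path, path[1:])]
# The sample variance of the increments should be close to dt.
print(sum(d * d for d in increments) / len(increments))
```

Halving `dt` while keeping the total time fixed makes the simulated path longer rather than smoother, which is a numerical hint of the infinite path length noted above.
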

Table 2 Percentage of each technique used by selected studies

Studies                          Techniques                          Percentage
S2, S3, S4, S9, S12, S15, S22    SVM                                 21
S24, S26                         CNN                                 6
S24, S26                         RNN                                 6
S5                               SVR                                 3
S23                              GAN                                 3
S29                              NB                                  3
S7, S8, S13                      Hybrid approaches/Brownian motion   9


Fig. 3 ML Techniques
(X-axis: percentage of each technique used by selected studies; Y-axis: techniques used in the study)

Question 3: What types of datasets are used in stock market prediction?
The selected studies used a variety of datasets for stock market prediction. According to some of
the studies, several of these datasets are publicly available, and a large share of the selected
studies use publicly available datasets to forecast the stock market. These datasets are commonly
used for classification or forecasting. Table 3 lists the datasets used by the selected studies. It
shows that NASDAQ is the most frequently named dataset for stock prediction and forecasting.

Table 3: Dataset used by selected studies.

Studies   Year   Target                               Dataset values (days)/source
S1        2007   Dow Jones industrial average index   1024
S2        2016   Stock market                         500
S3        2019   Stock market                         NASDAQ
S4        2019   Stock market                         1659/www.moneycontrol.com
S5        2014   Stock market                         www.nseindia.com
S6        2019   Stock market                         NASDAQ
S7        2012   DHAKA stock market                   www.dse.com.hd
S8        2014   Stock market                         OGDCL Pakistan
S9        2002   Stock market                         104/FASM
S10       2016   Stock market                         TFIDM
S11       2003   Stock market                         100/Istanbul Stock Exchange
S12       2005   Stock market forecasting             MCcardy cd mahen
S13       2016   Stock market                         1414/Tata Steel, Cisco
S14       2013   Stock market forecasting             734/Goldman Sachs Inc.
S15       2017   Stock market                         108
S16       2015   Stock market                         1024
S17       2011   Stock market                         360
S18       2018   Stock market                         4203/Korea stock index price
S19       2014   Stock market                         www.finet.hk
S20       2017   Stock market                         38/KOSPI market
S21       2018   Stock market                         2691
S22       2016   Stock market                         Crawler
S23       2019   Stock market                         5000/New York Stock Exchange
S24       2017   Stock market                         1721
S25       2017   Stock market                         NASDAQ
S26       2017   Stock market                         600
S27       2018   Stock market                         500
S28       2016   Stock market                         Dhaka Stock Exchange
S29       2017   Stock market                         CSI300
S30       2011   Stock market                         NASDAQ

Question 4: Has the stock market been predicted using a hybrid approach that combines machine
learning models?
As shown in Figure 2, only three of the selected studies used a hybrid approach to predict the stock
market. Study S3 proposes a hybrid method that combines artificial neural networks (ANN) with an
approximation approach. Study S8 proposes a hybrid method that combines ANN with genetic
algorithms (GA) to improve the performance of GA in stock forecasting. More recently, S13 combined
the discrete wavelet transform, a statistical method, with the ANN machine learning algorithm
(DWT-ANN) to predict stock market trends.

Question 5: What performance metrics are used in stock market prediction?
Various performance metrics are used to evaluate how well machine learning performs at market
trading and forecasting. How an algorithm scores depends on its performance parameters, which in
turn depend on the method and the datasets used. The performance metrics used by the selected
studies are described below:
Accuracy is the metric used to assess a classification model (Pang et al., 2020). Informally,
accuracy is the fraction of the model's predictions that are correct.
The Root Mean Squared Error (RMSE) measures the difference between the values predicted by a
model and the values observed. It was used by J. B. Heaton et al. (2016), where the RMSE is reported
to be close on the training and evaluation datasets.
The Mean Absolute Error (MAE) is a metric for regression (Ummul Khair et al., 2017). The prediction
error is the sum of the absolute differences between the predicted and actual values, divided by the
total number of data points in the dataset. MAE thus measures the average difference between two
continuous variables.
The Mean Square Error (MSE) is used as the loss function in least squares regression (Z. Wang et
al., 2018). It is the sum of the squared differences between the predicted and actual values,
divided by the total number of observations in the dataset. Incorporating relevant events or
sentiment about the stock market may reduce the MSE.
The Mean Absolute Percentage Error (MAPE) can be used to assess the relative reliability of stock
data prediction (Ansari Saleh Ahmar). It is the sum of the individual absolute errors, each divided
by the actual value [E. Guresen, et al., 2011], and it expresses the average deviation from the true
value. Some studies used these performance metrics and their datasets to predict stock market
movements, which are measured in monthly or yearly increments. Figure 4 shows that accuracy is the
performance parameter most often used by the selected studies to assess their models on their
datasets, whereas only 11% of the selected studies used MAPE.
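
The five metrics described above can be written out in a few lines. The following sketch uses the standard textbook definitions; the function names and the sample values are illustrative, and `mape` assumes that no actual value is zero.

```python
import math


def accuracy(actual, predicted):
    """Fraction of class labels predicted correctly."""
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)


def mse(actual, predicted):
    """Mean of the squared differences."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Square root of the MSE, in the same units as the data."""
    return math.sqrt(mse(actual, predicted))


def mae(actual, predicted):
    """Mean of the absolute differences."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute error relative to the actual value, in percent."""
    return 100 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


actual = [100.0, 200.0, 400.0]      # hypothetical closing prices
predicted = [110.0, 190.0, 400.0]   # hypothetical model output
print(mae(actual, predicted), mse(actual, predicted))
print(rmse(actual, predicted), mape(actual, predicted))
print(accuracy(["up", "down", "up", "up"], ["up", "up", "up", "down"]))  # 0.5
```

Accuracy applies to direction labels (classification), whereas MSE, RMSE, MAE, and MAPE apply to predicted price levels (regression).
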


Fig. 4 Performance parameters used by selected studies
(Pie chart; legend: Accuracy, MSE, MAE, RMSE, MAPE; slice labels: 32%, 21%, 20%, 16%, 11%)

Table 4 Journals/conferences in which the selected studies were published


Studies              Conference/Journal                                                    %    Publisher
S1                   Applied Intelligence                                                  3    Springer
S2                   International conference on computer                                  3    IEEE
S3                   International journal of financial studies                            3    MDPI
S4                   Computational intelligence in data mining                             3    Springer
S5, S14, S20, S30    Expert systems with applications                                      12   ScienceDirect
S6                   International conference on information technology and systems        3    Springer
S7                   International conference on computer and information technology       3    IEEE
S8                   International journal of computer science and information security    3    Scopus
S9                   International conference on intelligent data engineering and          3    Springer
                     automated learning
S10                  International conference on ICT in business industry and Government   3    IEEE
S11                  Neural computing and applications                                     3    Springer
S12                  Pakistan multitopic conference                                             IEEE
S13                  Indian journal of science and technology                              3    Scopus
S15                  International conference of electronics, communication and            3    IEEE
                     aerospace technology
S16                  International research journal of engineering and technology          3    Scopus
S17, S22, S23        Most dominant journals/conferences. Procedia computer science         9    ScienceDirect
S18                  Sustainability                                                        3    MDPI
S19                  Neural computing and applications                                     3    Springer
S21                  Journal of supercomputing                                             3    Springer
S24                  International conference of advances in computing,                    3    IEEE
                     communications and informatics
S25                  Multimedia tools and applications                                     3    Springer
S26                  International conference on Computational Intelligence and Virtual    3    IEEE
                     Environments for Measurement Systems and Applications
S27                  International conference on information and communication systems     3    IEEE
S28                  International conference on Electrical, Computer &                    3    IEEE
                     Telecommunication Engineering
S29                  International conference on service systems and service management    3    IEEE

The table shows that IEEE is the most common publisher among the selected studies (10 studies), followed by Springer (8 studies).


5. Conclusion
This article presents a comprehensive analysis of the methods used to predict stock market trends,
drawing on mathematical and machine learning strategies. The objective of this survey is to compare
existing methods with modified approaches, together with the datasets, performance metrics, and
application methods they use, based on an analysis of the top 50 research articles. The techniques
used to predict stock market trends are categorized by machine learning algorithm. To improve
prediction accuracy, many studies have combined several methods. Artificial Neural Networks (ANNs)
and other neural networks (NNs) have become a prevalent approach to stock market forecasting, with
favorable results. Methods can be devised to monitor and track the entire stock market. The main
obstacle to stock market prediction is that prevailing trends cannot be identified from past stock
data alone, because the stock market is also influenced by external factors such as government
policy decisions and consumer sentiment. In future work, we will aim to improve the system by
developing a more reliable and accurate stock market prediction mechanism.

References
1. Fischer T, Krauss C. Deep learning with long short-term memory networks for financial market
predictions. Eur J Oper Res. 2018;270(2):654–69.
2. Malkiel BG, Fama EF. Efficient capital markets: A review of theory and empirical work. The journal
of Finance. 1970;25(2):383–417.
3. Zhou F, Zhang Q, Sornette D, Jiang L. Cascading logistic regression onto gradient boosted
decision trees for forecasting and trading stock indices. Applied Soft Computing. 2019;84:105747.
4. Yan D, Zhou Q, Wang J, Zhang N. Bayesian regularisation neural network based on artificial
intelligence optimisation. Int J Prod Res. 2017;55(8):2266–87.
5. Wang J-J, Wang J-Z, Zhang Z-G, Guo S-P. Stock index forecasting based on a hybrid model.
Omega. 2012;40(6):758–66.
6. Henrique BM, Sobreiro VA, Kimura H. Literature review: machine learning techniques applied to
financial market prediction. Expert Syst Appl. 2019;124:226–51.
7. Zhang Y, Wu L. Stock market prediction of S&P 500 via combination of improved BCO approach
and BP neural network. Expert Syst Appl. 2009;36(5):8849–54.
8. Kotecha K. Predicting stock market index using fusion of machine learning techniques. Expert
Syst Appl. 2015;42(4):2162–72.
9. ZLi J, Financial time series forecasting using twin support vector regression. PLoS ONE.
2019;14(3). pmid:30865670.
10. Krauss C, Do XA, Huck N. Deep neural networks, gradient-boosted trees, random forests:
Statistical arbitrage on the S&P 500. Eur J Oper Res. 2017;259(2):689–702.


11. Khalid Alkhatib. Stock Price Prediction Using K-Nearest Neighbor (kNN) Algorithm. International
Journal of Business, Humanities and Technology. 2013; Vol. 3, No. 3.
12. Lee M-C. Using support vector machine with a hybrid feature selection method to the stock trend
prediction. Expert Syst Appl. 2009;36(8):10896–904.
13. Wu M-C, Lin S-Y, Lin C-H. An effective application of decision tree to stock trading. Expert Syst
Appl. 2006;31(2):270–4.
14. Pai P-F, Lin C-S. A hybrid ARIMA and support vector machines model in stock price forecasting.
Omega. 2005;33(6):497–505.
15. Frances PH, Marches M, Murray A. A hybrid genetic-neural architecture for stock index
forecasting. Information Science. 2005;17(1):3–37.
16. Kim K-j. Financial time series forecasting using support vector machines. Neurocomputing.
2003;55(1–2):307–19.
17. Chen A-S, Leung MT, Daouk H. Application of neural networks to an emerging financial market:
forecasting and trading the Taiwan Stock Index. Computers & Operations Research. 2003;30(6):901–23.
18. Brownstone D. Using percentage accuracy to measure neural network predictions in stock
market movements. Neurocomputing. 1996;10(3):237–50.
19. Bao W, Yue J, Rao Y. A deep learning framework for financial time series using stacked
autoencoders and long-short term memory. PLoS ONE. 2017;12(7):e0180944. pmid:28708865.
20. Liang Q, Rong W, Zhang J, Liu J, Xiong Z, editors. Restricted Boltzmann machine based stock
market trend prediction. International Joint Conference on Neural Networks (IJCNN); 2017: IEEE.
21. Zhang N, Lin A, Shang P. Multidimensional k-nearest neighbor model based on EEMD for
financial time series forecasting. Phys A Stat Mech its Appl. 2017;477:161–73.
22. Qiu J, Wang B, Zhou C. Forecasting stock prices with long-short term memory neural network
based on attention mechanism. PLoS ONE. 2020;15(1). pmid:31899770.
23. Kotecha K. Predicting stock and stock price index movement using Trend Deterministic Data
Preparation and machine learning techniques. Expert Syst Appl. 2015;42(1):259–68.
24. Moghaddam AH, Moghaddam MH, Esfandyari MJJoEF, Science A. Stock market index prediction
using artificial neural network. 2016:89–93.
25. Bessembinder H, Chan K. Market efficiency and the returns to technical analysis. Financ Manag.
1998:5–17.
26. Goo YJ, Chen DH, Chang YW. The application of Japanese candlestick trading strategies in
Taiwan. Investment Management and Financial Innovations. 2007;(4, Iss. 4):49–79.
27. Chen S, Bao S, Zhou Y. The predictive power of Japanese candlestick charting in Chinese stock
market. Phys A Stat Mech its Appl. 2016;457:148–65.
28. Cheng, Li-Chen, Yu-Hsiang Huang, and Mu-En Wu. "Applied attention-based LSTM neural
networks in stock prediction." 2018 IEEE International Conference on Big Data (Big Data). IEEE, 2018.
29. Nguyen, T. H., Shirai, K., and Velcin, J. (2015). Sentiment analysis on social media for stock
movement prediction. Expert Systems with Applications, 42(24):9603–9611.


30. Kim, S. and Kang, M. (2019). Financial series prediction using Attention LSTM. arXiv preprint
arXiv:1902.10877.
31. D. Enke, M. Grauer, N. Mehdiyev, Stock market prediction with multiple regression, fuzzy type-2
clustering, and neural networks, Procedia Comput. Sci. 1 (6) (2011) 201–206.
32. H. Chung, K.S. Shin, Genetic algorithm-optimized long short-term memory network for stock
market prediction, Sustainability 10 (10) (2018) 3765.
33. X. Li, H. Xie, R. Wang, Y. Cai, J. Cao, F. Wang, X. Deng, Empirical analysis: stock market
prediction via extreme learning machine, Neural Comput. Appl. 27 (1) (2016) 67–78.
34. E. Chong, C. Han, F.C. Park, Deep learning networks for stock market analysis and prediction:
Methodology, data representations, and case studies, Expert Syst. Appl. 83 (2017) 187–205.
35. X. Pang, Y. Zhou, P. Wang, W. Lin, V. Chang, An innovative neural network approach for stock
market prediction, J. Supercomput. 76 (3) (2020) 2098–2118.
36. K. Zhang, G. Zhong, J. Dong, S. Wang, Y. Wang, Stock market prediction based on the generative
adversarial network, Procedia Comput. Sci. 147 (2019) 400–406.
37. Yingjun Chen, Yongtao Hao, A feature weighted support vector machine and K-nearest neighbor
algorithm for stock market indices prediction. Expert Systems With Applications 80 (2017) 340–355
38. Fatih Ecer, Sina Ardabili, Shahab S. Band, Amir Mosavi, Training Multilayer Perceptron with
Genetic Algorithms and Particle Swarm Optimization for Modeling Stock Price Index Prediction.
Entropy 2020, 22(11), 1239.
39. J. Li, H. Bu, J. Wu. (2017, June). Sentiment-aware stock market prediction: A deep learning
method. In 2017 international conference on service systems and service management (pp. 1-6). IEEE.
40. E. Guresen, G. Kayakutlu, T.U. Daim, Using artificial neural network models in stock market index
prediction, Expert Syst. Appl. 38 (8) (2011) 10389–10397.
41. K.J. Kim, W.B. Lee, Stock market prediction using artificial NN with optimal feature
transformation, Neural Comput. Appl. 13 (3) (2004) 255–260.
42. Schumacher and Chen, SVM is a machine learning algorithm that can classify a future stock price
direction (rise or drop). ACM Transactions on Information Systems, Volume 27, Issue 2, February 2009.
43. Yanjie. Hu, Juanjuan. Pang, “Financial crisis early warning based on support vector machine,”
In: International Joint Conference on Neural Networks, pp. 2435-2440, 2008.
44. K. Zhang, G. Zhong, J. Dong, S. Wang, Y. Wang, Stock market prediction based on the generative
adversarial network, Procedia Comput. Sci. 147 (2019) 400–406.
45. Xingyu Zhou, Zhisong Pan, Guyu Hu, Siqi Tang, and Cheng Zhao, Stock Market Prediction on
High-Frequency Data Using Generative Adversarial Nets. Mathematical Problems in Engineering Volume
2018.
46. Bate He, Eisuke Kita, Stock Price Prediction by Using Hybrid Sequential Generative Adversarial
Networks. International Conference on Data Mining Workshops (ICDMW), IEEE, 2020.
47. Ernest Kwame Ampomah, Gabriel Nyame, Zhiguang Qin, Prince Clement Addo, Enoch Opanin
Gyamfi, Michael Gyan, Stock Market Prediction with Gaussian Naïve Bayes Machine Learning Algorithm.
Informatica 45 (2021) 243–256.


48. Chia-Cheng Chen, Yi-Sheng Liu, Ting-Hsin Hsu, An Analysis on Investment Performance of
Machine Learning: An Empirical Examination on Taiwan Stock Market. International Journal of
Economics and Financial Issues, 2019, 9(4), 1-10.
49. X. Pang, Y. Zhou, P. Wang, W. Lin, V. Chang, An innovative neural network approach for stock
market prediction, J. Supercomput. 76 (3) (2020) 2098–2118.
50. J. B. Heaton, N. G. Polson, J. H. Witte, Deep learning for finance: deep portfolios, Applied
Stochastic Models in Business and Industry. Wiley 2016.
51. Ummul Khair, Hasanul Fahmi, Sarudin Al Hakim and Robbi Rahim, Forecasting Error Calculation
with Mean Absolute Deviation and Mean Absolute Percentage Error, International Conference on
Information and Communication Technology (IconICT) IOP Publishing, IOP Conf. Series: Journal of
Physics: Conf. Series 930 (2017) 012002.
52. Z. Wang, A. Tan, F. Li, and S.-B. Ho, “Comparisons of learning based methods for stock market
prediction,” in The 4th International Conference on Cloud Computing and Security (ICCCS 2018), 2018.
53. Ansari Saleh Ahmar, Sutte Indicator: A Technical Indicator in Stock Market. International Journal
of Economics and Financial Issues , 2017, 7(2), 223-226.
54. E. Guresen, G. Kayakutlu, T.U. Daim, Using artificial neural network models in stock
market index prediction, Expert Syst. Appl. 38 (8) (2011) 10389–10397.
