Stock Market Prediction Based On Interrelated Time Series Data
Stock Market Prediction Based On Interrelated Time Series Data
Abstract—In this paper, we propose a stock market prediction indicators such as moving average, relative strength index
method based on interrelated time series data. Though there are (RSI), stochastic oscillator and so on. There are many studies
a lot of stock market prediction models, there are few models to predict the stock market by using such data and engineered
which predict a stock by considering other time series data.
Moreover it is difficult to discover which data is interrelated approaches, such as Neural Network [6], Evolutionary Al-
with a predicted stock. Therefore we focus on extracting interre- gorithms [7], Support Vector Machine [8], Neuro-Fuzzy [9],
lationships between the predicted stock and various time series Hidden Markov model [10] and decision tree [11]. Concerning
data, such as other stocks, world stock market indices, foreign the other approaches, there are some studies that research the
exchanges and oil prices. We test our method for predicting interrelationship between stock prices of the predicted stock
the daily up and down changes in the closing value by using
discovered interrelationships, and experimental results show that and other time series data, such as foreign stock, temperature,
our methods can predict stock directions well, especially in the audience rate and so on. Sung and So analyzed association
manufacturing industry. rule for predicting changes in the Korea Compose Stock Price
Keywords—data mining; stock market prediction; Evolution Index based on the time series data of various interrelated
Strategy. world stock market indices [12]. Johan, Huina and Xiaojun
discovered that measurements of collective public mood states
I. I NTRODUCTION derived from large scale Twitter feeds correlate to the value of
These days, many kinds of data are stored and it is easy to the Dow Jones Industrial Average over time. Moreover they
gain access to them. On the other hand we are faced with also find an accuracy of 86.7% in predicting the daily up and
unmanageability of this data. We still don’t know what is down changes in the closing value by using its measurements
the best way to use stored data. Therefore, one of the most [13]. In this approach, however, it is difficult to discover which
important problems is to discover knowledge from these data data is interrelated with the predicted stock.
and make effective use of them. For this reason, studying about Therefore we propose a method that extracts interrelation-
data mining or knowledge discovery is required. In recent ships of changes in price between the predicted stock and
years, it has been possible to analyze huge amounts of data various time series data, such as other stocks, world stock
due to developments in computer technology. There are many market indices, foreign exchanges and oil prices from real data
studies in this field [1], [2]. automatically. We test our method for predicting the daily up
Regarding financial areas, there have been many studies of and down changes in the closing value by using discovered
the stock market prediction or stock investment using data interrelationships.
mining techniques. Indeed, there are many investors using The rest of this paper is organized as follows: Section 2
these techniques all over the world. The important things in discusses the proposed method. Experiments and results are
this area are to discover effective data and to make effective reported in Section 3 while conclusions are the topic of Section
use of the discovered data. 4.
There are two approaches to stock market predictions. One
approach focuses on data on the predicted stock, the other II. P ROPOSED M ETHOD
approach focuses on data aside from the predicted stock.
Concerning the first approach, there are two typical methods. While there are some causes of changes in stock prices,
One is fundamental analysis. This method predicts the stock information about other than the predicted stock should have
market by focusing on financial statements, interest rates, effects on the predicted stock. For example, a stock related
products, management, news and so on. This is used to get to exports is affected by foreign exchanges or foreign stocks.
some insight on whether it is overvalued or undervalued. There Therefore we aim to extract interrelations of changes in stock
are some studies to get some insight from news articles by prices between the predicted stock and various time series
using engineered method [3], [4], [5]. The other is technical data from real data, then predict the stock by using extracted
analysis. This is a type of method which predicts the stock interrelationships. This method is composed of two phases,
market by focusing on previous stock data, usually technical interrelation discovery phase and prediction phase.
…
A variation pattern of referenced time series data is computed
one-on-one with the predicted stock. Figure 1 shows the Fig. 1. Outline of Interrelation discovery phase.
detailed description. How to compute the variation pattern is
as follows.
The variation pattern
1) Quantize referenced time series data: In this study,
we quantize referenced time series data for simplicity of t-4 t-3 t-2 t-1 t
The predicted stock
discovering interrelationships. How to quantize is as follows. 1
# or # 4 0 changes up at time t+1
At first, referenced time series data is converted to rate of 2
change of value:
y(t) − y(t − 1)
C(t) = , (1)
y(t − 1) Fig. 2. Example of a variation pattern.
18
TABLE I Likely interrelated data
PARAMETER SETTINGS FOR ES Dow Jones Average
Dow Jones
Average t-4 t-3 t-2 t-1 t
Parameter Value
(index) # # # # 0
Number of Generations 5,000
Generation alternation model (1+4)ES S&P500
Mutation rate 1/(gene length) (index)
S&P500 FTSE100
FTSE100
(index) t-4 t-3 t-2 t-1 t t-4 t-3 t-2 t-1 t
2 0 2
TABLE II # or # # or or # # # 0
3 1 3
PARAMETER SETS FOR THE VARIATION PATTERN
19
explain why these interrelationships are discovered, we think it
is also important to discover such interrelations because there
may be some indirect interrelationships such as the butterfly
effect. In addition, we have to analyze the reason why these
are discovered because we may obtain new knowledge about
the industrial structure.
IV. C ONCLUSION
In this paper we proposed the methods that extract interre-
lationships of changes in prices between the predicted stock
and various time series data, such as other stocks, world stock
market indices, foreign exchanges and oil prices. Our method
calculates variation patterns which represent how referenced
time series has changed by using Evolution Strategy, then
extracts likely interrelated data, and predicts the stock by
using obtained interrelationships. We tested our method on
the stock market prediction. Experimental results showed
that our method can predict stock directions well, especially
in the manufacturing industry. Obtained LIDs are not only
stocks which is stockholder or has involved in the business
cooperation with the predicted stock but also stocks which
seem not to interrelated with the predicted stock.
In our future works, we will work on analyzing the interre-
lation between the predicted stock and other time series output
as LID, and develop a trading strategy using our method.
R EFERENCES
[1] Alex A, Freitas, “Data Mining and Knowledge Discovery with Evolu-
tionary Algorithms”, Springer, 2002.
[2] Ashish Ghosh, Lakhmi C. Jain, “Evolutionary Computation in Data
Mining”, Springer, 2004.
[3] Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Wai Lam, “Stock Prediction: In-
tegrating Text Mining Approach using Real-Time News”, Computational
Intelligence for Financial Engineering, pp. 395 - 402, 2003.
[4] Di WU, Gabriel Pui Cheong FUNG, Jeffrey Xu YU, Qi PAN, “Stock
prediction: an event-driven approach based on bursty keywords”, FRON-
TIERS OF COMPUTER SCIENCE IN CHINA,Vol. 3, No. 2, pp. 145-
157, 2009.
[5] Robert P.Schumaker, Hsinchun Chen, “A Discrete Stock Price Prediction
Engine Based on Financial News”. Computer, Vol. 43, No. 1 ,pp. 51-56,
2010.
[6] Kyoung-jae Kim, “Artificial neural networks with evolutionary instance
selection for financial forecasting” , Expert System with Applications Vol.
30, pp. 519-526, 2006.
[7] Pawel B. Myszkowski, Lukasz Rachwalski, “Trading rule discovery on
Warsaw Stock Exchange using coevolutionary algorithms”, Proceedings
of the International Multiconference on Computer Science and Informa-
tion Technology, Vol. 4, pp. 81-88, 2009.
[8] Wei Huang, Yoshiteru Nakamori, Shou-Yang Wang, “Forecasting stock
market movement direction with support vector machine”, Computers &
Operations Research, Vol. 32, No. 10, pp. 2513-2522, 2005.
[9] George S.Atsalakis, Kimon P. Valavanis, “Forecasting stock market short-
term trends using a neuro-fuzzy based methodology”, Expert System with
Applications, Vol.36, pp. 10696-10707, 2009.
[10] Md. Rafiul Hassen, Baikunth Nath, Michael Kirley, “A fusion model
of HMM, ANN and GA for stock market forecasting” , Expert Systems
with applications, Vol. 33, No. 1, pp. 171-180, 2007.
[11] Jar-Kong Wang, Shu-Hui Chan, “Stock market trading rule discovery
using two-layer bias decision tree”, Expert System with Applications,
Vol.30, pp. 605-611, 2006.
[12] Sung Hoon Na, So Young Sohn, “Forecasting changes in Korea Com-
posite Stock Price Index(KOSPI) using association rule”, Expert System
with Applications, Vol.38, pp. 9046-9049, 2011.
[13] Johan Bollen, Huina Mao, Xiaojun Zeng, “Twitter mood predicts the
stock market”, Journal of Computational Science, vol.2, pp. 1-8, 2011.
20
TABLE III
E XPERIMENTAL SETTING
Referenced time series data 1371 stocks listed on the Tokyo Stock Exchange,
TOPIX, Dow Jones Average, S&P500, FTSE100, SSE Composite Index, Hong Kong Hang Seng Index,
yen-dollar exchange rate, yen-GBP exchange rate, yen-Euro exchange rate, yen-AUD exchange rate,
yen-NZD exchange rate, yen-CAD exchange rate, yen-CHF exchange rate, Euro-dollar exchange rate,
WTI
Predicted stock Nippon Meat Packers, Inc. (food) COSMO OIL Co, Ltd. (petroleum and coral)
(Industry) ITOCHU Corporation. (wholesale) Mitsubishi Corporation. (wholesale)
Hitachi, Ltd. (electric) NEC corporation. (electric) Sony Corporation. (electric)
Nikon Corporation. (precision equipment) MITSUBISHI HEAVY INDUSTRIES, LTD. (machine)
Nissan Motor Co, Ltd. (automobile) TOYOTA MOTOR CORPORATION. (automobile)
Training periods 4 January 1999 to 29 December 2006
Test periods 4 January 2007 to 30 December 2008
TABLE IV
NUMERICAL EVALUATIONS IN PREDICTING THE DAILY UP
TABLE V
NUMERICAL EVALUATIONS IN PREDICTING THE DAILY DOWN
21