SSRN-id2822105 Retail Investor Tracking
SSRN-id2822105 Retail Investor Tracking
Abstract
We provide an easy method to identify purchases and sales initiated by retail investors
using recent, widely available U.S. equity transactions data. Individual stocks with net buying by
retail investors outperform stocks with negative imbalances by approximately 10 basis points over
the following week. Less than half of the predictive power of marketable retail order imbalances
is attributable to order flow persistence; contrarian trading (a proxy for liquidity provision) and
public news sentiment explain little of the remaining predictability. There is suggestive (but only
suggestive) evidence that retail marketable orders contain firm-level information that is not yet
incorporated into prices.
*Ekkehart Boehmer is with Singapore Management University, LKC School of Business. Charles
M. Jones is with Columbia Business School. Xiaoyan Zhang and Xinran Zhang are with Tsinghua
University, PBC School of Finance. We thank Kevin Crotty, Larry Glosten, Frank Hatheway, Eric
Kelley, Jamil Nazarali, Jeff Pontiff, Meijun Qian, Tarun Ramadorai, Paul Tetlock; seminar
participants at Brigham Young University, University of California at Riverside, the Frankfurt
School, Humboldt University Berlin, INSEAD, Purdue University, Rice University, Singapore
Management University, Southern Methodist University, Tsinghua University, University of Utah,
National Singapore University, Nanyang Technology University; and conference audiences at the
2017 SFS Cavalcade, 2017 German Finance Association, 2017 ABFER, the 2017 CICF and the
2018 AFA for their helpful comments. The contents of this publication are solely the responsibility
of the authors. The authors have no conflicts of interest to disclose.
Many researchers have concluded that retail equity investors are generally uninformed and,
if anything, make systematic mistakes when selecting equity investments (see for example Barber
and Odean (2000,2008)). However, some more recent evidence, including Kaniel, Saar, and
Titman (2008), Barber, Odean and Zhu (2009), Kaniel, Liu, Saar, and Titman (2012), Kelley and
Tetlock (2013), Fong, Gallagher, and Lee (2014), and Barrot, Kaniel, and Sraer (2016), suggests
otherwise. These studies show that retail investors’ trading can predict future stock returns.
Unfortunately, most existing studies of retail order flow are based on proprietary datasets with
relatively small subsets of overall retail order flow. For example, Barber and Odean (2000)
analyze data from a single U.S. retail brokerage firm, while Barber and Odean (2008) examine
individual investor trading data from a total of three different retail or discount brokerage firms.
Kelley and Tetlock (2013) use data from a single U.S. wholesaler; Fong, Gallagher, and Lee (2014)
analyze data from the Australian Securities Exchange (ASX); and Barrot, Kaniel, and Sraer (2016)
use data from a single French brokerage firm. Kaniel, Saar, and Titman (2008), Kaniel et al. (2012),
and Boehmer, Jones, and Zhang (2008) use proprietary account-type data from the NYSE during
the early 2000s. During that sample period, only a small number of brokerages sent their retail
order flow to the NYSE. As a result, the NYSE’s market share of overall retail order activity was
(and has remained) quite small.
In existing work, many researchers use trade size as a proxy for retail order flow. Before
the spread of computer algorithms that “slice and dice” large institutional parent orders into a
sequence of small child orders, small trades were much more likely to come from retail customers,
while institutions were likely behind the larger reported trades. For example, Lee and
Radhakrishna (2000) use a $20,000 cutoff point to separate smaller individual trades from larger
institutional trades. More recently, Campbell, Ramadorai, and Schwartz (2009) effectively allow
these cutoff points to vary through a regression approach that is calibrated to observed quarterly
changes in institutional ownership, but they maintain the same basic assumption that small trades
Given the current automated and algorithm-driven market structure, researchers need an
easily implementable method to isolate retail order flow. We introduce such a measure in this
paper. As one of our main contributions, we show that our measure can identify a broad swath of
marketable retail order flow. Our measure builds on the fact that, due to regulatory restrictions in
the U.S. and the resulting institutional arrangements, retail order flow, but not institutional order
flow, can receive price improvement, measured in small fractions of a cent per share. We use this
fact to identify marketable retail price-improved orders from the TAQ data, a publicly available
data set that contains all transactions for stocks listed on a national exchange in the U.S. We do
this by identifying trades that execute at share prices with fractional pennies. Most such price-
improved transactions take place off-exchange and are reported to a Trade Reporting Facility
(TRF). Using this TRF data, we identify transactions as retail buys if the transaction price is
slightly below the round penny, and retail sells if the transaction price is slightly above the round
penny. This approach isolates retail investors’ marketable orders from institutional ones, because
institutional trades cannot receive this type of fractional penny price improvement.1 We discuss
our approach in greater detail in the data section. Notice that our retail order flow measure only
includes marketable orders, but not limit orders. Overall, we believe that our method of retail trade
identification is conservative, and we cross-validate the accuracy of our approach using a small
sample of NASDAQ TRF audit trail data.
We analyze retail marketable order flow from the U.S. equity market for six years between
January 2010 and December 2015. We find that retail investors are slightly contrarian at a weekly
horizon, and that the cross-section of weekly marketable retail order imbalances predicts the cross-
1
In contrast, institutional trades often occur at the midpoint of the prevailing bid and ask prices. If the bid-ask spread
is an odd number of cents, the resulting midpoint trade price ends in a half-penny. Many of these midpoint trades take
place on crossing networks and are reported to the TRF. Thus, trades at or near a half-penny are likely to be from
institutions and are not assigned to the retail category.
Given the nature of our data, our work is also related to recent studies of off-exchange
trading in the U.S. For instance, Kwan, Masulis, and McInish (2015) study the competition
between traditional stock exchanges and new dark trading venues and find that the minimum
pricing increment regulation (typically one penny) drives orders to dark pools and limits the
competitiveness of the exchanges. Battalio, Corwin, and Jennings (2016) examine make-take fees
and how brokers route order flow, and suggest that current order routing practices may not
maximize the quality of limit order execution. Menkveld, Yueshen, and Zhu (2017) directly
investigate the pecking order of trading venues in dark pools and document that investors
strategically put low-cost-low-immediacy orders in front of high-cost-high-immediacy orders.
Compared to the earlier literature on retail orders and studies of off-exchange trades, we
make three main contributions. First and most importantly, we propose a novel methodology for
identifying and signing marketable retail trades using publicly available data with substantial
coverage. Second, our empirical results show that the marketable retail trades we identify can
predict the cross-section of future stock returns. Third, we analyze the nature of the predictive
3
Two studies, Kaniel, Saar, and Titman (2008) and Kelley and Tetlock (2013), study similar
questions, and are closely related to our research, but with different data and different
interpretations. For instance, using proprietary data from the NYSE between January 2000 and
December 2003, Kaniel, Saar, and Titman (2008) document that retail order flows can predict
stock returns. In addition, Kaniel, Saar and Titman (2008) examine the contemporaneous relation
between their retail order flows and stock returns. They find that the contemporaneous return is
significantly positive for stocks that retail investors sell, and negative for stocks that they buy,
which is consistent with a liquidity provision interpretation and inconsistent with the information
story. We follow their approach using our new marketable retail order flow variables. We are able
to replicate the predictive relation between retail order flow and future stock returns. However, our
results for the contemporaneous relation are different: the contemporaneous return is significantly
negative for stocks that retail investors sell, and positive for stocks that they buy, when they use
marketable orders. Our findings are more in line with an information interpretation than a liquidity
provision interpretation.
Kelley and Tetlock (2013) obtain data from a major retail wholesaler between February
2003 and December 2007. Their data allow them to separate retail orders into market orders and
limit orders. They find that retail market orders and limit orders can both predict future stock
returns, but for different reasons. The aggressive market orders can correctly predict future news,
suggesting these trades are informed, while the passive limit orders are contrarian, consistent with
the liquidity provision hypothesis. Our marketable retail order flow measure only identifies market
orders, and for these marketable orders we follow their tests and replicate their results. In addition,
we decompose our marketable retail order imbalance into components related to order flow
persistence, contrarian trading, public news and a residual, which potentially contains non-public
information. The decomposition exercise shows that public news barely contributes to the
predictive power of the marketable retail trades, and the residual part is more important. With more
recent data and wider coverage, our study provides interesting new findings, which complement
the original studies by Kaniel, Saar, and Titman (2008) and Kelley and Tetlock (2013).
4
From TAQ trade data, we keep only trades that occur off-exchange, with exchange code
“D.” We merge these TAQ data with stock returns and accounting data from CRSP and Compustat,
respectively. We include only the common stocks with share code 10 or 11 (which excludes mainly
ETFs, ADRs, and REITs) listed on the NYSE, NYSE MKT (formerly the Amex), and NASDAQ.
We remove low-priced stocks by requiring the minimum stock price to be $1 at the previous
month-end.
Our sample period covers January 3, 2010 to December 31, 2015. Data on subpenny price
improvement actually extend back to 2005. In Appendix Figure A1, we show the time series from
January 2005 (the start of Regulation NMS, which established the current regulatory framework
for subpenny price improvement in the U.S.) to December 2017. We choose to study the period
from 2010 to 2015 for two reasons. First, for the first few years of Reg NMS, there is a strong
upward trend in the number of subpenny trades, possibly because an increasing number of
brokerage firms were adopting the practice of providing fractional cents of price improvement to
retail investors via internalization or wholesalers. The upward trend disappears and stabilizes after
2009. Second, from 2016 to September 2018, the SEC adopted a tick size pilot program (TSPP)
that affects tick size and brokers’ ability to provide price improvement for many stocks, which
likely affects the prevalence of subpenny price improvements unevenly in the cross section.
Therefore, our main results are focused on the middle part of the data, 2010 to 2015. For each day,
we have an average of around 3,000 firms included in the sample.
In the U.S., most marketable equity orders initiated by retail investors do not take place on
one of the dozen or so registered exchanges. Instead, these retail orders are typically executed
either by wholesalers or via internalization, meaning that orders are filled from a broker’s own
inventory. Orders executed by wholesalers or through internalization must be publicly reported;
they are usually reported to a FINRA Trade Reporting Facility (TRF), which provides broker-
dealers with a mechanism through which to report transactions that take place off-exchange. These
TRF executions are then included in the TAQ “consolidated tape” of all reported transactions with
exchange code “D.” Many orders that are internalized or executed by wholesalers are given a small
price improvement relative to the National Best Bid or Offer (NBBO).2 For instance, wholesalers
are willing to provide a small price improvement to induce the retail trader’s broker to route the
order to the wholesaler. Internalizers, who are subject to Regulation 606T, need to show that they
execute their clients’ orders optimally, and thus also have incentives to provide price improvement
to their clients. This price improvement is typically only a small fraction of a cent. Common price
improvement amounts are 0.01, 0.1, and 0.2 cent.
Brokerage firms in the U.S. are required to provide regular summary statistics in SEC Rule
606 filings about their order routing practices for non-directed orders. A directed order instructs
the broker to execute an order on a given exchange or trading venue; a non-directed order gives
the broker discretion regarding the execution venue. The vast majority of retail orders are non-
directed. For example, Charles Schwab reports that 98.6% of their security orders during the
second quarter of 2016 were non-directed orders. The corresponding figure for TD Ameritrade is
99%. According to the Rule 606 filings by these two retail brokerage firms, more than 90% of
these orders receive price improvement.
Our communications with a major retail wholesaler and a major exchange suggest that
these types of price improvement are not a feature of institutional order executions, as institutional
orders are almost never internalized or sold to wholesalers. Instead, their orders are sent to
exchanges and dark pools, and Regulation NMS prohibits these orders from having subpenny limit
2
As a rough estimate of the frequency of subpenny price improvement, we find in a NASDAQ subsample used for
robustness tests (introduced in Section I.D) that 60% of trades on “retail” venues receive subpenny price
improvements, with 14% reported at the halfpenny and 46% taking place at a different subpenny. For subpenny trades
that do not execute at half pennies and constitute the focus of our study, more than 99% are reported to a TRF with
exchange code “D.”
As discussed above, Reg NMS requires that limit orders be priced at round pennies, so our
approach will by definition identify only marketable retail orders.4 The 606 filings by brokerage
firms are also partitioned into market and limit orders, which allows us to gauge the relative
3
According to SEC litigation releases (see, for example, https://www.sec.gov/litigation/admin/2015/33-9697.pdf and
https://www.sec.gov/litigation/admin/2016/33-10013.pdf), at least two dark pool operators (Credit Suisse and UBS)
were accused of violations of Regulation NMS in accepting, ranking, and executing orders based on subpenny prices.
These alleged violations occurred through mid-2011 and were eventually settled. A back-of-the-envelope calculation
suggests that these violations could have accounted for about 0.5% of total share volume during this part of our sample
period. Since these dark pools cater to institutions, including high-frequency traders, our identification of retail flows
using subpenny trades could be “contaminated” during this period, and we cannot identify using public TAQ data
which trades are from the affected dark pools. Given the potential contamination accounts for a small part of overall
subpenny trades for our sample period, our main analysis still focuses on the full sample period from 2010 to 2015.
We conduct a subsample analysis for 2012 to 2015 in a robustness check in Section III.B, and the results are similar
to those for the whole sample. We thank the associate editor for pointing this out.
4
Marketable orders by definition demand immediacy, and according to Kelley and Tetlock (2013), market orders are
more informed than are limit orders. Thus, any predictive power from retail market orders is likely to be stronger than
that of retail limit orders and overall retail orders.
C. Summary Statistics
Table I presents summary statistics on the marketable retail orders identified by our
method. We pool observations across stocks and days, and compute the mean, standard deviation,
median, and 25th and 75th percentiles. Our sample comprises over 4.6 million stock-day
observations. For the number of shares traded per day (vol), the mean share volume is around 1.23
million, and the standard deviation is about 6.85 million shares. The average stock has 5,917 trades
each day (trd). These numbers suggest that the average trade size over this sample period is about
200 shares. Our identified marketable retail investor activity represents only a small part of the
overall trading volume. The identified average daily buy volume from marketable retail orders
(mrbvol) is 42,481 shares, and the average daily sell volume from marketable retail orders (mrsvol)
is 42,430 shares. Throughout the paper, we use “mr” to represent “marketable retail”. Thus, we
identify an average of 84,911 shares per stock-day traded by marketable retail orders, about 6.91%
of the average total shares traded each day. The average number of buy trades from marketable
retail orders (mrbtrd) each day is 110, and the average number of sell trades from marketable retail
orders (mrstrd) each day is 108. Thus, the total number of identified trades per stock-day from
marketable retail orders is 218, around 3.68% of the total number of trades. Interestingly, the buy
volumes closely match the sell volumes, and the number of buy trades match the number of sell
trades, both indicating that many marketable retail trades offset each other. In terms of average
share volumes and number of trades, there is slightly more buying than selling by marketable retail
trades over our sample period.
5
One might wonder whether the market or marketable orders can be offset, in aggregate, by limit or non-marketable
orders in the opposite direction. This is possible. Unfortunately, we do not have data to directly check this possibility.
Figure 1 provides further statistics on the overall properties of our identified marketable
retail trades. Panel A presents trade sizes in dollars. For each marketable retail trade, we compute
the trade size in dollars by multiplying the number of executed shares by the transaction price. For
each year of our sample, we compute the 25th percentile, the median, and the 75th percentile of
marketable retail trade size. The median marketable retail trade size is around $8,000, and the
interquartile range is mostly between $2,000 and $25,000. Panel B reports the distribution of
subpenny prices. We separate all trades into 12 groups or bins. We separate out trades that take
place at a round penny or half penny; the other bins are each 0.1 cent wide. We pool the sample
across days and stocks, and we report the number of shares reported in the different subpenny
buckets. Not surprisingly, most of the share volume occurs at round and half-pennies, with average
stock-day share volumes of around 27,000 and 7,000, respectively. The next most prevalent
occurrence, averaging around 3,000 shares per day per stock, is a subpenny price within 0.1 cent
of a round penny. Other subpenny bins are less prevalent, with most averaging around 1,000 shares
per stock per day.
𝑚𝑟𝑏𝑣𝑜𝑙(𝑖, 𝑡) − 𝑚𝑟𝑠𝑣𝑜𝑙(𝑖, 𝑡)
𝑚𝑟𝑜𝑖𝑏𝑣𝑜𝑙(𝑖, 𝑡) = , (1)
𝑚𝑟𝑏𝑣𝑜𝑙(𝑖, 𝑡) + 𝑚𝑟𝑠𝑣𝑜𝑙(𝑖, 𝑡)
𝑚𝑟𝑏𝑡𝑟𝑑(𝑖, 𝑡) − 𝑚𝑟𝑠𝑡𝑟𝑑(𝑖, 𝑡)
𝑚𝑟𝑜𝑖𝑏𝑡𝑟𝑑(𝑖, 𝑡) = , (2)
𝑚𝑟𝑏𝑡𝑟𝑑(𝑖, 𝑡) + 𝑚𝑟𝑠𝑡𝑟𝑑(𝑖, 𝑡)
𝑜𝑑𝑑𝑚𝑟𝑏𝑡𝑟𝑑(𝑖, 𝑡) − 𝑜𝑑𝑑𝑚𝑟𝑠𝑡𝑟𝑑(𝑖, 𝑡)
𝑜𝑑𝑑𝑚𝑟𝑜𝑖𝑏𝑡𝑟𝑑(𝑖, 𝑡) = . (4)
𝑜𝑑𝑑𝑚𝑟𝑏𝑡𝑟𝑑(𝑖, 𝑡) + 𝑜𝑑𝑑𝑚𝑟𝑠𝑡𝑟𝑑(𝑖, 𝑡)
The first two measures are calculated using marketable retail round lot executions between January
2010 and December 2015 and by aggregating round lot and odd lot executions thereafter, while
the last two measures are calculated using only marketable retail odd lots, and thus these latter
measures begin in December 2013 instead of December 2010.
Summary statistics on the marketable retail order imbalance measures are reported at the
bottom of Table I. Across all stocks and all days, the mean order imbalance for share volume,
mroibvol, is -0.038, with a standard deviation of 0.464, and the mean order imbalance for trade,
mroibtrd, is -0.032, with a standard deviation of 0.437. The correlation between mroibtrd and
mroibvol is around 85%. Our later discussions mostly focus on mroibvol, but the results using
these two measures are quite similar given the high correlation between the two. Overall, the order
imbalance measured in shares is close to zero on average, but with sells slightly more prevalent
than buys, which is consistent with findings in Kaniel, Saar, and Titman (2008). More importantly,
the sizable standard deviation measures show that there is substantial cross-sectional variation in
the activity levels and trading direction of retail investors. The odd lot order imbalance measures
exhibit similar patterns.
In Figure 2, we plot the time-series of the cross-sectional mean, median, and 25th and 75th
percentiles of the marketable retail order imbalance measures over the six-year sample period.
Across all four order imbalance measures, the means and medians are all close to zero, while the
25th percentiles are mostly around -0.3, and the 75th percentiles are mostly around 0.2. There are
no obvious time trends or structural breaks in the time-series observations.
10
Our main data source is TAQ, which does not provide direct information on the direction
of the trade or the identity of the traders. We validate our marketable retail order imbalance
algorithm through a small sample of proprietary NASDAQ data. 6 The same dataset is used in
Menkveld, Yueshen, and Zhu (2017), who provide more details about the data. The NASDAQ
sample covers all intraday transactions on its TRF for 117 stocks for the month of October 2010.
The 117 stocks are chosen from different size groups, but they are generally larger than a typical
firm in TAQ.7
For each trade, the NASDAQ TRF data provide a trade direction indicator: “buy,” “sell,”
or “cross.” Our algorithm identifies all subpenny trades with subpenny prices between 0.61 and
0.99 cents inclusive as “buy” trades. We separate all subpenny “true buy” trades (as indicated in
the NASDAQ TRF data) with price below $100 into two categories: “identified buy” and
“identified sell (false identification).” We falsely identify 1.37% of all subpenny “true buy” trades
as “sell.” Similarly, our algorithm identifies all subpenny trades with subpenny prices between
0.01 and 0.39 basis points as “sell” trades. In this case, we falsely identify 2.12% of all subpenny
“true sell” trades with price below $100 as “buy.” If we put identified marketable retail “buys” and
“sells” together, for stocks with a share price below $100, our subpenny approach matches the
NASDAQ TRF’s correct buy/sell sign 98.2% of the time, while the standard Lee and Ready (1991)
trade-signing algorithm gets the trade sign right 96.7% of the time. Overall, we find our algorithm
is quite accurate for trade direction identification.
6
We thank NASDAQ for generously providing the data.
7
The smallest market cap of the 117 NASDAQ firms is 257 million dollars, while our sample’s 40 percentile market
cap is merely 243 million dollars.
11
Our main measure in this article is order imbalance. The correlation between our order
imbalance measure and the one calculated using the “DarkRetail” order imbalance for the 117
stocks is 0.70. This correlation is less than one for two main reasons: First, our order imbalance
measure includes trades printed on the competing NYSE TRF, while the NASDAQ TRF dataset
does not; second, our order imbalance measure includes some subpenny trades from the
“DarkNMid” and “DarkMid” venues, in addition to those in “DarkRetail.” Finally, some
marketable retail market orders do not receive price improvement or receive a full half-cent of
price improvement. We do not sign these trades or include them in our marketable retail sample,
because we cannot be sure that we have the correct trade direction. Nevertheless, the high
correlation between our marketable retail order imbalance measure and the actual NASDAQ
“DarkRetail” venue data strongly suggests that our order imbalance measures closely reflect the
true marketable buy and sell activities of retail investors.8
In the data section, we measure order imbalances at the daily level to minimize the amount
of aggregation. For our main empirical results, we focus on weekly horizons to reduce the impact
8
Kelley and Tetlock (2013) compute retail order imbalance measures using data from one large wholesaler. As part
of a conference discussion of our paper, Kelley computed the retail order imbalance measure for 2007 using our
algorithm and found that the correlations between our measure and their measure ranged between 0.345 and 0.507
when defining marketable retail flow using different subpenny ranges. For instance, 0.345 is the correlation of our
measure and their measure using the number of shares for subpenny prices in the (0, 0.4) and (0.6, 1) cent intervals,
while 0.507 is the correlation between our measure and their measure using the number of trades for subpenny prices
at 0.99 cents and 0.01 cents. These correlations should be less than one because their flow comes from only one
wholesaler, while our measure comes from TRF, which covers nearly all retail order executions. We are grateful to
Eric Kelley for computing and sharing these calculations with us.
12
We start by investigating the properties of the order imbalance measures in Section II.A.
In Section II.B, we examine whether past marketable retail order imbalance measures can predict
future stock returns using Fama-MacBeth regressions and long-short portfolios. In Section II.C,
we compare alternative hypotheses for the predictive power of marketable retail order imbalances
for future stock returns. In Section II.D, we explore the nature of the information contained in
marketable retail flow by linking it to Thomson Reuters News Analytics data.
We start our empirical investigation by examining what drives the trading of retail
investors. Specifically, we examine how retail investors’ marketable order flow is related to past
order flow and past returns. To allow maximal time-series flexibility and focus on cross-sectional
patterns, we adopt the Fama and MacBeth (1973) two-stage estimation. At the first stage, for each
day, we estimate the following predictive regression:
where we use various horizons of past weekly returns, 𝑟𝑒𝑡(𝑖, 𝑤 − 1) and various control variables
from the past to explain the order imbalance measure, 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤), for firm i during week w. The
first-stage estimation generates a daily overlapping time-series of weekly
coefficients, {𝑏0(𝑤), 𝑏1(𝑤)′ , 𝑏2(𝑤)′ }. At the second stage, we conduct statistical inference using
the time-series of the coefficients. Because we use overlapping daily frequency data for weekly
13
To explain the order imbalance over week w, from day 1 to day 5, we first include its own
lag, the past week order imbalance measure from day -4 to day 0, or mroib(w-1). We also include
past returns over three different horizons: the previous week (ret(w-1)), the previous month (ret(m-
1)), and the previous six months (ret(m-7,m-2)). For control variables, we use log market cap, log
book-to-market ratio, turnover (share volume over shares outstanding), and daily return volatility,
all computed from the previous month’s data.
The results are presented in Table II, with regressions I and II explaining the order
imbalance measured in shares, and regressions III and IV explaining the order imbalance measured
using the number of trades. In the first regression, the order imbalance using share volume,
mroibvol, has a positive correlation with its own lag, with a highly significant coefficient of 0.22,
indicating that directional marketable retail trading activity is somewhat persistent over successive
weeks, as suggested in Chordia and Subrahmanyam (2004). The coefficients for the past week,
past one month, and past six-month returns are -0.9481, -0.2778, and -0.0586, respectively. All
three coefficients are negative and highly significant, which shows that marketable retail order
flows are contrarian for horizons ranging between one week and six months. The control variables
indicate that investors tend to buy more aggressively in larger firms, growth firms, and firms with
higher turnover and higher volatility. All coefficients are highly significant. The average adjusted
R2 from the first stage cross sectional estimation is about 6%.
We use different return and order imbalance measures for regressions II, III, and IV. At the
weekly horizon, the results are similar across methods of computing returns and order imbalances.
Henceforth, we focus our discussion on bid-ask midpoint returns, which do not have bid-ask
bounce and thus exhibit a smaller degree of time-series predictability than returns based on
transaction prices. We also include CRSP returns in the results for the sake of completeness and
robustness.
The negative coefficients on past returns match some of the findings in the literature. For
example, marketable retail order flows are found to be contrarian in Kaniel, Saar, and Titman
9
The optimal lag number is chosen using BIC.
14
Appendix Figure A2 Panel B plots the correlation of daily order imbalance with past
returns for the previous one to 80 trading days. Similar to Kelley and Tetlock (2013), the
correlation between the current marketable retail order imbalance and the previous-day return is
positive, indicating a momentum trading pattern on average. However, at lags between two days
and 30 days, our average correlation is slightly negative. Our results are thus consistent with the
findings of Kelley and Tetlock (2013) at short horizons and with those of other researchers at
longer horizons. 10,11
Our results in Table II reveal two important drivers affecting weekly order imbalance. The
first is its own lag, which indicates that the marketable retail order imbalance measures are
persistent. The second are past returns, and we show a mixed result of both contrarian and
momentum patterns, with the contrarian pattern prevailing at weekly horizons.
B. Predicting Future Stock Returns with Marketable Retail Order Imbalance Measures
B.1. Methodology and Overall Predictive Power
Can marketable retail investors’ activity provide useful information for future stock
returns? In this section, we examine the predictive power of our order imbalance measures using
Fama-MacBeth regressions as follows:
10
Lee et al. (2004) also find a mixed pattern of contrarian and momentum, using the overall market order imbalance.
They find evidence that, after up-market moves, overall trades tend to follow a momentum pattern, while overall trades
tend to be contrarian after down-market moves. We provide similar results using daily retail order flows in Internet
Appendix Table AI Panel A. When we use weekly retail order flows, the patterns both become contrarian, as shown
in Internet Appendix Table AI Panel B.
11 In addition, we examine how firm-level order imbalance measures are related to firm fundamentals, as in Chordia,
Huh, and Subrahmanyam (2007). The results shown in Internet Appendix Table AI Panel E indicate that retail order
imbalances are positively related to firm size, number of analysts, analyst dispersion, and leverage and negatively
related to past return, firm age, and book-to-market ratio.
15
where we use the marketable retail order imbalance measure from the previous week,
𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1), and various control variables to predict the next week’s stock return, 𝑟𝑒𝑡(𝑖, 𝑤),
for firm i during week w. As in the previous section, because we use overlapping daily frequency
data for weekly order imbalances and return measures, the standard errors of the time-series are
adjusted using Newey-West (1987) with five lags. If past marketable retail order imbalances
predict the cross-section of future returns in the same direction, we expect coefficient c1 to be
significantly positive. For example, if retail buys dominate retail sells for a particular stock during
a particular week, a positive c1 means that that stock’s future return tends to be above the cross-
sectional average. There are several possible explanations, none of which are mutually exclusive.
There could be persistence in marketable retail order imbalances, marketable retail orders could
be compensated for providing liquidity, or retail traders may have valuable information that is
incorporated into stock prices at some point after they trade. We examine these hypotheses in
Section C. If coefficient c1 turns out to be significantly negative, again there are multiple possible
explanations. These retail investors might be making systematic trading mistakes, or retail
investors might be mainly “liquidity” or “noise” traders who end up trading at temporarily
disadvantageous prices because rational but risk-averse market makers require compensation for
trading with them. Either way, a negative c1 would constitute a drag on the overall returns of these
retail investors. Finally, if c1 is insignificantly different from zero, we cannot reject the null that
our measure of marketable retail order flow is uninformative on average about the cross-section of
future stock returns.
We again include past returns as control variables, using three different horizons: the
previous week, the previous month, and the previous six months (from month m-7 to month m-2).
In addition, we include log market cap, log book-to-market ratio, turnover, and daily return
volatility, all from the previous month. We report the estimation results in Table III. In regression
I, we use the order imbalance based on share volume, mroibvol, to predict the next week’s return
based on bid-ask midpoints. The coefficient on mroibvol is 0.0009, with a t-statistic of 15.60. The
positive and significant coefficient shows that, if retail investors buy more than they sell in a given
week, the return on that stock in the next week is significantly higher. In terms of magnitude, we
16
For the control variables, we observe negative coefficients on the previous week’s return,
which indicates weekly return reversals, and positive coefficients on the other longer-horizon
returns, which indicates momentum. Size, book-to-market, turnover, and volatility all carry the
expected signs, and most are not statistically significant. This result also confirms that the
predictability we find is not simply a manifestation of some other size, book-to-market, turnover,
or volatility anomaly. The average adjusted R2’s from the first stage cross-sectional estimation are
mostly around 3.85%.
Our sample includes on average more than 3,000 firms each day. Is the predictive power
of marketable retail order imbalances restricted to a particular type of firm? Do informed retail
investors have preferences for particular types of firms? We investigate these questions by
analyzing various firm subgroups in this section. We first sort all firms into three groups based on
a firm or stock characteristic observed at the end of the previous month. Then, we estimate equation
(6) within each characteristic group. That is, we allow all coefficients in equation (6) to be different
within each group, which allows substantial flexibility in the possible predictive relationship across
these different groups.
To save space, we include only the results on weekly returns that are computed using the
end-of-day bid-ask average price. We first sort all stocks into three different size groups based on
market capitalization: small, medium, and large. The results are reported in Panel A of Table IV.
In the left panel, we report coefficients on mroibvol, the order imbalance computed from share
17
In Panel B of Table IV, we sort all firms into three groups based on the previous month-
end share price. In the left panel, moving from the lowest share-price firms to the highest, the
coefficient on mroibvol decreases from 0.0014 to 0.0002, and the t-statistics go from 13.34 to 3.23.
In terms of magnitude, the interquartile weekly return difference is 20.5 basis points (10.66% per
year) for the lowest-price firms and only 2.0 basis points for the firms with the highest share price
(1.04% per year). The results are similar for specifications using mroibtrd, reported in the right
panel, with slightly lower coefficients and t-statistics. The pattern is clear: the predictive power of
marketable retail order imbalances for future returns is stronger for low-price firms.
Next, we sort all firms based on previous-month turnover, which may be a proxy for
liquidity. In the left panel, moving from the tercile of low trading activity to the firms with more
turnover, the coefficient on mroibvol decreases from 0.0011 to 0.0007, and the t-statistic decreases
from 15.60 to 4.98. In terms of magnitude, the interquartile weekly return difference is 20.5 basis
points (10.66% per year) for the firms with the lowest turnover and 6.5 basis points for the firms
with the highest turnover (3.38% per year). For specifications based on mroibtrd in the right panel,
the results are similar, with slightly lower coefficients and t-statistics. Overall, marketable retail
order imbalances better predict returns for firms with lower trading activity.
In this section, we find that the predictive power of the marketable retail order imbalance
is significant and positive for all but one subgroup, which shows that the predictive power is not
driven by special subgroups. However, a clear cross-sectional pattern for the predictive power is
observed. The predictive power of the marketable retail order imbalance is much stronger for small
firms and firms with low share-price and low liquidity.
18
That is, we use one week of order imbalance measures to predict k-week ahead returns, ret(i,w+k),
with k=1 to 12. To observe the decay of the predictive power of marketable retail order imbalance,
the return to be predicted is a weekly return over a one-week period, rather than a cumulative
return over n weeks, which is an average over all weeks involved. If marketable retail order
imbalances have only short-lived predictive power for future returns, we might observe the
coefficient c1 decrease to zero within a couple of weeks. Alternatively, if the marketable retail
order imbalance has longer predictive power, the coefficient c1 should remain statistically
significant for a longer period. In our empirical estimation, we choose k ranging from two to 12
weeks.
We report the results in Table V, with results based on bid-ask average returns in Panel A,
and those based on closing transaction prices in Panel B. In Panel A, when we extend the window
from two to 12 weeks, the coefficient on mroibvol monotonically decreases from 0.00055 to
0.00007, and the coefficient on mroibtrd gradually decreases from 0.00048 to 0.00006. The
coefficients are statistically significant up to six or eight weeks ahead. The results in Panel B are
similar. There is no evidence of price reversals at any horizon. Thus, our marketable retail order
imbalances potentially capture either longer-lived information or slow information diffusion.
One might wonder whether we can use marketable retail order imbalances as a signal to
form a profitable trading strategy. As discussed earlier, both mroibvol and mroibtrd are publicly
available information. In this section, we form quintile portfolios based on the previous week’s
average order imbalance and then hold the quintile portfolios for up to 12 weeks. If retail investors
on average can select the right stocks to buy and sell, then firms with higher or positive marketable
19
Table VI reports long-short portfolio returns, where we buy the stocks in the highest order
imbalance quintile and short the stocks in the lowest order imbalance quintile each day using the
previous 5-day marketable retail order flow measures, and hold them for the next few weeks.
Portfolio returns are value-weighted using the previous month-end market cap. Because the
holding period can be as long as 12 weeks, we report both the raw and risk-adjusted returns using
the Fama-French three-factor model. Given the usage of overlapping data, we adjust the standard
errors of the portfolio return time-series using Hansen and Hodrick (1980) standard errors with the
corresponding number of lags.12
In Panel A, the long-short strategy is based on the previous week’s mroibvol, and we report
bid-ask average returns. Over a one-week horizon, the long-short portfolio return is 0.092%, or
4.78% per year annualized. The t-statistic is 2.66. Risk adjustment using the Fama-French three-
factor model does not make much difference: the weekly Fama-French alpha for the long-short
portfolio is 0.084%, with a t-statistic of 2.43. When we increase the holding horizon to 12 weeks,
the mean return becomes 0.588%, with a t-statistic of 2.09. The general pattern is that holding-
period returns (and alphas) continue to grow at a decreasing rate over time. We observe no
evidence of a reversal in returns. In terms of statistical significance, the t-statistics are significant
or marginally significant up to the 12-week horizon. These results are slightly weaker than those
of the Fama-MacBeth regressions, mainly because, in this section, we value-weight the portfolio
returns across firms, while the Fama-MacBeth approach implicitly weights each stock equally.
When we restrict portfolio formation to one of the three market cap groups, the one-week
return is 0.403% (or 20.96% per year) with a t-statistic of 9.16 for the smallest firms, while the
one-week return is 0.067% (or 3.48% per year) with a t-statistic of 1.78 for the largest firms. When
the holding horizon becomes longer, the return on the long-short strategy is still significant and
12
For example, for a one-week holding period portfolio, we use Hansen and Hodrick (1980) with 5 lags. For a two-
week holding period, we use Hansen and Hodrick (1980) with 10 lags, and so on.
20
To make sure that the statistical significance in return differences is not driven by particular
sample periods, we provide a time-series plot of the return differences between quintiles 1 and 5
in Figure 3, where the portfolios are sorted on mroibvol and the holding period is one week. Over
our six-year sample period, we observe both time-variation in the return differences and positive
and negative spikes. However, most data points are positive, and the positive returns are not driven
by particular sample subperiods.
C. Alternative Hypotheses for Marketable Retail Order Imbalance Predictive Power for
Future Returns
The predictive power of marketable retail order imbalances for future stock returns is
consistent with three hypotheses. First, as in Chordia and Subrahmanyam (2004), order flows are
persistent, and, as the retail buying/selling pressure is persistent, this could lead directly to the
predictability of future returns. Second, as in Kaniel, Saar, and Titman (2008), these retail traders
are contrarian at weekly horizons, and since their contrarian trading provides liquidity to the
market, their trades might positively predict future returns. Third, as in Kelley and Tetlock (2013),
retail investors, especially the aggressive investors using market orders, may have valuable
information about the firm, and thus their trading could correctly predict the direction of future
returns. The above three hypotheses are not exclusive. In Section II.C.1, we conduct a simple
decomposition to separate alternative hypotheses. In Section II.C.2, we provide more evidence
regarding the liquidity provision hypothesis.
13
We also conduct a rough calculation that includes transaction costs. Frazzini, Israel and Moskowitz (2018) state
that a reasonable estimate of the one-way transaction cost on value-weighted US stocks is about 12 basis points for
the period January 2006 to June 2016. To be conservative, we assume that for each rebalance, we change 100% of the
positions. That is, each rebalance we incur a 2*12 bps = 24 bps rebalance cost. For instance, for a weekly rebalance
or 1 week holding period, each year’s transaction cost would be 52 rebalances * 2 * 12bps = 1248 bps. After this
drastic transaction cost adjustment, the mean returns and alphas remain positive and significant for the small firms
over all holding horizons. For the medium and big firms, the mean returns and alphas stay positive for longer holding
periods, but they are mostly insignificant.
21
To distinguish among these alternative hypotheses for the predictive relation between
previous period marketable retail order imbalance and next period stock return as in equation (6),
we adopt a two-step decomposition procedure. In the first step, we decompose the independent
variable in equation (6), the previous week’s marketable retail order imbalance mroib(w-1), into
three components, with the following cross-sectional regressions for each week w-1:
̂ (𝑤 − 1), 𝑑1
For each week w-1, we obtain the time-series of coefficients, {𝑑0 ̂ (𝑤 − 1), 𝑑2
̂ (𝑤 −
1)′}. Next, we calculate the three components of the mroib(i,w-1) as follows:
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 = 𝑑1
𝑚𝑟𝑜𝑖𝑏 ̂ (𝑤 − 1)𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 2),
𝑖,𝑤−1
̂ 𝑖,𝑤−1
𝑚𝑟𝑜𝑖𝑏 𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛 ̂ (𝑤 − 1)′ 𝑟𝑒𝑡(𝑖, 𝑤 − 2),
= 𝑑2 (9)
̂ 𝑖,𝑤−1
𝑚𝑟𝑜𝑖𝑏 𝑜𝑡ℎ𝑒𝑟
= 𝑢4 ̂ (𝑤 − 1).
̂ (𝑖, 𝑤 − 1) + 𝑑0
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 + 𝑚𝑟𝑜𝑖𝑏
𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) = 𝑚𝑟𝑜𝑖𝑏 ̂ 𝑖,𝑤−1
𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛 ̂ 𝑖,𝑤−1
+ 𝑚𝑟𝑜𝑖𝑏 𝑜𝑡ℎ𝑒𝑟
. (10)
𝑖,𝑤−1
That is, we denote the part of 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) related to the past order imbalance as the
“persistence,” which is related to the price pressure hypothesis. The part related to past returns
over different horizons is denoted as “contrarian,” which relates to the liquidity provision
hypothesis. After we take out predictability due to “persistence” and “contrarian” trading, we
denote the residual part as “other,” which potentially contains other relevant information about
future returns. Note that this empirical decomposition is an identity. When we add up these three
components, by definition we obtain the explanatory variable 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) in our basic
predictive regression in equation (6).
At the second stage, we replace the 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) in equation (6) by its three
components, and we estimate the following regression using the Fama-MaBeth methodology:
22
Since we decompose the original order imbalance measure 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) into three parts,
related to order flow persistence, a contrarian trading pattern, and the residual, the coefficient
estimates in equation (11) reveal how each component of 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) helps to predict future
stock returns. The advantage of the two stage decomposition approach is that it includes
components of 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) from alternative hypotheses in a unified and internally consistent
empirical framework. The caveat of this approach is that without a structural model, interpreting
the results may be more difficult. In particular, we must make empirical assumptions on proxies
for the persistence and contrarian components. These empirical assumptions seem to us to be
reasonable, but we still need to be cautious that the interpretation of the results depends on the
validity of our empirical assumptions.
We report the decomposition results in Table VII. Panel A presents the first-stage
estimation as in equation (8), which is quite similar to those reported in Table II. Take the first
regression as an example. The order imbalance measure, mroibvol, has a highly significant and
positive coefficient on its own lag at 0.22, which indicates order persistence. In terms of past
returns, the coefficients for the past week, past month, and past six-month returns are -0.9286, -
0.2029 and -0.0267, respectively, all implying contrarian trading patterns.
After we decompose the previous week’s order imbalance into “persistence,” “contrarian,”
and “other,” we include them together to predict future stock returns, as in equation (11). In the
first regression, we use the past week’s mroibvol to predict future bid-ask return. The coefficient
estimate on mroib(persistence) is 0.0027, with a t-statistic of 8.75, which implies that price
pressure significantly and positively contributes to the predictive power of the marketable retail
flow. The coefficient estimate on mroib(contrarian) is -0.0044, and it is insignificantly different
from zero, implying that we cannot reject the null hypothesis that the contrarian component does
not contribute significantly to the predictive power of marketable retail order imbalances. Finally,
23
14
We also try to include the past order imbalance as a control variable for the second stage estimation. We cannot
directly include mroib(w-1) or mroib(w-2), because it will create collinearity issues. Therefore, here we include either
mroib(w-3) or mroib(m-1) to control for past mroib. These results are presented in Appendix Table AII. We also
consider using the contemporaneous return in equation (8) rather than lag returns, and these results are reported in
Appendix Table AIII. No matter which specification we use, the main results stay quite similar to those in Table VII.
15
All three components of the marketable order imbalance measure in equation (11) are measured at week w-1 so that
we are using this identity to decompose the predictive relation between mroib(w-1) and ret(w) in equation (6). If we
are willing to depart from this predictive decomposition framework, we can examine other relationships. For example,
we might want to examine the contemporaneous relation between mroib(w) and ret(w), and use the three fitted values
from week w rather than from week w – 1 . In this case, the first stage estimation for week w becomes,
𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤) = 𝑑0(𝑤) + 𝑑1(𝑤)𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) + 𝑑2(𝑤)′ 𝑟𝑒𝑡(𝑖, 𝑤 − 1) + 𝑢4(𝑖, 𝑤). (8’)
These are period-by-period cross-sectional regressions, which means that when we estimate coefficients
̂ (𝑤), 𝑑1
{𝑑0 ̂ (𝑤), 𝑑2 ̂ (𝑤)′} in equation (8’), they are estimated using information from both week w and week w – 1.
From this first stage estimation, we can define the relevant persistence estimate as:
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 = 𝑑1
𝑚𝑟𝑜𝑖𝑏 ̂ (𝑤)𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1),
𝑖,𝑤
̂ 𝑖,𝑤
𝑚𝑟𝑜𝑖𝑏 𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛 ̂ (𝑤)′ 𝑟𝑒𝑡(𝑖, 𝑤 − 1),
= 𝑑2 (9’)
̂ 𝑖,𝑤
𝑚𝑟𝑜𝑖𝑏 𝑜𝑡ℎ𝑒𝑟
= 𝑢4 ̂ (𝑤).
̂ (𝑖, 𝑤) + 𝑑0
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 + 𝑚𝑟𝑜𝑖𝑏
𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤) = 𝑚𝑟𝑜𝑖𝑏 ̂ 𝑖,𝑤𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛 ̂ 𝑖,𝑤
+ 𝑚𝑟𝑜𝑖𝑏 𝑜𝑡ℎ𝑒𝑟
. (10’)
𝑖,𝑤
̂ ̂ ̂
Notice that coefficients {𝑑0(𝑤), 𝑑1(𝑤), 𝑑2(𝑤)′} are estimated using information from week w, and thus
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 , 𝑚𝑟𝑜𝑖𝑏
𝑚𝑟𝑜𝑖𝑏 ̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 , and 𝑚𝑟𝑜𝑖𝑏
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 all use information from week w. Then the second stage
𝑖,𝑤 𝑖,𝑤 𝑖,𝑤
estimation for the contemporaneous relation between return and marketable retail order imbalance becomes
𝑟𝑒𝑡(𝑖, 𝑤) = 𝑒0(𝑤) + 𝑒1(𝑤)𝑚𝑟𝑜𝑖𝑏 ̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 + 𝑒2(𝑤)𝑚𝑟𝑜𝑖𝑏 ̂ 𝑖,𝑤𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛 ̂ 𝑖,𝑤
+ 𝑒3(𝑤)𝑚𝑟𝑜𝑖𝑏 𝑜𝑡ℎ𝑒𝑟
+
𝑖,𝑤
𝑒4(𝑤)′ 𝑐𝑜𝑛𝑡𝑟𝑜𝑙𝑠(𝑖, 𝑤 − 1) + 𝑢5(𝑖, 𝑤). (11’)
In comparison with equation (11), equation (11’) gives us something closer to an estimate of the contemporaneous
relation between components of 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤) and 𝑟𝑒𝑡(𝑖, 𝑤), rather than a predictive relation between 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 −
1) and 𝑟𝑒𝑡(𝑖, 𝑤). We present the estimation results in Appendix Table AIV. Appendix Table AIV Panel A reports the
estimation results of equation (8’), and the results are quite similar to those in Table II and Panel A of Table VII.
Appendix Table AIV Panel B shows the estimation results for equation (11’). Appendix Table AIV Panel B shows
the estimation results for equation (11’). For instance, in regression I, the coefficient on mroib(w, persistence) is
0.0045 with a t-statistic of 14.26, while the coefficient on mroib(w-1, persistence) in Table VII Panel B is 0.0027 with
a t-statistic of 8.75. That is, the coefficient on past order persistence becomes larger and more significant in equation
24
The liquidity provision hypothesis receives substantial attention in the existing literature,
so here we take a closer look at this hypothesis. Kaniel, Saar, and Titman (2008) argue that retail
investors’ contrarian trading provides liquidity to the market, and this leads to the positive
predictive power of past marketable retail order imbalance for future stock returns. Therefore, in
equation (8), we use the part of the marketable retail order imbalance related to past returns,
mroib(contrarian), as a proxy for the “liquidity provision” hypothesis. Now our results in Table
VII show that the contrarian component of marketable retail order flow cannot significantly predict
future stock returns. Does this finding completely rule out the “liquidity provision” hypothesis for
the predictive power of marketable retail order flow? We are afraid not. As mentioned in our earlier
discussion of our approach’s caveat: we can only rule out the liquidity provision hypothesis, under
the assumption that the contrarian trading pattern captured by mroib(contrarian) is a perfect proxy
for the liquidity provision hypothesis. This seems to us to be a reasonable assumption, but as far
as we can tell it cannot be directly confirmed by any data that we observe.17 In this subsection, we
provide more results regarding the liquidity provision hypothesis through different approaches
beyond the predictive regression.
(11’) than in equation (11), indicating that contemporaneous price pressure (proxied by lag order imbalance) is more
important than lagged price pressure in equation (11). The coefficient on mroib(w, contrarian) stays insignificant. The
coefficient on mroib(w, other) is 0.0006 with a t-statistic of 5.07, while the coefficient on mroib(w-1, other) is 0.0008
with a t-statistic of 14.47 in Table VII Panel B. The residual component remains significant but becomes slightly
smaller in this case.
16
To be more specific, the retail order imbalance has a low autocorrelation coefficient between 10-20%, but the
positive autocorrelation lasts for a long period. Here the persistence refers to the long horizon rather than the
magnitude.
17
For example, recent studies, such as Arif, Ben-Rephael and Lee (2016), and Chakrabarty, Moulton, and Trzcinka
(2017), show that directional trading by active funds is highly persistent and price destabilizing. If the retail trades
provide liquidity to these active funds, then liquidity provision can also go through the persistence channel, rather than
the contrarian channel.
25
The main results of Kaniel, Saar and Titman (2008) are reported in their Table III, which
contains three main findings. First, the stocks the retail investors sell during the portfolio
construction week (week 0), the intense selling group, experience significantly positive excess
return, before week 0; while the stocks the retail investors buy during week 0, the intense buying
group, experience negative excess returns. This is a typical contrarian trading pattern of selling
winners and buying losers. In Panel A of Table VIII, the first row contains the firms intensely sold
by the retail investors using marketable orders, and the mean excess return in the 20 days prior to
the selling week is 0.67%. The bottom row contains the firms intensely bought by retail investors
using marketable orders, and the mean excess return on these stocks in the 20 days prior to the
selling week is -1.29%. Both numbers are highly significant, and confirm Kaniel, Saar and
Titman’s first finding.
The second finding of Kaniel, Saar and Titman (2008) is that after retail investors buy or
sell, the stocks the retail investors sell during week 0, the intense selling group, experience negative
excess returns, while the stocks the retail investors buy during week 0, the intense buying group,
experience positive excess returns. This shows that retail trading can predict returns in the correct
direction. In Panel A of Table VIII, we find that firms intensely sold by retail investors using
marketable orders (in the first row) experience a mean excess return in the 20 days after the selling
18
We thank an anonymous referee for this suggestion.
26
Finally, for the contemporaneous relation over week 0, Kaniel, Saar, and Titman (2008)
find that the contemporaneous excess return is significantly positive for stocks retail investors sell,
and negative for stocks they buy. Since the return signs are opposite of the retail trading direction,
they interpret this finding in favor of the liquidity provision hypothesis. From the column of k=0
in Table VIII Panel A, however, we find that for firms intensely sold by retail investors using
marketable orders, the contemporaneous return is significantly negative at -0.24% with a t-statistic
of -5.30. For the intensely bought firms in the bottom row, the contemporaneous return is
significantly positive at 0.11% with a t-statistic of 2.69. Our findings show consistent, rather than
opposite, signs between contemporaneous marketable retail trading and return direction, which
does not line up with the liquidity provision hypothesis proposed in Kaniel, Saar, and Titman
(2008).
What might cause the differences in our results? It might come from differences in the
retail order imbalance variable, the sample period or coverage. Our main variable comes from the
marketable retail order flows, while Kaniel, Saar and Titman (2008) use retail order imbalance
from both marketable and non-marketable order flows. Between the marketable and non-
marketable orders, it is likely that the marketable orders are more aggressive. In terms of the
sample period, the Kaniel, Saar and Titman (2008) sample is January 2000 through December
2003, and our sample is January 2010 through December 2015, which are about ten years apart.
Coverage wise, Kaniel, Saar and Titman (2008)’s sample is from NYSE’s Consolidated Equity
Audit Data (CAUD) which contains only retail trades that are executed on that exchange. During
the Kaniel, Saar and Titman (2008) sample period, only a small number of brokerages sent their
retail order flow to the NYSE. As a result, the NYSE’s market share of overall retail order activity
was (and has remained) quite small. In comparison, our sample is from TAQ which contains all
off-exchange and nearly all retail marketable orders. To summarize, the liquidity provision
hypothesis receives at most mixed support in our data sample.
27
D.1 Marketable Retail Order Imbalance and Future Returns across News Topics
We obtain news data from Thomson Reuters News Analytics (TRNA), which contains
prominent public news articles for a broad set of firms starting from 2003. TRNA provides key
information about each news item, such as the ticker, the time stamp of the news story, the news
topics the story belongs to, and sentiment scores for each article. News topics are grouped into five
categories: cross market, general news, economy, equities, and money/debt. Each category
contains several news subtopics, and we collect 58 such subtopics in our sample. The sentiment
score measures the probabilities of the article being positive, negative, or neutral, computed using
Thomson Reuters’ proprietary algorithm. We compute a net sentiment score as the difference
between the positive and negative sentiment score for each stock each day. The news data are
available from January 2010 to December 2014, which covers most of our main sample. We use
tickers to match the news data with our marketable retail order imbalance data, generating a
merged sample of 3,854,813 stock-day observations.
We first provide some simple statistics for the relation among news, returns, and
marketable retail order flow. To examine whether our measure of public news can predict future
stock returns, we estimate the following Fama-MacBeth regression:
Here, variable 𝑠𝑒𝑛𝑡(𝑖, 𝑤 − 1) is the average TRNA net sentiment score for firm i during week w-
1, calculated by averaging non-missing news sentiment for firm i within week w-1. The results are
reported in regression I and II of Table IX Panel A. In regression I, the coefficient of past week
28
To better understand how marketable retail order imbalances are related to public news, we
next estimate the contemporaneous relation between the two using the following Fama-MacBeth
specification:
We find that the current week’s marketable retail order imbalance is significantly and positively
related to the same-week public news sentiment in 10 out of the 58 subtopics. To save space, we
present the coefficient estimates for the 10 cases in Panel B of Table IX. These 10 subtopics
represent about 38% of total news days, and they mostly contain firm-level news. For instance, for
the subtopic “RESF” (results forecast) in the news type “equities”, the coefficient g1 is 0.0054,
with a significant t-statistic of 3.90, indicating that the marketable retail order imbalance has a
positive and significant contemporaneous relation with news related to forecasts of company
results. Out of the ten subtopics, four are from the category of “money and debt”, and three are
from the category of “equity”, with the highest two significant t-statistics for the subtopics “results
forecast” and “debt rating news.” Interestingly, the marketable retail order imbalances are never
statistically significantly correlated with the “economy” type of news. This finding implies that
marketable retail investors may have valuable information at the firm level rather than at the
market level. This is confirmed in some of our later results (Section III.A), where we find that
retail investors cannot reliably predict future market-wide returns.
In Appendix Table AVI, we also examine whether retail order imbalance can directly
predict our measure of public news. We find the coefficient signs to be mixed. In fact, there are
several cases where marketable retail order flows significantly predict future public news
sentiment with negative coefficients. This does not contradict our earlier results of a positive and
29
The above results show that marketable retail order imbalances are associated with some
types of contemporaneous public news, particularly firm-level news. In this subsection, we probe
deeper into the fraction of marketable retail order flows’ predictive power that is associated with
these public news releases, because it is also possible that marketable retail traders possess and
trade on non-public information that eventually makes its way into prices, but not via an
identifiable public news release.
Here, variable 𝑠𝑒𝑛𝑡(𝑖, 𝑤 − 1) is the average TRNA net sentiment score for firm i during week w-
1, which we use to capture information in contemporaneous public news releases. After we obtain
̂ (𝑤 − 1), ℎ1
the time-series of coefficients, {ℎ0 ̂ (𝑤 − 1), ℎ2
̂ (𝑤 − 1)′, ℎ3
̂ (𝑤 − 1)}, we define the
following terms:
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 = ℎ1
𝑚𝑟𝑜𝑖𝑏 ̂ (𝑤 − 1)𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 2),
𝑖,𝑤−1
̂ 𝑖,𝑤−1
𝑚𝑟𝑜𝑖𝑏 𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛 ̂ (𝑤 − 1)′ 𝑟𝑒𝑡(𝑖, 𝑤 − 2),
= ℎ2
(15)
̂ 𝑝𝑢𝑏𝑙𝑖𝑐𝑛𝑒𝑤𝑠
𝑚𝑟𝑜𝑖𝑏 ̂ (𝑤 − 1)𝑠𝑒𝑛𝑡(𝑖, 𝑤 − 1),
= ℎ3
𝑖,𝑤−1
̂ 𝑖,𝑤−1
𝑚𝑟𝑜𝑖𝑏 𝑜𝑡ℎ𝑒𝑟
= 𝑢8 ̂ (𝑤 − 1).
̂ (𝑖, 𝑤 − 1) + ℎ0
19
To give some context for why retail order flow does not show significant predictive power for future public
sentiment in our sample, it could be that the public news we observe is noisy; it could be that the information the retail
investors have does not warrant a specific news story; it could be that the public news is published further into the
future than the horizons we examine here; or another possibility is that retail investors incorporate other useful public
news into their trading, such as the SeekingAlpha posts studied by Farrell et al. (2020).
30
At the second stage, we estimate the following regression using the Fama-Macbeth
methodology, which is parallel to equation (6):
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 + 𝑗2(𝑤)𝑚𝑟𝑜𝑖𝑏
𝑟𝑒𝑡(𝑖, 𝑤) = 𝑗0(𝑤) + 𝑗1(𝑤)𝑚𝑟𝑜𝑖𝑏 ̂ 𝑖,𝑤−1
𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛
+
𝑖,𝑤−1
̂ 𝑝𝑢𝑏𝑙𝑖𝑐𝑛𝑒𝑤𝑠 + 𝑗4(𝑤)𝑚𝑟𝑜𝑖𝑏
𝑗3(𝑤)𝑚𝑟𝑜𝑖𝑏 ̂ 𝑖,𝑤−1
𝑜𝑡ℎ𝑒𝑟
+ 𝑗5(𝑤)′𝑐𝑜𝑛𝑡𝑟𝑜𝑙𝑠(𝑖, 𝑤 − 1) + 𝑢9(𝑖, 𝑤). (16)
𝑖,𝑤−1
Since we decompose the original order imbalance measure 𝑚𝑟𝑜𝑖𝑏(𝑖, 𝑤 − 1) into four parts,
related to persistence, contrarian trading pattern, public information, and the residual, the
coefficients in equation (16) reveal how each component helps to predict future stock returns.
Notice that for the first stage estimation, the public news component is derived from a
contemporaneous relation between the current news and current marketable retail order flow,
rather than past news. From the perspective of the empirical design, we can link the marketable
retail order imbalance with the past, contemporaneous, or future public news, but the
interpretations would be different. If we use future public news, the interpretation would be
whether and how marketable retail order flow “anticipates” future public news. If we use past
public news, the interpretation would be that previously “incorporated” public news can be a
component of the marketable retail order imbalance. When we use contemporaneous public news
sentiment, we interpret the related part of marketable retail order flow as contemporaneously
“processed” public news. Here we choose not to use future public news, because if we project
̂ 𝑝𝑢𝑏𝑙𝑖𝑐𝑛𝑒𝑤𝑠 would capture news from week w,
mroib(w-1) on sent(w) at the first stage, then 𝑚𝑟𝑜𝑖𝑏𝑖,𝑤
and would have a mechanical correlation with ret(w), which is the dependent variable in the second
stage estimation and the regression is no longer predictive. We also choose not to use past public
news, because we would like to maximize the explanatory power of public news for marketable
31
Table X Panel A provides results for the first-stage decomposition. The patterns of how
past marketable retail order imbalance and past returns affect the current order imbalance are very
similar to those in Table II. The coefficient on the contemporaneous public news sentiment ranges
between 0.0249 and 0.0305, all with t-statistics higher than 10. This clearly indicates that more
positive news is associated with more contemporaneous purchases by marketable retail investors.
The average adjusted R2 for the first stage estimation are mostly between 5.49% and 8.59%.
Panel B of Table X reports the results on the second-stage decomposition. From the top
̂ 𝑝𝑒𝑟𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑒 are positive and highly significant, and the coefficients
half, the coefficients on 𝑚𝑟𝑜𝑖𝑏𝑖,𝑤−1
̂ 𝑖,𝑤−1
on 𝑚𝑟𝑜𝑖𝑏 𝑐𝑜𝑛𝑡𝑟𝑎𝑟𝑖𝑎𝑛
are mostly insignificant, similar to the findings in Table VII. The coefficients
̂ 𝑝𝑢𝑏𝑙𝑖𝑐𝑛𝑒𝑤𝑠 , are also all insignificant,
on the public news components of order imbalance, 𝑚𝑟𝑜𝑖𝑏𝑖,𝑤−1
indicating that the contemporaneous public news component of marketable retail order imbalances
does not help to predict future returns significantly. In contrast, the “other” component of the
marketable retail order imbalance measure is always positive and significant in the regressions.
For instance, in the first regression, it has a coefficient of 0.0008 with a highly significant t-statistic
of 13.98. This result is consistent with the hypothesis that marketable retail investors trade on
information that is not incorporated into prices via the public news releases we measure. The
bottom half panel shows that when we move from the 25th percentile firm to the 75th firm, the
“other” component of the marketable retail order flow accounts for 0.07% to 0.10% of weekly
return differences, which is more than half of the return difference that the marketable retail order
imbalance can predict overall. These results suggest that either public news is noisy, the predictive
power of marketable retail investors’ order imbalance is not related to an identifiable public news
release, or the retail order imbalance is only related to public news releases in the more distant
future.
Returning to the interesting question raised at the beginning of this subsection, how can
retail investors using marketable orders, with far fewer resources than institutions, get the upper
hand in discovering non-public information? Here we offer two possible explanations.
32
Second, our data only contains marketable retail market orders, but not retail limit orders.
Retail limit orders could have opposite information, which would at least partially offset our
findings for marketable retail market orders. In addition, it is not clear that the counterparties to
marketable retail orders are necessarily “better informed” institutional investors. For example,
Chakrabarty, Moulton, and Trzcinka (2017) document the presence of uninformed “short-term”
institutional investors as a non-trivial part of the market.
33
If marketable retail order imbalances can predict future stock returns in the cross section,
retail investors using marketable orders may also be able to predict aggregate market moves. To
investigate this possibility, we aggregate marketable retail order imbalances across all firms to
predict aggregate stock market returns. We estimate the following equation:
where mkt(w+1,w+k) is the future k-week cumulative market return from week w+1 to week w+k,
and aggmroib(w) is the current aggregated marketable retail order imbalance measure for week w.
We compute aggmroib using either value-weighted or equal-weighted mroibvol or mroibtrd
measures. The results are shown in Table XI Panel A. They are the same regardless of the
weighting scheme or order imbalance measure: there is no evidence that marketable retail order
flows can reliably predict future market returns.
Our approach can also be used to identify the marketable retail order flow in exchange-
traded funds (ETFs). In Table XI Panel B, we examine marketable retail order flow in a large cross
section of ETFs over the same time period. In cross-sectional predictive regressions of the form in
equation (6), the coefficient is mostly around or below one basis point, which is much smaller than
the comparable coefficients shown in Table III, and the t-statistics are mostly insignificant. This
result suggests that marketable retail order flows cannot predict sector returns or overall equity
market returns. To separate sector-oriented information from broader market-wide information,
we select the six largest ETFs that focus on the overall U.S. equity market by tracking
comprehensive U.S. equity indexes: SPY, IVV, VTI, VOO, IWM, and IWB. The results are
reported in the last row of Panel B. Consistent with the market timing results in Panel A, we find
little evidence that marketable retail order flow can predict future returns on broad equity market
ETFs.
B. Subsample Analysis
34
To be more specific, we re-estimate the key results in Table III, for subsamples 2010-2012
and 2013-2015. For comparison, in Table III, we find that the marketable retail order flow can
predict future stock return, with a regression coefficient of 0.0009, a t-statistic of 15.60, and an
interquartile weekly return difference of 10.89 bps. In Table XI Panel C, for the subsample of
2010-2012, the coefficient on marketable retail order is 0.0010, with a t-statistic of 11.52, and an
interquartile weekly return difference of 12.13bps. For the subsample of 2013-2015, the coefficient
on retail order becomes 0.0009, with a t-statistic of 10.57, and an interquartile weekly return
difference of 9.74 bps. We also test whether the coefficients from the two subperiods are
significantly different from each other. For the mroib(w-1) coefficients in regression I and II, the
difference is 0.0001, with a t-statistic of 1.85; for the coefficients in regression III and IV, the
difference is 0.0001, with a t-statistic of 0.67. Therefore, we cannot reject the hypothesis that the
coefficients from these two subperiods are the same.
We draw two observations from this exercise. First, the predictive power of retail order
flow for the “cleaner” second subperiod is significant, which shows that our results are robust.
Second, we cannot reject the hypothesis that the relevant coefficients from these two subperiods
are the same. In addition, even during the earlier subperiod, we believe that fewer than 10% of all
subpenny trades are misclassified as retail. For these reasons, we choose to use the whole 2010-
2015 sample period to conduct our main analysis.
C. Market Conditions
Barrot, Kaniel, and Srear (2016) find that marketable retail trades contain more information
when markets are volatile, specifically when the VIX option-implied volatility index is high. Their
sample spans from 2002 to 2010, during which the VIX experiences dramatic changes. In contrast,
35
We re-estimate equation (6) for the high- and low-VIX subsamples. The results are
presented in Panel D of Table XI. Comparing the low- and high-VIX regimes, we find that the
coefficient on mroibvol is quite similar, yet the t-statistic is higher when VIX is low than when it
is high. This result might not be surprising, given that the volatility of all variables increases when
VIX is high. Overall, the predictive power in both high- and low-VIX regimes is positive and
significant.
D. Odd Lots
In this section, we investigate the behavior of odd lot marketable retail trades over the post-
December 2013 period when odd lot transactions are reported to the consolidated tape. Can odd
lot marketable retail order flow predict future firm-level returns? We estimate regression (6) using
odd lot marketable retail order imbalances and present the results in Table XI Panel E. Both
coefficients are positive but not statistically significant. In Appendix Table AVII, we find that
daily odd lot order imbalance measures can significantly predict returns for the next trading day
but not at longer horizons than that. We conclude that the odd lot marketable retail order imbalance
measure’s predictive power is much weaker than that of the overall marketable retail order
imbalance.
E. Order Sizes
As Figure 1 Panel A shows, a median market order submitted by a retail investor is around
$7,000. The median marketable retail trade is about 400 shares. The “stealth trading” literature
argues that medium-size orders are more likely to be informed and that large orders are usually
broken into smaller orders.
To determine if information content differs according to order size, we partition the orders
into large vs. small groups using 400 shares as the cutoff, and we estimate the predictive regression
for each group separately. The results are reported in Table XI Panel F. We find that both large
and small orders predict future stock returns, but the larger orders’ predictive power is stronger.
Our results suggest that more informed marketable retail investors may demand immediacy by
36
If marketable retail order flow is sufficiently informed, trading with these orders would be
unprofitable. This might raise the question of whether our results are consistent with the apparently
profitable business model of internalizers and wholesalers. Ultimately, as long as the information
content of marketable retail order flow is less than the bid-ask spread being charged, internalizers
and wholesalers on average can still earn positive revenues by trading with these orders. For
example, if a marketable retail buy and a marketable retail sell order arrive at the same time, they
offset each other, and a wholesaler earns the full bid-ask spread charged (the quoted spread less
the price improvement given). Ultimately, internalizers and wholesalers are only exposed to
adverse selection on marketable retail order imbalances. The summary statistics in Table I show
that there is a substantial amount of offsetting marketable retail order flow. The interquartile range
for the volume-based daily order imbalance measure is from -0.301 to 0.217, indicating that, even
at the ends of these ranges, more than two-thirds of the marketable retail order flow in such a stock
on a given day is offsetting buys and sells.
To get a better sense of the profitability of interacting with marketable retail order flow,
we compute standard microstructure information-content measures for the marketable retail trades
in our sample. Specifically, we calculate proportional effective spreads, one-minute price impacts,
and one-minute realized spreads for all marketable retail buys and sells during 2015. Realized
spreads are a standard proxy for trading revenue earned by a liquidity provider such as a
wholesaler. We apply standard data filters, eliminating all trades where effective spreads exceed
$1, and we calculate dollar-volume weighted averages across all stocks. We find that the mean
effective half-spread is 16 basis points. The one-minute price impact is four basis points, leaving
a realized half-spread of 12 basis points. In other words, interacting with our identified marketable
retail order flow appears to be profitable (at least before other costs) for the
wholesalers/internalizers because the bid-ask spreads are sufficiently large. The liquidity provider
(in this case, the wholesaler or internalizer) loses about four basis points (the price impact) of the
bid-ask spread to short-term price moves, but this leaves about 12 basis points (the realized spread)
of the bid-ask spread as average trading revenue to the liquidity provider. Note that the realized
37
We can also examine some of the segmentation that is performed by these liquidity
providers. For instance, while there is substantial competition among wholesalers, the magnitude
of price improvement can vary substantially across orders. These liquidity providers are likely to
rationally incorporate the potential information embedded in marketable retail orders and offer
price improvement only up to the point at which they can still profit from the trade. That is, on the
one hand, if they infer there might be relevant information in the marketable retail order, they
might offer less price improvement, and on the other hand, if they conclude that the marketable
retail order is unlikely to contain relevant information, they might be willing to offer more price
improvement. If this is true, the predictive power of marketable retail order imbalances should be
greater for marketable retail trades with less price improvement.
In the earlier sections, we group all orders with subpenny prices between 0.6 and 1 as
marketable retail-initiated buy orders and group those between 0 and 0.4 as marketable retail-
initiated sell orders. In this section, we further divide orders into “less price improvement” and
“more price improvement” types. For transactions with less improvement, we define buyer-
initiated trades as transactions with prices between 0.8 and the round penny, and seller-initiated
trades as trades with transaction prices between the round penny and 0.2 cents. For the “more price
improvement” category, we define buyer-initiated trades as trades with transaction prices between
0.6 and 0.8, and seller-initiated trades as trades with transaction price between 0.2 and 0.4. We
compute marketable retail order imbalances following equation (1) and (2). We compare the
predictive power of marketable retail order imbalances for “more” vs. “less” price improvement
by estimating equation (6) on each order imbalance measure separately.
Recall that the distribution of subpenny price improvements is displayed in Figure 1 Panel
B. Most transaction occur at a round penny or half penny. Based on the other bins, each covering
0.1 cent, there is slightly more trading volume for the “less price improvement” category than for
38
Kelley and Tetlock (2013) use the Dow Jones news archive to identify whether marketable
retail order flows are informed about cash flow news, and find that marketable retail market orders
can predict earnings surprises.
Here, we examine whether marketable retail order flow becomes more predictive around
earnings news. Specifically, we estimate a variant of equation (6) that allows the predictive
relationship to differ based on the variable eventday, an indicator that takes a value of 1 if day t is
an earnings announcement day and zero otherwise. The results are shown in Table XI Panel H.
They show that the predictive power of marketable retail order flow is greater on announcement
days, but the difference is not statistically significant. In Appendix Table AVIII, we directly
replicate the results in Kelley and Tetlock (2013), and find that our marketable retail order flows
can predict earnings news positively. The predictive power is statistically significant at the 1-day
horizon, but insignificant over longer horizons. That is, we are able to partially confirm KT’s
results. The difference may be attributable to the different samples periods and coverage we use.
While we cover all subpenny trades for most stocks over the 2010 to 2015 period, Kelley and
Tetlock cover about 1/3 of all marketable retail trades between 2003 and 2007.
Previous studies such as Chordia and Subrahmanyam (2004) find that overall order
imbalances (calculated using all reported transactions, including individual and institutional types)
can predict future stock returns. We use the Lee-Ready algorithm to compute the overall order
39
We proceed in two steps to address this question and report the results in Table XI Panel I.
In the first step, we re-estimate equation (6) using the overall order imbalance from the previous
week rather than marketable retail order imbalance as a key predicting variable. Consistent with
the literature, we find that overall order imbalances significantly predict future stock returns, with
a coefficient of 0.0004 and a significant t-statistic of 3.32.
In the second step, we estimate equation (6) using the marketable retail order imbalance
variables as key predicting variables, and include the overall order imbalance as a control. With
both marketable retail and market order imbalances in the model, marketable retail imbalances are
significantly positive, and they completely drive out the effect of overall order imbalances. Thus,
the predictive power of the marketable retail order imbalance seems to be stronger than that of the
overall order imbalance measure.
Here, we want to be cautious about the interpretation in the sense that this finding does not
necessarily indicate that the retail order flow that we identify is more informed than order flow
from institutional investors. First, due to different calculation methods for the two measures, the
difference between the overall oib and the marketable retail order imbalance is not the order
imbalance from institutional investors. Second, we only calculate the order flow from marketable
retail orders, which accounts for about half of the trades from retail investors, and the overall oib’s
weaker predictive power might be partially a result of uninformed trading by other participants in
the market.
Our identification for buy and sell orders relies on the implicit assumption that price
improvements are always a small fraction (less than half) of a cent. If price improvements are
larger, our method may not correctly sign trades. For example, if a stock has a bid price of $50.01
and an ask price of $50.04, and a marketable retail market buy order arrives and is improved by
0.75 cents, the reported transaction price would be $50.0325, and our trade-signing approach
40
IV. Conclusions
In this paper, we exploit the fact that most marketable retail order flows in U.S. equity
markets are internalized or sold to wholesalers. As a part of this routing process, marketable retail
orders are typically given a small fraction of a penny per share of price improvement relative to
the national best bid or offer price, and this price improvement can be observed when the trade is
reported to the consolidated tape. Institutional orders almost never receive this kind of price
improvement, so it becomes possible to use subpenny trade prices to identify a broad swath of
marketable marketable retail order flow. It is also straightforward to identify whether the
marketable retail order is buying or selling stock: transactions at prices that are just above a round
penny are classified as marketable retail sales, while transactions that are just below a round penny
are marketable retail purchases.
We use this methodology to characterize the trading behavior and information content of
marketable retail orders. We find that marketable retail order flows are on average contrarian over
weekly horizons, buying stocks that have experienced recent price declines and selling stocks that
have risen in the past week. More significantly, we find that the marketable retail order flow can
predict the cross section of future stock returns. Over the next week, stocks with more positive
marketable retail order imbalances outperform stocks with relatively negative marketable retail
41
20
Our measure is already used in a few studies. For instance, Farrell et al. (2020) find our retail order imbalances are
strongly correlated with the sentiment of “Seeking Alpha” articles and that the ability of retail order imbalances to
predict returns is roughly twice as high on days with SeekingAlpha posts. Israeli, Kasznik, and Sridharan (2019) use
our methodology to identify retail investor trading and then use abnormal retail trading volume and Bloomberg
searches as specific measures of retail and institutional investor attention.
42
Arif, Salman and Ben-Rephael, Azi and Lee, Charles M.C., 2016, Short-Sellers and Mutual
Funds: Why Does Short-Sale Volume Predict Stock Returns? Working Paper.
Barber, Brad M., and Terrance Odean, 2000, Trading is hazardous to your wealth: The common
stock investment performance of individual investors, Journal of Finance 55, 773−806.
Barber, Brad M., and Terrance Odean, 2008, All that glitters: The effect of attention and news
on the buying behavior of individual and institutional investors, Review of Financial Studies 21, 785-
818.
Barber, Brad M., Terrance Odean, and Ning Zhu, 2009, Do retail trades move markets? Review
of Financial Studies 22, 151−186.
Barrot, Jean-Noel, Ron Kaniel, and David Alexandre Sraer, 2016, Are Retail Traders
Compensated for Providing Liquidity? Journal of Financial Economics 120, 146-168.
Battalio, Robert, Shane A. Corwin, and Robert Jennings, 2016, Can Brokers Have It All? On
the Relation between Make-Take Fees and Limit Order Execution Quality, Journal of Finance 71,
2193-2238.
Blume, Marshall E., and Robert F. Stambaugh, 1983, Biases in Computed Returns: An
Application to the Size Effect, Journal of Financial Economics 12, 387-404.
Boehmer, Ekkehart, Charles M. Jones, and Xiaoyan Zhang, 2008. Which Shorts Are Informed?
Journal of Finance 63, 491-527.
Campbell, John Y., Tarun Ramadorai, and Allie Schwartz, 2009, Caught on tape: Institutional
Trading, Stock Returns, and Earnings Announcements, Journal of Financial Economics, 92(1), 66-91.
Chakrabarty, Bidisha, Pamela C. Moulton, and Charles Trzcinka, 2017, The Performance of
Short-Term Institutional Trades. Journal of Financial and Quantitative Analysis 52.4 (2017), 1403-
1428.
Chordia, Tarun, and Avanidhar Subrahmanyam, 2004, Order Imbalance and Stock Returns:
Theory and Evidence, Journal of Financial Economics 72, 485−518.
Chordia, Tarun, Sahn-Wook Huh, and Avanidhar Subrahmanyam, 2007, The Cross-Section of
Expected Trading Activity, Review of Financial Studies 20, 709-740.
Fama, Eugene F., and James D. MacBeth, 1973, Risk, Return, and Equilibrium: Empirical
Tests, Journal of Political Economy 81, 607−636.
43
44
45
Reg I II III IV
Dep.var Mroibvol Mroibvol Mroibtrd Mroibtrd
Return Bid-ask return CRSP return Bid-ask return CRSP return
Coef. t-stat Coef. t-stat Coef. t-stat Coef. t-stat
Intercept -0.4013 -20.03 -0.4065 -20.19 -0.4326 -22.00 -0.4357 -22.01
Mroib(w-1) 0.2200 92.53 0.2201 92.57 0.2865 150.01 0.2866 150.06
Ret(w-1) -0.9481 -40.60 -0.9620 -41.43 -0.9003 -35.92 -0.9156 -36.74
Ret(m-1) -0.2778 -19.24 -0.2784 -19.30 -0.2258 -14.84 -0.2262 -14.87
Ret(m-7,m-2) -0.0586 -11.49 -0.0584 -11.46 -0.0380 -6.50 -0.0378 -6.48
Lmto 0.0003 5.31 0.0003 5.19 0.0002 3.93 0.0002 3.83
Lvol 0.8100 8.37 0.8478 8.79 0.4366 4.24 0.4633 4.51
Size 0.0154 12.06 0.0157 12.31 0.0209 16.37 0.0211 16.48
Lbm -0.0275 -17.66 -0.0274 -17.61 -0.0274 -18.09 -0.0273 -18.05
Adj.R2 6.00% 6.01% 9.49% 9.50%
46
Reg I II III IV
Order imbalance Mroibvol Mroibvol Mroibtrd Mroibtrd
Dep.var Bid-ask return CRSP return Bid-ask return CRSP return
Coef. t-stat Coef. t-stat Coef. t-stat Coef. t-stat
Intercept 0.0050 2.58 0.0056 2.85 0.0050 2.58 0.0056 2.85
Mroib(w-1) 0.0009 15.60 0.0010 16.29 0.0008 12.30 0.0008 13.20
Ret (w-1) -0.0185 -5.83 -0.0220 -6.85 -0.0186 -5.88 -0.0222 -6.91
Ret (m-1) 0.0006 0.35 0.0006 0.34 0.0005 0.29 0.0005 0.29
Ret (m-7, m-2) 0.0008 1.16 0.0008 1.16 0.0008 1.12 0.0008 1.12
Lmto 0.0000 -3.37 0.0000 -3.76 0.0000 -3.36 0.0000 -3.75
Lvol -0.0223 -1.41 -0.0205 -1.31 -0.0217 -1.37 -0.0198 -1.27
Size -0.0001 -0.86 -0.0001 -0.92 -0.0001 -0.90 -0.0001 -0.96
Lbm -0.0001 -0.39 0.0000 -0.07 -0.0001 -0.42 0.0000 -0.10
Adj.R2 3.85% 3.85% 3.84% 3.84%
Interquartile 1.1888 1.1888 1.2292 1.2292
Interquartile weekly return diff 0.1089% 0.1144% 0.0931% 0.0997%
47
48
49
Panel A. Form portfolios on the previous week marketable retail order imbalance based on number of shares traded
Holding Whole sample Small Medium Big
Period Mean t-stat alpha t-stat alpha t-stat alpha t-stat alpha t-stat
1 week 0.092% 2.66 0.084% 2.43 0.403% 9.16 0.170% 6.24 0.067% 1.78
2 weeks 0.147% 2.45 0.135% 2.46 0.669% 9.01 0.292% 6.81 0.105% 1.70
4 weeks 0.223% 1.89 0.208% 2.00 1.124% 10.43 0.423% 6.36 0.143% 1.22
6 weeks 0.310% 1.72 0.277% 1.73 1.399% 13.02 0.558% 6.07 0.171% 1.05
8 weeks 0.448% 1.92 0.460% 2.26 1.709% 17.13 0.623% 4.18 0.342% 1.69
10 weeks 0.515% 1.99 0.484% 1.81 1.704% 11.17 0.578% 3.87 0.381% 1.53
12 weeks 0.588% 2.09 0.629% 1.89 1.857% 7.65 0.556% 3.20 0.477% 1.48
Panel B. Form portfolios on the previous week marketable retail order imbalance based on number of trades
Holding Whole sample Small Medium Big
Period Mean t-stat alpha t-stat alpha t-stat alpha t-stat alpha t-stat
1 week 0.056% 1.34 0.061% 1.44 0.343% 7.04 0.104% 3.52 0.055% 1.42
2 weeks 0.137% 1.72 0.143% 1.89 0.557% 6.72 0.194% 4.02 0.119% 1.61
4 weeks 0.238% 1.61 0.251% 1.88 0.880% 6.98 0.277% 3.75 0.214% 1.61
6 weeks 0.311% 1.50 0.350% 1.93 1.145% 6.25 0.313% 2.62 0.304% 1.84
8 weeks 0.427% 1.58 0.523% 2.26 1.468% 6.40 0.353% 1.91 0.449% 2.19
10 weeks 0.454% 1.41 0.539% 1.74 1.442% 5.37 0.292% 1.56 0.483% 1.64
12 weeks 0.529% 1.47 0.667% 1.70 1.672% 5.30 0.228% 1.05 0.567% 1.51
50
Panel A. First stage of projecting order imbalance on persistence and past return
Reg I II III IV
Dep.var Mroibvol(w-1) Mroibvol(w-1) Mroibtrd(w-1) Mroibtrd(w-1)
Return Bid-ask return CRSP return Bid-ask return CRSP return
Coef. t-stat Coef. t-stat Coef. t-stat Coef. t-stat
Intercept -0.1413 -24.66 -0.1408 -24.61 -0.1054 -17.23 -0.1049 -17.19
Mroib(w-2) 0.2227 96.20 0.2228 96.20 0.2906 149.82 0.2907 149.85
Ret(w-2) -0.9286 -38.93 -0.9422 -39.80 -0.8926 -34.92 -0.9076 -35.81
Ret(m-1) -0.2029 -13.93 -0.2025 -13.90 -0.1591 -10.72 -0.1588 -10.70
Ret(m-7,m-2) -0.0267 -4.98 -0.0268 -4.99 -0.0054 -0.86 -0.0055 -0.88
Adj.R2 5.62% 5.63% 8.99% 9.00%
51
52
53
54
55
56
This table reports estimation results on a decomposition of our marketable retail order flow measure’s predictive power for future returns.
Our sample period covers January 2010 to December 2014, and our sample firms are all common stocks listed on U.S. stock exchanges
with a share price of at least $1. We estimate two-stage Fama-MacBeth regressions specified in equations (14) - (16). The dependent
variable is weekly returns, computed in two ways: using end-of-day bid-ask average price or CRSP closing price. The independent
variables are scaled marketable retail order imbalance measures: mroibvol (based on the number of shares traded) and mroibtrd (based
on the number of trades). In the first-stage estimation, the order imbalance measures are decomposed into four components. The variable
mroib(w-1, persistence) is estimated in the first stage using past order imbalance and reflects price pressure. The variable mroib(w-1,
contrarian) is estimated in the first stage using past returns over different horizons, which is connected to the liquidity provision
hypothesis. The variable mroib(w-1, public) is estimated in the first stage using week w-1 news sentiment, which proxies for marketable
retail order imbalances that predict returns associated with future news releases. The residual part of previous-week order imbalance
from first-stage estimation is denoted as “other,” which we attribute to retail investors’ valuable private information about future returns
that is incorporated into prices but is not associated with an identifiable public news release. As additional control variables, we include
previous-week return, ret(w-1), previous-month return, ret(m-1), and previous 6-month return, ret(m-7, m-2). Other control variables
are log book-to-market ratio (lbm), log market cap (size), monthly turnover (lmto), and monthly volatility of daily returns (lvol), all
measured at the end of the previous month. To account for serial correlation in the coefficients, the standard deviations of the time-series
are adjusted using Newey-West (1987) with five lags.
Panel A. First stage of projecting order imbalance on persistence, past return, and public news.
Reg I II III IV
Dep.var Mroibvol(w-1) Mroibvol(w-1) Mroibtrd(w-1) Mroibtrd(w-1)
Return Bid-ask return CRSP return Bid-ask return CRSP return
Coef. t-stat Coef. t-stat Coef. t-stat Coef. t-stat
Intercept -0.1510 -24.08 -0.1505 -24.03 -0.1132 -16.89 -0.1127 -16.85
Mroib(w-2) 0.2208 112.64 0.2209 112.67 0.2827 125.44 0.2829 125.48
Ret(w-2) -0.8918 -36.22 -0.9051 -37.00 -0.8940 -34.04 -0.9098 -35.01
Ret(m-1) -0.2169 -13.68 -0.2156 -13.60 -0.1702 -10.56 -0.1687 -10.47
Ret(m-7,m-2) -0.0264 -4.54 -0.0264 -4.55 -0.0080 -1.17 -0.0081 -1.19
Sent(w-1) 0.0249 11.60 0.0249 11.60 0.0305 13.81 0.0305 13.81
Adj.R2 5.49% 5.50% 8.58% 8.59%
57
58
59
60
61
25,000
20,000
15,000
10,000
5,000
0
2010 2011 2012 2013 2014 2015
25,000
20,000
15,000
10,000
5,000
-
=0 0< 0.1< 0.2< 0.3< 0.4< =0.5 0.5< 0.6< 0.7< 0.8< 0.9<
<=0.1 <=0.2 <=0.3 <=0.4 <0.5 <=0.6 <=0.7 <=0.8 <=0.9 <1
62
63
Panel A. Weekly portfolio return difference using end-of-day bid-ask average prices
0.06
0.05
0.04
0.03
0.02
0.01
-0.01
-0.02
-0.03
-0.04
201001 201101 201201 201301 201401 201501
64
250
200
150
100
50
120000
100000
80000
60000
40000
20000
0.14
0.12
0.10
0.08
0.06
0.04
0.02
0.00
1 2 3 4 5 10 20 30 40 50 60 70 80
Panel B. Correlation of Marketable Retail Order Imbalance Measure Using Number of Trades with
Past Returns
0.030
0.025
0.020
0.015
0.010
0.005
0.000
-0.005 1 2 3 4 5 10 20 30 40 50 60 70 80
-0.010
-0.015
-0.020
Panel D. Seasonality
Reg I II III IV
Dep.var Mroibvol Mroibvol Mroibtrd Mroibtrd
Coef. t-stat Coef. t-stat Coef. t-stat Coef. t-stat
Intercept -0.0383 -138.89 -0.0367 -155.82 -0.0323 -124.24 -0.0303 -136.36
Monday -0.0034 -6.04 -0.0049 -9.19
Friday 0.0027 4.94 0.0039 7.50
December -0.0343 -43.95 -0.0413 -56.10
January 0.0153 19.01 0.0168 22.08
10
Panel B. Second stage decomposition of order imbalance’s predictive power, after first stage in Panel A
Reg I II III IV
Order Imbalance Mroibvol(w) Mroibvol(w) Mroibtrd(w) Mroibtrd(w)
Dep.var Bid-ask return CRSP return Bid-ask return CRSP return
Coef. t-stat Coef. t-stat Coef. t-stat Coef. t-stat
Intercept 0.0045 2.19 0.0051 2.47 0.0045 2.15 0.0051 2.44
Mroib(w-1,persistence) 0.0027 8.72 0.0029 9.36 0.0018 7.73 0.0019 8.49
Mroib(w-1,contrarian) 0.0420 0.71 0.0545 0.74 0.2910 1.73 0.1832 0.94
Mroib(w-1,other) 0.0008 13.99 0.0009 14.94 0.0006 10.17 0.0007 11.25
Ret(w-1) -0.0609 -2.36 -0.0883 -2.23 -0.3131 -1.80 -0.2255 -1.27
Ret(m-1) 0.0220 0.72 0.0080 0.35 0.0455 1.12 0.0190 0.82
Ret(m-7,m-2) 0.0006 2.04 0.0006 2.06 0.0006 1.88 0.0006 1.91
Lmto 0.0000 -3.36 0.0000 -3.67 0.0000 -3.36 0.0000 -3.68
Lvol -0.0230 -1.46 -0.0232 -1.49 -0.0225 -1.43 -0.0227 -1.45
Size -0.0001 -0.56 -0.0001 -0.62 -0.0001 -0.59 -0.0001 -0.66
Lbm -0.0001 -0.50 0.0000 -0.18 -0.0001 -0.59 -0.0001 -0.27
Adj.R2 4.04% 4.05% 4.03% 4.04%
Interquartile return diff Interquartile return diff Interquartile return diff Interquartile return diff
Mroib(w-1,persistence) 0.2579 0.0689% 0.2579 0.0739% 0.3473 0.0617% 0.3472 0.0675%
Mroib(w-1,contrarian) 0.0497 0.2087% 0.0499 0.2721% 0.0546 1.5878% 0.0551 1.0096%
Mroib(w-1,other) 1.1152 0.0911% 1.1152 0.0974% 1.1332 0.0714% 1.1332 0.0789%
11
Panel D. Second-stage decomposition of order imbalance’s predictive power, after first stage in Panel C
Reg I II III IV
Order Imbalance Mroibvol(w) Mroibvol(w) Mroibtrd(w) Mroibtrd(w)
Dep.var Bid-ask return CRSP return Bid-ask return CRSP return
Coef. t-stat Coef. t-stat Coef. t-stat Coef. t-stat
Intercept 0.0049 2.43 0.0054 2.69 0.0048 2.39 0.0053 2.66
Mroib(w-1,persistence) 0.0027 8.64 0.0029 9.20 0.0018 7.80 0.0020 8.51
Mroib(w-1,positive) 0.0196 0.08 0.4550 1.12 -0.0102 -0.03 -0.6046 -0.84
Mroib(w-1,negative) 0.0952 0.38 -0.6257 -1.19 -0.3945 -0.88 0.5624 0.77
Mroib(w-1,other) 0.0008 14.56 0.0009 15.53 0.0006 10.82 0.0007 11.93
Ret(w-1) 0.0213 0.07 -0.2911 -0.89 -0.1151 -0.26 0.1337 0.45
Ret(m-1) 0.0061 0.24 -0.0287 -0.88 0.0087 0.42 0.0017 0.15
Ret(m-7,m-2) 0.0005 1.65 0.0006 1.92 0.0005 1.88 0.0005 1.81
Lmto 0.0000 -3.44 0.0000 -3.76 0.0000 -3.44 0.0000 -3.76
Lvol -0.0202 -1.32 -0.0207 -1.37 -0.0199 -1.30 -0.0203 -1.34
Size -0.0001 -0.71 -0.0001 -0.76 -0.0001 -0.75 -0.0001 -0.80
Lbm -0.0001 -0.57 -0.0001 -0.25 -0.0001 -0.67 -0.0001 -0.34
Adj.R2 4.30% 4.32% 4.29% 4.31%
Interquartile return diff Interquartile return diff Interquartile return diff Interquartile return diff
Mroib(w-1,persistence) 0.2563 0.0690% 0.2564 0.0737% 0.3461 0.0622% 0.3461 0.0677%
Mroib(w-1,positive) 0.0357 0.0699% 0.0360 1.6397% 0.0405 -0.0412% 0.0407 -2.4626%
12
13
14
15
16
17
mroibvol mroibother
Topic Type Description N a1 t-stat a1 t-stat
FUND cross market investment funds 93,142 -0.0039 -1.19 -0.0003 -0.42
REGS cross market regulatory issues 61,513 -0.0044 -1.63 -0.0008 -1.20
MNGISS cross market management issues 54,170 -0.0022 -0.96 -0.0002 -0.31
NEWS cross market top stories 36,265 -0.0019 -0.30 -0.0016 -1.08
EXCA cross market exchange activities 20,533 -0.0114 -1.39 -0.0037 -1.56
DRV cross market derivatives 18,061 0.0032 0.45 0.0002 0.12
ISU cross market new issues 17,061 0.0073 0.96 0.0011 0.60
INV cross market Investing 13,916 0.0090 1.20 0.0015 0.82
PRESS cross market press digests 9,783 0.0029 0.22 0.0005 0.19
TRD cross market international trade 4,782 0.0192 1.65 0.0055 1.66
BKRT cross market bankruptcy 4,700 -0.0090 -0.75 -0.0002 -0.08
RTM cross market Retirement 3,075 -0.0187 -1.43 -0.0016 -0.46
HEA general news health/medicines 52,110 -0.0099 -3.13 -0.0020 -2.51
POL general news domestic politics 49,699 -0.0129 -4.45 -0.0029 -4.26
JUDIC general news judicial 28,280 0.0056 1.46 0.0018 1.62
LAW general news legislation 17,135 -0.0016 -0.28 0.0003 0.22
ENV general news environment/nature 16,189 -0.0178 -3.87 -0.0044 -3.64
SCI general news science/technology 11,035 -0.0175 -1.87 -0.0051 -2.32
CRIM general news crime 9,866 0.0230 2.27 0.0036 1.76
SECUR general news security 7,528 0.0121 1.44 0.0021 0.89
DIP general news diplomacy 6,057 0.0167 1.81 0.0024 0.91
18
19
20
21