2206.03186v1
2206.03186v1
8, AUGUST 2015 1
Abstract—One of the fundamental problems of using opti- models, and that they should ultimately be replaced by a-
mization models that use different time series as data input, posteriori methods. To that purpose, we first define full and
is the trade-off between model accuracy and computational aggregated optimization models and apply a traditional k-
arXiv:2206.03186v1 [math.OC] 7 Jun 2022
S. Wogrin, Institute of Electricity Economics and Energy Innovation, Graz 2 If it is not possible to meet demand with existing generators, then there
University of Technology, Austria, e-mail: [email protected] is non-supplied energy at a high cost. We have not modeled this explicitly
1 A-posteriori methods employ preliminary optimizations to improve the for simplicity. But a fictitious generator with high operating cost and infinite
aggregation process. upper bound can represent non-supplied energy in the presented formulation.
JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2015 2
Wind
aggregation/clustering procedure. A stylized formulation of the
full (left) and aggregated (right) ED is given below:
0.4
X X
min Cg pg,h min Cg pg,r Wr
g,h g,r 0.2
X X
s.t. pg,h = Dh ∀h s.t. pg,r = Dr ∀r
g g 0
0 0.2 0.4 0.6 0.8 1
P g ≤ pg,h ≤ P g,h ∀g, h P g ≤ pg,r ≤ P g,r ∀g, r
Demand
Consider two hours with different TS data whose optimal the MSE obtained by k-means, so the input error is higher
solution of the ED belong to the same basis B. The TS data with the obtained clusters. However, if we cluster all hours
affects the right-hand-side (RHS) vector of the constraints. An within their basis, then the aggregated optimization results are
expected value (or centroid) of these data, yields an optimal exact! Solving the aggregated optimization model with only
solution that also has the same optimal basis B. 3 representative periods that have been clustered, accounting
for the corresponding bases, yields an error of 0%. Apart
Theorem. For each i = 1, . . . , I consider the following LPs
from being theoretically exact, basis-oriented clustering also
(Ei ): min cT x s.t. Ax = bi , where x ∈ Rn , A ∈ Rmxn , c ∈
establishes the maximum number of clusters necessary to
Rn , bi ∈ Rm and the LPs only differ in the RHS values bi .
obtain exact optimization results, which is nothing current
Then, B is also an optimal basis for
P the problem (E): min cT x
bi TSA methods can offer.
s.t. Ax = E(bi ), where E(bi ) = i I .
Proof. We proof this by contradiction. Assume that B is not IV. C ONCLUSION
the optimal basis for problem (E): min cT x s.t. Ax = E(bi ). The takeaways from this paper are as follows. First, a-
Instead, let B (6= B) and N be the optimal basis and the priori TSA methods are fundamentally flawed when used
non-basis matrices of (E). Under this assumption, it follows in/for optimization models and therefore should be abandoned
T
that cB xB < cTB xB for this problem. In (E) we obtain that and replaced by a-posteriori methods. As we have shown by
Ax = BxB + N xN = E(bi ). By definition xN = 0, so counter-example, the lowest input error (as indicated by MSE)
we further simplify Ax = BxB = E(bi ) and it follows that does not translate into the lowest (or even a low) output error
−1
xB = B (E(bi )). We now substitute this optimal solution in when approximating full optimization model results. Second,
the objective function of (E), which yields cT x = cTB xB = basis-oriented TSA can achieve a tremendous reduction (3 out
−1 −1 P −1 of 8760 hours) in input data of several orders of magnitude
cTB B (E(bi )) = cTB B ( i bIi ) = I1 cTB B b1 + . . . + while replicating full model results exactly. This confirms that
1 T −1
I cB B bI . Since we know that B is an optimal basis for picking clusters intelligently can outperform traditional one-
1 T −1 size-fits-all a-priori TSA methods (such as k-means) even if
each problem (Ei ), we can say that: I cB B b1 + . . . +
1 T −1 those use most of the original data. Finally, more research is
I cB B bI ≥ I1 cTB B −1 b1 + . . . + 1 T −1
I cB B bI = cTB xB . This
required to develop a theoretical framework on most efficient
would imply that B is an optimal basis for (E), which is a a-posteriori TSA methods. The basis-oriented TSA proposed
contradiction. here could be a starting point. In future research, we plan to
This result has significant ramifications with respect to clus- explore basis-oriented TSA further to see if it can be extended
tering TS data: imagine two hours with different data, i.e. b1 to large-scale, realistic problems with, e.g., time-linking and
and b2 , but an identical optimal basis B for the underlying LP, discrete constraints.
then the full optimization model (with individually represented ACKNOWLEDGMENT
hours), and the aggregated optimization model where we only
have one expected hour (i.e. the cluster centroid or expected I want to thank D. Cardona-Vasquez, D. Di Tondo, S. Pineda
value b1 +b and J.M. Morales for their comments.
2 ), yield the same objective function value, and the
2
same expected results for the variables. Hence, those two R EFERENCES
hours can be merged without losing any accuracy in the
[1] H. Teichgraeber, A. R. Brandt, Clustering methods to find representative
final aggregated model. In other words, if hours are aggre- periods for the optimization of energy systems: An initial framework and
gated within their basis and represented by the cluster centroid, comparison, Applied energy 239 (2019) 1283–1293.
then the aggregated model results will be exactly the same [2] M. Hoffmann, L. Kotzur, D. Stolten, M. Robinius, A review on time
series aggregation methods for energy system models, Energies 13 (3)
as the full hourly model results and have zero output error (2020) 641.
in expectation. This shows: first, that TSA for optimization [3] F. Domı́nguez-Muñoz, J. M. Cejudo-López, A. Carrillo-Andrés,
purposes must be based on the impact of the aggregation on M. Gallardo-Salazar, Selection of typical demand days for chp optimiza-
tion, Energy and buildings 43 (11) (2011) 3036–3043.
the optimization output error and not on similarity of input [4] F. J. De Sisternes, M. D. Webster, O. J. De Sisternes, Optimal selection
data as in traditional methods; second, it introduces basis- of sample weeks for approximating the net load in generation planning
oriented TSA as a promising way of clustering for aggregated problems (2013).
[5] F. D. Munoz, A. D. Mills, Endogenous assessment of the capacity value
optimization problems. of solar pv in generation investment planning studies, IEEE Transactions
on Sustainable Energy 6 (4) (2015) 1574–1585.
[6] I. J. Scott, P. M. Carvalho, A. Botterud, C. A. Silva, Clustering represen-
B. Economic dispatch and basis-oriented clustering tative days for power systems generation expansion planning: Capturing
the effects of variable renewables and energy storage, Applied Energy
Applying basis-oriented clustering to the above-mentioned 253 (2019) 113603.
ED example shows that there are only 3 different bases. [7] A. Pöstges, C. Weber, Time series aggregation–a new methodological
approach using the “peak-load-pricing” model, Utilities Policy 59 (2019).
Therefore, we only require 3 clusters. In Figure 1b, we [8] M. Sun, F. Teng, X. Zhang, G. Strbac, D. Pudjianto, Data-driven represen-
have color-coded each hour depending on the basis (and tative day selection for investment decisions: A cost-oriented approach,
corresponding cluster) this hour belongs to: blue (the wind IEEE Transactions on Power Systems 34 (4) (2019) 2925–2936.
[9] C. Li, A. J. Conejo, J. D. Siirola, I. E. Grossmann, On representative
generator is the marginal generator), black (thermal is on the day selection for capacity expansion planning of power systems under
margin), and red (hours with non-supplied energy). The MSE extreme operating conditions, International Journal of Electrical Power
in the input space is 0.0385, which is 2.3 times larger than & Energy Systems 137 (2022) 107697.