0% found this document useful (0 votes)

40 views12 pages

Improving Online Algorithms Via ML Predictions: Ravi Kumar

Uploaded by

MrDonMatti

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views12 pages

Improving Online Algorithms Via ML Predictions: Ravi Kumar

Uploaded by

MrDonMatti

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Improving Online Algorithms via ML Predictions∗

Ravi Kumar Manish Purohit Zoya Svitkina

Google Google Google
[email protected] [email protected] [email protected]
arXiv:2407.17712v1 [cs.DS] 25 Jul 2024

Abstract
In this work we study the problem of using machine-learned predictions to improve
the performance of online algorithms. We consider two classical problems, ski
rental and non-clairvoyant job scheduling, and obtain new online algorithms that
use predictions to make their decisions. These algorithms are oblivious to the
performance of the predictor, improve with better predictions, but do not degrade
much if the predictions are poor.

1 Introduction
Dealing with uncertainty is one of the most challenging issues that real-world computational tasks,
besides humans, face. Ranging from “will it snow next week?” to “should I rent an apartment or
buy a house?”, there are questions that cannot be answered reliably without some knowledge of the
future. Similarly, the question of “which job should I run next?” is hard for a CPU scheduler that
does not know how long this job will run and what other jobs might arrive in the future.
There are two interesting and well-studied computational paradigms aimed at tackling uncertainty.
The first is in the field of machine learning where uncertainty is addressed by making predictions
about the future. This is typically achieved by examining the past and building robust models based
on the data. These models are then used to make predictions about the future. Humans and real-world
applications can use these predictions to adapt their behavior: knowing that it is likely to snow next
week can be used to plan a ski trip. The second is in the field of algorithm design. Here, the effort
has to been to develop a notion of competitive ratio2 for the goodness of an algorithm in the presence
of an unknown future and develop online algorithms that make decisions heedless of the future but
are provably good in the worst-case, i.e., even in the most pessimistic future scenario. Such online
algorithms are popular and successful in real-world systems and have been used to model problems
including paging, caching, job scheduling, and more (see the book by Borodin and El-Yaniv [5]).
Recently, there has been some interest in using machine-learned predictions to improve the quality
of online algorithms [21, 19]. The main motivation for this line of research is two-fold. The first
is to design new online algorithms that can avoid assuming a worst-case scenario and hence have
better performance guarantees both in theory and practice. The second is to leverage the vast
amount of modeling work in machine learning, which precisely deals with how to make predictions.
Furthermore, as machine-learning models are often retrained on new data, these algorithms can
naturally adapt to evolving data characteristics. When using the predictions, it is important that
the online algorithm is unaware of the performance of the predictor and makes no assumptions on
the types of prediction errors. Additionally, we desire two key properties of the algorithm: (i) if
the predictor is good, then the online algorithm should perform close to the best offline algorithm
(consistency) and (ii) if the predictor is bad, then the online algorithm should gracefully degrade, i.e.,
its performance should be close to that of the online algorithm without predictions (robustness).
∗
The conference version [18] of this work appeared in NeurIPS 2018.
2
Informally, competitive ratio compares the worst-case performance of an online algorithm to the best offline
algorithm that knows the future.
Our problems. We consider two basic problems in online algorithms and show how to use machine-
learned predictions to improve their performance in a provable manner. The first is ski rental, in
which a skier is going to ski for an unknown number of days and on each day can either rent skis at
unit price or buy them for a higher price b and ski for free from then on. The uncertainty is in the
number of skiing days, which a predictor can estimate. Such a prediction can be made reasonably
well, for example, by building models based on weather forecasts and past behavior of other skiers.
The ski rental problem is the canonical example of a large class of online rent-or-buy problems, which
arise whenever one needs to decide between a cheap short-term solution (“renting”) and an expensive
long-term one (“buying”). Several extensions and generalizations of the ski rental problem have
been studied leading to numerous applications such as dynamic TCP acknowledgement [11], buying
parking permits [22], renting cloud servers [14], snoopy caching [13], and others. The best known
deterministic algorithm for ski rental is the break-even algorithm: rent for the first b − 1 days and
buy on day b. It is easy to observe that the break-even algorithm has a competitive ratio of 2 and no
deterministic algorithm can do better. On the other hand, Karlin et al. [12] designed a randomized
e
algorithm that yields a competitive ratio of e−1 ≈ 1.58, which is also optimal.
The second problem we consider is non-clairvoyant job scheduling. In this problem a set of jobs, all
of which are available immediately, have to be scheduled on one machine; any job can be preempted
and resumed later. The objective is to minimize the sum of completion times of the jobs. The
uncertainty in this problem is that the scheduler does not know the running time of a job until it
actually finishes. Note that a predictor in this case can predict the running time of a job, once again, by
building a model based on the characteristics of the job, resource requirements, and its past behavior.
Non-clairvoyant job scheduling, introduced by Motwani et al. [24], is a basic problem in online
algorithms with a rich history and, in addition to its obvious applications to real-world systems, many
variants and extensions of it have been studied extensively in the literature [9, 3, 1, 10]. Motwani et
al. [24] showed that the round-robin algorithm has a competitive ratio of 2, which is optimal.

Main results. Before we present our main results we need a few formal notions. In online algorithms,
the competitive ratio of an algorithm is defined as the worst-case ratio of the algorithm cost to the
offline optimum. In our setting, this is a function c(η) of the error η of the predictor3 . We say that an
algorithm is γ-robust if c(η) ≤ γ for all η, and that it is β-consistent if c(0) = β. So consistency is a
measure of how well the algorithm does in the best case of perfect predictions, and robustness is a
measure of how well it does in the worst-case of terrible predictions.
Let λ ∈ (0, 1) be a hyperparameter. For the ski rental problem with a predictor, we first obtain a
deterministic online algorithm that is (1 + 1/λ)-robust and (1 + λ)-consistent (Section 2.2). We
next improve these bounds by obtaining a randomized algorithm that is ( 1−e1+1/b
−(λ−1/b)
)-robust and
( 1−eλ−λ )-consistent, where b is the cost of buying (Section 2.3). For the non-clairvoyant scheduling
problem, we obtain a randomized algorithm that is (2/(1 − λ))-robust and (1/λ)-consistent. Note
that the consistency bounds for all these algorithms circumvent the lower bounds, which is possible
only because of the predictions.
It turns out that for these problems, one has to be careful how the predictions are used. We illustrate
through an example that if the predictions are used naively, one cannot ensure robustness (Section 2.1).
Our algorithms proceed by opening up the classical online algorithms for these problems and using
the predictions in a judicious manner. We also conduct experiments to show that the algorithms we
develop are practical and achieve good performance compared to ones that do not use any prediction.

Related work. The work closest to ours is that of Medina and Vassilvitskii [21] and Lykouris
and Vassilvitskii [19]. The former used a prediction oracle to improve reserve price optimization,
relating the gap beween the expected bid and revenue to the average predictor loss. In a sense,
this paper initiated the study of online algorithms equipped with machine learned predictions. The
latter developed this framework further, introduced the concepts of robustness and consistency,
and considered the online caching problem with predictions. It modified the well-known Marker
algorithm to use the predictions ensuring both robustness and consistency. While we operate in the
same framework, none of their techniques are applicable to our setting. Another recent work is that

3
The definition of the prediction error η is problem-specific. In both the problems considered in this paper, η
is defined to be the L1 norm of the error.

2
of Kraska et al. [17] that empirically shows that better indexes can be built using machine learned
models; it does not provide any provable guarantees for its methods.
There are other computational models that try to tackle uncertainty. The field of robust optimiza-
tion [16] considers uncertain inputs and aims to design algorithms that yield good performance
guarantees for any potential realization of the inputs. There has been some work on analyzing
algorithms when the inputs are stochastic or come from a known distribution [20, 23, 6]. In the
optimization community, the whole field of online stochastic optimization concerns online decision
making under uncertainty by assuming a distribution on future inputs; see the book by Russell Bent
and Pascal Van Hentenryck [4]. Our work differs from these in that we do not assume anything about
the input; in fact, we do not assume anything about the predictor either!

2 Ski rental with prediction

In the ski rental problem, let rentals cost one unit per day, b be the cost to buy, x be the actual number
of skiing days, which is unknown to the algorithm, and y be the predicted number of days. Then
η = |y − x| is the prediction error. Note that we do not make any assumptions about its distribution.
The optimum cost is OPT = min{b, x}.

2.1 Warmup: A simple consistent, non-robust algorithm

We first show that an algorithm that naively uses the predicted number of days to decide whether or
not to buy is 1-consistent, i.e., its competitive ratio is 1 when η = 0. However, this algorithm is not
robust, as the competitive ratio can be arbitrarily large in case of incorrect predictions.

Algorithm 1: A simple 1-consistent algorithm

if y ≥ b then
Buy on the first day.
else
Keep renting for all skiing days.
end

Lemma 2.1. Let ALG denote the cost of the solution obtained by Algorithm 1 and let OPT denote
the optimal solution cost on the same instance. Then ALG ≤ OPT + η.

Proof. We consider different cases based on the relative values of the prediction y and the actual
number of days x of the instance. Recall that Algorithm 1 incurs a cost of b whenever the prediction
is at least b and incurs a cost of x otherwise.
• y ≥ b, x ≥ b =⇒ ALG = b = OPT.
• y < b, x < b =⇒ ALG = x = OPT
• y ≥ b, x < b =⇒ ALG = b ≤ x + y − x = x + η = OPT + η
• y < b, x ≥ b =⇒ ALG = x < b + x − y = b + η = OPT + η

A major drawback of Algorithm 1 is its lack of robustness. In particular, its competitive ratio can be
unbounded if the prediction y is small but x ≫ b. Our goal next is to obtain an algorithm that is both
consistent and robust.

2.2 A deterministic robust and consistent algorithm

In this section, we show that a small modification to Algorithm 1 yields an algorithm that is both
consistent and robust. Let λ ∈ (0, 1) be a hyperparameter. As we see later, varying λ gives us a
smooth trade-off between the robustness and consistency of the algorithm.
Theorem
2.2. With a parameter λ ∈ (0, 1), Algorithm 2 has a competitive ratio of at most
1+λ η
min , (1 + λ) + . In particular, Algorithm 2 is (1 + 1/λ)-robust and (1 + λ)-
λ (1 − λ)OPT
consistent.

3
Algorithm 2: A deterministic robust and consistent algorithm.
if y ≥ b then
Buy on the start of day ⌈λb⌉
else
Buy on the start of day ⌈b/λ⌉
end

Proof. We begin with the first bound. Suppose y ≥ b and the algorithm buys the skis at the start of
day ⌈λb⌉. Since the algorithm incurs a cost of b + ⌈λb⌉ − 1 whenever x ≥ ⌈λb⌉, the worst competitive
ratio is obtainedwhen x = ⌈λb⌉, for which OPT = ⌈λb⌉. In this case, we have ALG = b+⌈λb⌉−1 ≤
b + λb ≤ 1+λ λ ⌈λb⌉ = 1+λ
λ OPT. On the other hand, when y < b, the algorithm buys skis at the
start of day ⌈b/λ⌉ and rents until then. In this case, the worst competitive ratiois attained whenever
x = ⌈b/λ⌉ as we have OPT = b and ALG = b + ⌈b/λ⌉ − 1 ≤ b + b/λ = 1+λ λ OPT.
To prove the second bound, we need to consider the following two cases. Suppose y ≥ b. Then,
for all x < ⌈λb⌉, we have ALG = OPT = x. On the other hand, for x ≥ ⌈λb⌉, we have
ALG = b + ⌈λb⌉ − 1 ≤ (1 + λ)b ≤ (1 + λ)(OPT + η). The second inequality follows since
either OP T = b (if x ≥ b) or b ≤ y ≤ OPT + η (if x < b). Suppose y < b. Then, for all
x ≤ b, we have ALG = OPT = x. Similarly, for all x ∈ (b, ⌈b/λ⌉), we have ALG = x ≤ y + η <
b + η = OPT + η. Finally for all x ≥ ⌈b/λ⌉, noting that η = x − y > b/λ − b = (1 − λ)b/λ,
1 1
we have ALG = b + ⌈b/λ⌉ − 1 ≤ b + b/λ < b + ( 1−λ )η = OPT + ( 1−λ )η. Thus we obtain
1
ALG ≤ (1 + λ)OPT + ( 1−λ )η, completing the proof.

Thus, Algorithm 2 gives an option to trade-off consistency and robustness. In particular, greater trust
in the predictor suggests setting λ close to zero as this leads to a better competitive ratio when η is
small. On the other hand, setting λ close to one is conservative and yields a more robust algorithm.

2.3 A randomized robust and consistent algorithm

In this section we consider a family of randomized algorithms and compare their performance against
an oblivious adversary. In particular, we design robust and consistent algorithms that yield a better
trade-off than the above deterministic algorithms. Let λ ∈ (1/b, 1) be a hyperparameter. For a given
λ, Algorithm 3 samples the day when skis are bought based on two different probability distributions,
depending on the prediction received, and rents until that day.

Algorithm 3: A randomized robust and consistent algorithm

if y ≥ b then
Let k ← ⌊λb⌋;
k−i
Define qi ← b−1 b
1
· b(1−(1− 1/b)k ) for all 1 ≤ i ≤ k;

Choose j ∈ {1 . . . k} randomly from the distribution defined by qi ;

Buy at the start of day j.
else
Let ℓ ← ⌈b/λ⌉;
ℓ−i
Define ri ← b−1 b
1
· b(1−(1− 1/b)ℓ ) for all 1 ≤ i ≤ ℓ;

Choose j ∈ {1 . . . ℓ} randomly from the distribution defined by ri ;

Buy at the start of day j.
end

Theorem 2.3. Algorithm 3 yields a competitive ratio of at most min{ 1−e1+1/b η

, λ (1 + OPT
−(λ−1/b) 1−e−λ
)}.
In particular, Algorithm 3 is ( 1−e1+1/b
−(λ−1/b)
)-robust and ( 1−eλ−λ )-consistent.4
4 1
The conference version [18] of this paper incorrectly claimed a slightly stronger robustness of 1−e−(λ−1/b)
.

4
Proof. We consider different cases depending on the relative values of y and x.
(i) y ≥ b, x ≥ k. Here, we have OPT = min{b, x}. Since the algorithm incurs a cost of (b + i − 1)
when we buy at the beginning of day i, we have
k k k−i
X X b−1 1 k
E[ALG] = (b + i − 1)qi = (b + i − 1) 1/b)k )
=
i=1 i=1
b b(1 − (1 − 1 − (1 − 1/b)k

k k/b λ
≤ ≤ (OPT + η) ≤ (OPT + η).
1−e −k/b
1 − e−k/b 1 − e−λ
(ii) y ≥ b, x < k. Here, we have OPT = x. On the other hand, the algorithm incurs a cost of
(b + i − 1) only if it buys at the beginning of day i ≤ x. In particular, we have
X x X k
E[ALG] = (b + i − 1)qi + xqi
i=1 i=x+1
" x k−i k k−i #
1 X b−1 X b−1
= (b + i − 1) + x
b(1 − (1 − 1/b)k ) i=1 b i=x+1
b

x 1 1
= ≤ OPT ≤ OPT,
1 − (1 − 1/b)k 1 − e−k/b 1 − e−(λ−1/b)
which establishes robustness. In order to prove consistency, we can rewrite the RHS as follows

1 k/b (b−k)/b
E[ALG] ≤ OPT = OPT + x
1 − e−k/b 1 − e−k/b 1 − e−k/b

k/b k/b λ
≤ OPT + η ≤ (OPT + η),
1 − e−k/b 1 − e−k/b 1 − e−λ
since x < k and b − k ≤ η.
(iii) y < b, x < ℓ. Here, we have OPT = min{b, x}. On the other hand, the expected cost of the
algorithm can be computed similar to (ii)
x ℓ
X X 1
E[ALG] = (b + i − 1)ri + xri ≤ x
i=1 i=x+1
1 − e−ℓ/b

1 λ
≤ (OPT + η) ≤ (OPT + η).
1 − e−1/λ 1 − e−λ
(iv) y < b, x ≥ ℓ. Here, we have OPT = b. The expected cost incurred by the algorithm is as in (i).
ℓ
X ℓ ⌈b/λ⌉
E[ALG] = (b + i − 1)ri = ≤
i=1
1 − (1 − /b)
1 ℓ (1 − e−ℓ/b )

1/λ + 1/b 1 + 1/b
≤ OPT ≤ OPT,
(1 − e−1/λ ) 1 − e−(λ−1/b)
where the last inequality is proven in Lemma A.2 in the Appendix. This completes the proof of
robustness. To prove consistency, we rewrite the RHS as follows.
ℓ ℓ 1
E[ALG] ≤ −ℓ/b
≤ −1/λ
= (b + ℓ − b)
1−e 1−e 1 − e−1/λ

1 λ
≤ (OPT + η) ≤ (OPT + η).
1 − e−1/λ 1 − e−λ
Algorithms 2 and 3 both yield a smooth trade-off between the robustness and consistency guarantees
for the ski rental problem. As shown in Figure 1, the randomized algorithm offers a much better
trade-off by always guaranteeing smaller consistency for a given robustness guarantee. We remark
that setting λ = 1 in Algorithms 2 and 3 allows us to recover the best deterministic and randomized
algorithms for the classical ski rental problem without using predictions.

2.4 Extensions

5
Consider a generalization of the ski rental problem where we
have a varying demand xi for computing resources on each day
i. Such a situation models the problem faced while designing
small enterprise data centers. System designers have the choice
of buying machines at a high setup cost or renting machines
from a cloud service provider to handle the computing needs
of the enterprise. One can satisfy the demand in two ways:
either pay 1 to rent one machine and satisfy one unit of demand
for one day, or pay b to buy a machine and use it to satisfy
one unit of demand for all future days. It is easy to cast the
classical ski rental problem in this framework by setting xi = 1 Figure 1: Ski rental: Robustness vs.
for the first x days and to 0 later. Kodialam [15] considers consistency.
this generalization and gives a deterministic algorithm with a
competitive ratio of 2 as well as a randomized algorithm with
e
competitive ratio of e−1 .
P
Now suppose we have predictions yi for the demand on day i. We define η = i |xi − yi | to be the
total L1 error of the predictions. Both Algorithms 2 and 3 extend naturally to this setting to yield the
same robustness and consistency guarantees as in Theorems 2.2 and 2.3. Our results follow from
viewing an instance of ski rental with varying demand problem as k disjoint instances of the classical
ski rental problem, where k is an upper bound on the maximum demand on any day. The proofs are
similar to those in Sections 2.2 and 2.3; we omit them for brevity.

3 Non-clairvoyant job scheduling with prediction

We consider the simplest variant of non-clairvoyant job scheduling, i.e., scheduling n jobs on a
single machine with no release dates. The processing requirement xj of a job j is unknown to the
algorithm and only becomes known once the job has finished processing. Any job can be preempted
at any time and resumed at a later time without any cost. The objective function is to minimize the
sum of completion times of the jobs. Note that no algorithm can yield any non-trivial guarantees if
preemptions are not allowed.
Let x1 , . . . , xn denote the actual processing times of the n jobs, which are unknown to the non-
clairvoyant algorithm. In the clairvoyant case, when processing times are known up front, the optimal
algorithm is to simply schedule the jobs in non-decreasing order of job lengths, i.e., shortest job
first. A deterministic non-clairvoyant algorithm called round-robin (RR) yields a competitive ratio of
2 [24], which is known to be best possible.
Now, suppose that instead of being truly non-clairvoyant, the algorithm has an oracle that predicts the
processing time of each job. Let y1 , . . . , yn be the predicted
Pn processing times of the n jobs. Then
ηj = |xj − yj | is the prediction error for job j, and η = j=1 ηj is the total error. We assume that
there are no zero-length jobs and that units are normalized such that the actual processing time of
the shortest job is at least one. Our goal in this section is to design algorithms that are both robust
and consistent, i.e., can use good predictions to beat the lower bound of 2, while at the same time
guaranteeing a worst-case constant competitive ratio.

3.1 A preferential round-robin algorithm

In scheduling problems with preemption, we can simplify exposition by talking about several jobs
running concurrently on the machine, with rates that sum to at most 1. For example, in the round-robin
algorithm, at any point of time, all k unfinished jobs run on the machine at equal rates of 1/k. This is
just a shorthand terminology for saying that in any infinitesimal time interval, 1/k fraction of that
interval is dedicated to running each of the jobs.
We call a non-clairvoyant scheduling algorithm monotonic if it has the following property: given two
instances with identical inputs and actual job processing times (x1 , . . . , xn ) and (x′1 , . . . , x′n ) such
that xj ≤ x′j for all j, the objective function value found by the algorithm for the first instance is no
higher than that for the second. It is easy to see that the round-robin algorithm is monotonic.
We consider the Shortest Predicted Job First (SPJF) algorithm, which sorts the jobs in the increasing
order of their predicted processing times yj and executes them to completion in that order. Note that

6
SPJF is monotonic, because if processing times xj became smaller (with predictions yj staying the
same), all jobs would finish only sooner, thus decreasing the total completion time objective. SPJF
produces the optimal schedule in the case that the predictions are perfect, but for bad predictions,
its worst-case performance is not bounded by a constant. To get the best of both worlds, i.e. good
performance for good predictions as well as a constant-factor approximation in the worst-case, we
combine SPJF with RR using the following, calling the algorithm Preferential Round-Robin (PRR).
Lemma 3.1. Given two monotonic algorithms with competitive ratios α and β for the minimum total
completion time problem with preemptions, and a parameter λ ∈ (0, 1), one can obtain an algorithm
β
with competitive ratio min{ αλ , 1−λ }.

Proof. The combined algorithm runs the two given algorithms in parallel. The α-approximation (call
it A) is run at a rate of λ, and the β-approximation (B) at a rate of 1 − λ. Compared to running at
rate 1, if algorithm A runs at a slower rate of λ, all completion times increase by a factor of 1/λ, so it
becomes a αλ -approximation. Now, the fact that some of the jobs are concurrently being executed by
algorithm B only decreases their processing times from the point of view of A, so by monotonicity,
this does not make the objective of A any worse. Similarly, when algorithm B runs at a lower rate of
β
1 − λ, it becomes a 1−λ -approximation, and by monotonicity can only get better from concurrency
with A. Thus, both bounds hold simultaneously, and the overall guarantee is their minimum.

We next analyze the performance of SPJF.

2η

Lemma 3.2. The SPJF algorithm has competitive ratio at most 1 + n .

Proof. Assume w.l.o.g. that jobs are numbered in non-decreasing order of their actual processing
times, i.e. x1 ≤ . . . ≤ xn . For any pair of jobs (i, j), define d(i, j) as the amount of job i that has
been executed before the completion time of job j. In other words, d(i, j) is the amount of time by
which i delays j. Let ALG denote the output of SPJF. Then
n
X X
ALG = xj + (d(i, j) + d(j, i)).
j=1 (i,j):i<j

For i < j such that yi < yj , the shorter job is scheduled first and hence d(i, j) + d(j, i) = xi + 0, but
for job pairs that are wrongly predicted, the longer job is scheduled first, so d(i, j) + d(j, i) = 0 + xj .
This yields
n
X X X n
X X X
ALG = xj + xi + xj = xj + xi + (xj − xi )
j=1 (i,j):i<j (i,j):i<j j=1 (i,j):i<j (i,j):i<j
yi <yj yi ≥yj yi ≥yj
n
X X X X
≤ xj + xi + ηi + ηj = OPT + ηi + ηj ≤ OPT + (n − 1)η,
j=1 (i,j):i<j (i,j):i<j (i,j):i<j
yi ≥yj yi ≥yj

ALG
which yields OPT ≤ 1 + (n−1)η
OPT . Now, using our assumption that all jobs have length at least 1, we
n(n+1)
have OPT ≥ 2 . This yields an upper bound of 1 + 2(n−1)η 2η
n(n+1) < 1 + n on the competitive ratio
of SPJF.

We give an example showing that this bound is asymptotically tight. Suppose that there are n − 1 jobs
with processing times 1 and one job with processing time 1 + ϵ and suppose the predicted lengths are
yj = 1 for all jobs. Then η = ϵ, OPT = n(n+1) 2 + ϵ, and, if SPJF happens to schedule the longest
job first, increasing the completion time of n − 1 jobs by ϵ each, ALG = OPT + (n − 1)ϵ. This gives
ALG 2(n−1)η
the ratio of OPT = 1 + n(n+1)+2ϵ , which approaches the bound in Lemma 3.2 as n increases and ϵ
decreases.
Finally, we bound the performance of the preferential round-robin algorithm.
Theorem 3.3. The preferential round-robin algorithm with parameter λ ∈ (0, 1) has competitive
ratio at most min{ λ1 (1 + 2η 2 2 1
n ), 1−λ }. In particular, it is 1−λ -robust and λ -consistent.

7
Proof. This follows from the competitive ratio of SPJF (Lemma 3.2) and the competitive ratio of 2
for round-robin, and by combining the two algorithms using Lemma 3.1.

Setting λ > 0.5 gives an algorithm that beats the round-robin ratio of 2 in the case of sufficiently
good predictions. For the special case of zero prediction errors (or, more generally, if the order of
jobs sorted by yj is the same as that sorted by xj ), we can obtain an improved competitive ratio of
1+λ
2λ via a more sophisticated analysis.
Theorem 3.4. The preferential round-robin algorithm with parameter λ ∈ (0, 1) has competitive
ratio at most ( 1+λ
2λ ) when η = 0.

Proof. Suppose w.l.o.g. that the jobs are sorted in non-decreasing job lengths (both actual and
predicted), i.e. x1 ≤ · · · ≤ xn and y1 ≤ · · · ≤ yn . Since the optimal solution schedules the jobs
sequentially, we have
n
X n
X X
OPT = (n − j + 1)xj = xj + xi . (1)
j=1 j=1 (i,j):i<j

We call a job active if it has not completed yet. When there are k active jobs, the preferential
round-robin algorithm executes all active jobs at a rate of 1−λk , and the active job with the shortest
predicted processing time (we call this job current) at an additional rate of λ. Note that each job j
finishes while being the current job. This can be shown inductively: suppose job j − 1 finishes at time
t. Then by time t, job j has received strictly less processing than j − 1, but its size is at least as big.
So it has some processing remaining, which means that it becomes current at time t and stays current
until completion. Let phase k of the algorithm denote the interval of time when job k is current.
For any pair of jobs (i, j), define d(i, j) as the amount of job i that has been executed before the
completion time of j. In other words, d(i, j) is the amount of time by which i delays j. We can now
express the cost of our algorithm as
n
X X X
ALG = xj + d(i, j) + d(j, i). (2)
j=1 (i,j):i<j (i,j):i<j

If i < j, as job i completes before job j, we have d(i, j) = xi . To compute the last term in (2),
consider any phase k, and let tk denote its length. In this phase, the current job k executes at a rate of
at least λ, which implies that tk ≤ xλk . During phase k, jobs {k + 1, ..., n} receive tn−k+1
k (1−λ)
amount
of processing each. Such a job k + i delays i jobs with smaller indices, namely {k, ..., k + i − 1}.
Let dk (i, j) denote the delay in phase k:
n−k
X tk (1 − λ) X tk (1 − λ)(n − k) xk (1 − λ)(n − k)
dk (j, i) = · i = ≤ .
n − k + 1 i=1 2 2λ
(i,j):i<j

Substituting back into Equation (2),

n
X X n
X X
ALG = xj + d(i, j) + dk (j, i)
j=1 (i,j):i<j (i,j):i<j k=1
n n n
X X X xk (1 − λ)(n − k) X xk (1 − λ)(n − k)
≤ xj + xi + = OPT +
j=1
2λ 2λ
(i,j):i<j k=1 k=1
n
1−λ X 1−λ 1+λ
≤ OPT + xk (n − k + 1) = OPT + OPT = OPT,
2λ 2λ 2λ
k=1

using Equation (1) for the last line.

8
(a) Ski Rental (b) Non-clairvoyant Scheduling
Figure 2: Average competitive ratio over varying prediction errors.

4 Experimental results
4.1 Ski rental

We test the performance of our algorithms for the ski rental problem via simulations. For all
experiments, we set the cost of buying to b = 100 and the actual number of skiing days x is a
uniformly drawn integer from [1, 4b]. The predicted number of days y is simulated as y = x + ϵ
where ϵ is drawn from a normal distribution with mean 0 and standard deviation σ. We consider
both randomized and deterministic algorithms for two different values of the trade-off parameter λ.
Recall that by setting λ = 1, our algorithms ignore the predictions and reduce to the known optimal
algorithms (deterministic and randomized, respectively) [12]. We set λ = 0.5 for the deterministic
algorithm that guarantees a worst-case competitive ratio of 3. In order to obtain the same worst-
case competitive ratio, we set λ = ln(3/2) for the randomized algorithm. For each σ, we plot the
average competitive ratio obtained by each algorithm over 10000 independent trials in Figure 2a. We
observe that even for rather large prediction errors, our algorithms perform significantly better than
their classical counterparts. In particular, even our deterministic algorithm that uses the predictions
performs better than the classical randomized algorithm for errors up to a standard deviation of 2b.

4.2 Non-clairvoyant scheduling

We generate a synthetic dataset with 50 jobs where the N min max mean σ
processing time of each job is sampled independently from 50 1 22352 2168 5475.42
a Pareto distribution with an exponent of α = 1.1. (As
observed in prior work [7, 8, 2], job size distributions Table 1: Statistics of job lengths.
in a number of settings are well-modeled by a Pareto
distribution with α close to 1.) Pertinent characteristics of the generated dataset are presented
in Table 1. In order to simulate predicted job lengths and compare the performance of the different
algorithms with respect to the errors in the prediction, we set the predicted job length yi = xi + ϵi ,
where ϵi is drawn from a normal distribution with mean zero and standard deviation σ.
Figure 2b shows the competitive ratio of the three algorithms versus varying prediction errors. For a
parameter σ, we plot the average competitive ratio over 1000 independent trials where the prediction
error has the specified standard deviation. As expected, the naïve strategy of scheduling jobs in
non-decreasing order of their predicted job lengths (SPJF) performs very well when the errors are
low, but quickly deteriorates as the errors increase. In contrast, our preferential round-robin algorithm
(with λ = 0.5) performs no worse than round-robin even when the predictions have very large error.

5 Conclusions
In this paper we furthered the study of using ML predictions to provably improve the worst-case
performance of online algorithms. There are many other important online algorithms including
k-server, portfolio optimization, etc, and it will be interesting to see if predictions can be useful for

9
them as well. Another research direction would be to use the error distribution of the ML predictor to
further improve the bounds.

Acknowledgements
We thank Chenyang Xu for pointing out a bug in the conference version [18] and thank Erik Vee for
his help in fixing the bug.

References
[1] Nikhil Bansal, Kedar Dhamdhere, Jochen Könemann, and Amitabh Sinha. Non-clairvoyant
scheduling for minimizing mean slowdown. Algorithmica, 40(4):305–318, 2004.
[2] Nikhil Bansal and Mor Harchol-Balter. Analysis of SRPT scheduling: Investigating unfairness.
In SIGMETRICS, pages 279–290, 2001.
[3] Luca Becchetti and Stefano Leonardi. Non-clairvoyant scheduling to minimize the average flow
time on single and parallel machines. In STOC, pages 94–103, 2001.
[4] Russell Bent and Pascal Van Hentenryck. Online Stochastic Combinatorial Optimization. MIT
Press, 2009.
[5] A. Borodin and R. El-Yaniv. Online Computation and Competitive Analysis. Cambridge
University Press, 1998.
[6] Sebastien Bubeck and Aleksandrs Slivkins. The best of both worlds: Stochastic and adversarial
bandits. In COLT, pages 42.1–42.23, 2012.
[7] Mark E Crovella and Azer Bestavros. Self-similarity in world wide web traffic: Evidence and
possible causes. Transactions on Networking, 5(6):835–846, 1997.
[8] Mor Harchol-Balter and Allen B Downey. Exploiting process lifetime distributions for dynamic
load balancing. ACM TOCS, 15(3):253–285, 1997.
[9] Sungjin Im, Janardhan Kulkarni, and Kamesh Munagala. Competitive algorithms from competi-
tive equilibria: Non-clairvoyant scheduling under polyhedral constraints. J. ACM, 65(1):3:1–
3:33, 2017.
[10] Sungjin Im, Janardhan Kulkarni, Kamesh Munagala, and Kirk Pruhs. Selfishmigrate: A scalable
algorithm for non-clairvoyantly scheduling heterogeneous processors. In FOCS, pages 531–540,
2014.
[11] Anna R Karlin, Claire Kenyon, and Dana Randall. Dynamic TCP acknowledgement and other
stories about e/(e − 1). Algorithmica, 36(3):209–224, 2003.
[12] Anna R. Karlin, Mark S. Manasse, Lyle A. McGeoch, and Susan Owicki. Competitive random-
ized algorithms for nonuniform problems. Algorithmica, 11(6):542–571, 1994.
[13] Anna R. Karlin, Mark S. Manasse, Larry Rudolph, and Daniel Dominic Sleator. Competitive
snoopy caching. Algorithmica, 3:77–119, 1988.
[14] Ali Khanafer, Murali Kodialam, and Krishna P.N. Puttaswamy. The constrained ski-rental
problem and its application to online cloud cost optimization. In INFOCOM, pages 1492–1500,
2013.
[15] Rohan Kodialam. Competitive algorithms for an online rent or buy problem with variable
demand. In SIAM Undergraduate Research Online, volume 7, pages 233–245, 2014.
[16] Panos Kouvelis and Gang Yu. Robust Discrete Optimization and its Applications, volume 14.
Springer Science & Business Media, 2013.
[17] Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, and Neoklis Polyzotis. The case for learned
index structures. In SIGMOD, pages 489–504, 2018.
[18] Ravi Kumar, Manish Purohit, and Zoya Svitkina. Improving online algorithms via ML predic-
tions. In NeurIPS, pages 9684–9693, 2018.
[19] Thodoris Lykouris and Sergei Vassilvitskii. Competitive caching with machine learned advice.
In ICML, pages 3302–3311, 2018.

10
[20] Mohammad Mahdian, Hamid Nazerzadeh, and Amin Saberi. Online optimization with uncertain
information. ACM TALG, 8(1):2:1–2:29, 2012.
[21] Andres Muñoz Medina and Sergei Vassilvitskii. Revenue optimization with approximate bid
predictions. In NIPS, pages 1856–1864, 2017.
[22] Adam Meyerson. The parking permit problem. In FOCS, pages 274–282, 2005.
[23] Vahab S. Mirrokni, Shayan Oveis Gharan, and Morteza Zadimoghaddam. Simultaneous
approximations for adversarial and stochastic online budgeted allocation. In SODA, pages
1690–1701, 2012.
[24] Rajeev Motwani, Steven Phillips, and Eric Torng. Nonclairvoyant scheduling. Theoretical
Computer Science, 130(1):17–47, 1994.

A Deferred Proofs
We first state a few simple observations that will be useful.
Lemma A.1. For 0 < x ≤ 1,
1
(i) ex− x ≤ 1.
1
(ii) x
e − e− x ≥ 0.
1 e−x
(iii) 1 − x + x ≥ xe .

1
Proof. (i) For x ∈ (0, 1], we have x ≤ 1/x =⇒ ex ≤ e1/x and hence ex− x ≤ 1.
(ii) For any y ≤ 1, we have 1 − y ≤ e−y =⇒ ey ≤ 1
1−y . Showing (ii) is equivalent to showing
1 1 1
1− x
x−e ≥ 0. But since 1 − x ≤ 1, we can substitute y = 1 − x to get
1 1
x − e1− x ≥ x − =0
1 − (1 − x1 )

−x
(iii) We first show that h(x) := 1 − x1 + e x is concave for x ≥ 0 (since limx→0 h(x) = 0, we define
−x 2 x
h(0) = 0 to make it continuous at 0). Indeed, consider h′′ (x) = e (x +2x+2−2e
x3
)
. Note that for
all x ≥ 0, we have ex ≥ 1 + x + x2 /2, and hence we have h′′ (x) ≤ 0. Thus h(x) is concave in the
range x ≥ 0. By concavity, we get that for all 0 < x ≤ 1, h(x) ≥ (1 − x) · h(0) + x · h(1) = xe as
desired.
Lemma A.2. Let b ≥ 2 be an integer and let λ ∈ (1/b, 1) be a real number. Then,
1/λ + 1/b 1 + 1/b
−1/λ
≤ .
1−e 1 − e−(λ−1/b)

Proof. For convenience, let y = 1/b and rearrange the terms so that the lemma statement is equivalent
to showing the following, subject to λ ∈ (y, 1).

(1 + y)(1 − e−1/λ ) − (1/λ + y) + (1/λ + y)ey−λ ≥ 0.

Note that we used λ ≥ y here while rearranging the terms. Using ey ≥ 1 + y, it instead suffices to
show the following inequality.

(1 + y)(1 − e−1/λ ) − (1/λ + y) + (1/λ + y)(1 + y)e−λ ≥ 0. (3)

The LHS of (3) is a quadratic in y, written as:

f (y) := y 2 (e−λ ) + y((1/λ + 1)e−λ − e−1/λ ) + (1 − e−1/λ − 1/λ + e−λ /λ). (4)

11
The goal is to show (4) is non-negative when 1 ≥ λ ≥ y ≥ 0. To do this, we minimize f (y) subject
to 0 ≤ y ≤ λ.
The minimum of f (y) is attained at
∂f /∂y = 0 = 2y(e−λ ) + (1/λ + 1)e−λ − e−1/λ ,
yielding
eλ−1/λ − (1/λ + 1) Lemma A.1(i)
ymin = ≤ 0.
2
Consequently, subject to the constraint that y ≥ 0, the minimum of f (y) is attained at y = 0.
Plugging in y = 0 in (4), we get
1 1 e−λ Lemma A.1(iii) λ 1 Lemma A.1(ii)
f (y) ≥ f (0) = 1 − e− λ − + ≥ − e− λ ≥ 0.
λ λ e
This completes the proof.

Decision-Focused Learning and its Applications in Operations Management
No ratings yet
Decision-Focused Learning and its Applications in Operations Management
191 pages
2112.11927v2
No ratings yet
2112.11927v2
24 pages
AIML internship report
No ratings yet
AIML internship report
38 pages
pd-survey
No ratings yet
pd-survey
174 pages
Cost-sensitive reinforcement learning for credit risk
No ratings yet
Cost-sensitive reinforcement learning for credit risk
19 pages
Online Algorithm Short note
No ratings yet
Online Algorithm Short note
30 pages
8.Online Algorithms
No ratings yet
8.Online Algorithms
45 pages
Student Project Online K-Means Clustering
No ratings yet
Student Project Online K-Means Clustering
11 pages
RM_Notes-1
No ratings yet
RM_Notes-1
9 pages
5 - Online - Algorithms in Algorithms
No ratings yet
5 - Online - Algorithms in Algorithms
66 pages
Lecture 4 - CS50's Introduction to Artificial Intelligence with Python
No ratings yet
Lecture 4 - CS50's Introduction to Artificial Intelligence with Python
17 pages
a17_journal[1].docxnew
No ratings yet
a17_journal[1].docxnew
9 pages
IJRPR22505
No ratings yet
IJRPR22505
3 pages
0081
No ratings yet
0081
7 pages
High-Dimensional Prediction For Sequential Decision Making
No ratings yet
High-Dimensional Prediction For Sequential Decision Making
74 pages
Cluster-Then-Predict and Predictive Algorithms Log
No ratings yet
Cluster-Then-Predict and Predictive Algorithms Log
5 pages
26414-Article Text-30477-1-2-20230626
No ratings yet
26414-Article Text-30477-1-2-20230626
8 pages
Learning in Budgeted Auctions With Spacing Objectives
No ratings yet
Learning in Budgeted Auctions With Spacing Objectives
53 pages
Lecture 6 - Online - Algorithms
No ratings yet
Lecture 6 - Online - Algorithms
25 pages
Unsuperviser Machine Learning For Maintenance Optimization
No ratings yet
Unsuperviser Machine Learning For Maintenance Optimization
12 pages
SSRN Id4269263
No ratings yet
SSRN Id4269263
5 pages
ملخص شابتر 2 الصيانه
No ratings yet
ملخص شابتر 2 الصيانه
7 pages
qg_cnd_skb7
No ratings yet
qg_cnd_skb7
84 pages
KMBN MK02 UNIT 3
100% (1)
KMBN MK02 UNIT 3
29 pages
AUU Lec2
No ratings yet
AUU Lec2
4 pages
models
No ratings yet
models
5 pages
Online Passive-Aggressive Algorithms
No ratings yet
Online Passive-Aggressive Algorithms
35 pages
G710_User_Manual
No ratings yet
G710_User_Manual
1 page
Draft ch1 8
No ratings yet
Draft ch1 8
105 pages
Rendell Henry 2009 Brain Imp-1
No ratings yet
Rendell Henry 2009 Brain Imp-1
10 pages
A Network Airline Revenue Management Fra PDF
No ratings yet
A Network Airline Revenue Management Fra PDF
29 pages
Beyond Worst-Case Analysis in Combinatorial Optimization: Colin White January 10, 2018
No ratings yet
Beyond Worst-Case Analysis in Combinatorial Optimization: Colin White January 10, 2018
30 pages
Online Algorithms For Rent or Buy With Expert Advice
No ratings yet
Online Algorithms For Rent or Buy With Expert Advice
9 pages
Unsolved Problems in ML Safety: Dan Hendrycks Nicholas Carlini John Schulman Jacob Steinhardt
No ratings yet
Unsolved Problems in ML Safety: Dan Hendrycks Nicholas Carlini John Schulman Jacob Steinhardt
28 pages
s00521-022-07143-2
No ratings yet
s00521-022-07143-2
19 pages
Easychair Preprint: Vinod Kimbhaune, Harshil Donga, Asutosh Trivedi, Sonam Mahajan and Viraj Mahajan
No ratings yet
Easychair Preprint: Vinod Kimbhaune, Harshil Donga, Asutosh Trivedi, Sonam Mahajan and Viraj Mahajan
5 pages
Data Mining Techniques For Sales Forecastings
No ratings yet
Data Mining Techniques For Sales Forecastings
4 pages
Average Case 1
No ratings yet
Average Case 1
59 pages
An Alternative Ranking Problem For Search Engines: 1 Motivation
No ratings yet
An Alternative Ranking Problem For Search Engines: 1 Motivation
22 pages
1896-Document Upload-6001-1-10-20201102
No ratings yet
1896-Document Upload-6001-1-10-20201102
9 pages
product_presentation_cns_mcch01
No ratings yet
product_presentation_cns_mcch01
10 pages
Competitive Analysis: Steven Skiena
No ratings yet
Competitive Analysis: Steven Skiena
17 pages
Google Gemini for Advanced Users: Google Gemini
From Everand
Google Gemini for Advanced Users: Google Gemini
Daniel Fankhauser
No ratings yet
Building An Algorithm For Predictive Maintenance
No ratings yet
Building An Algorithm For Predictive Maintenance
13 pages
Haftom Aregawi
No ratings yet
Haftom Aregawi
88 pages
Mixtures of Gaussian Process Models For Human Pose Estimation
No ratings yet
Mixtures of Gaussian Process Models For Human Pose Estimation
9 pages
Implementation of Flight Fare Prediction System Using Machine Learning
No ratings yet
Implementation of Flight Fare Prediction System Using Machine Learning
11 pages
Weather Aus
No ratings yet
Weather Aus
6 pages
Flight Fare Prediction Using Machine Learning Approach
No ratings yet
Flight Fare Prediction Using Machine Learning Approach
5 pages
Solar Power Forecasting With Machine Learning Techniques: Emil Isaksson Mikael Karpe Conde
No ratings yet
Solar Power Forecasting With Machine Learning Techniques: Emil Isaksson Mikael Karpe Conde
64 pages
MAV - CW1 (S) - Summative Assessment - Skywards International Airlines Report v11
No ratings yet
MAV - CW1 (S) - Summative Assessment - Skywards International Airlines Report v11
13 pages
Chapter-2-RRL (3)
No ratings yet
Chapter-2-RRL (3)
22 pages
Iot and Machine Learning
No ratings yet
Iot and Machine Learning
12 pages
rs20378_cnd_gpw04
No ratings yet
rs20378_cnd_gpw04
2 pages
Mastering Concurrency and Multithreading in C++: Unlock the Secrets of Expert-Level Skills
From Everand
Mastering Concurrency and Multithreading in C++: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
MIT-license-and-release-form-0
No ratings yet
MIT-license-and-release-form-0
2 pages
An Artificial Intelligence Neural Network Predictive Model For Anomaly Detection and Monitoring of Wind Turbines Using SCADA Data
No ratings yet
An Artificial Intelligence Neural Network Predictive Model For Anomaly Detection and Monitoring of Wind Turbines Using SCADA Data
15 pages
Improving Blasting Operations Using Data
No ratings yet
Improving Blasting Operations Using Data
7 pages
Chapters PDF
No ratings yet
Chapters PDF
51 pages
rs12255_cnd_sgm04rgb
No ratings yet
rs12255_cnd_sgm04rgb
1 page
Forecasting Fundamentals
No ratings yet
Forecasting Fundamentals
44 pages
5, Becoming an expert
No ratings yet
5, Becoming an expert
12 pages
Runtime PDF
No ratings yet
Runtime PDF
11 pages
MIT_License
No ratings yet
MIT_License
1 page
MIT-LICENSE
No ratings yet
MIT-LICENSE
1 page
Protein Structure Prediction Thesis
100% (3)
Protein Structure Prediction Thesis
8 pages
AAscript
No ratings yet
AAscript
158 pages
Baby Mozart 2008
No ratings yet
Baby Mozart 2008
28 pages
AI sample paper -1
No ratings yet
AI sample paper -1
10 pages
Expert Systems With Applications Chakraborty Et Al 2021
No ratings yet
Expert Systems With Applications Chakraborty Et Al 2021
11 pages
Transparency and Algorithmic Governance PDF
No ratings yet
Transparency and Algorithmic Governance PDF
57 pages
Lesson Plan On Making Predictions
100% (2)
Lesson Plan On Making Predictions
17 pages
Modern C++23 QuickStart Pro
From Everand
Modern C++23 QuickStart Pro
Jarek Thalor
No ratings yet
Modern C++23 QuickStart Pro: Advanced programming including variadic templates, lambdas, async IO, multithreading and thread sync
From Everand
Modern C++23 QuickStart Pro: Advanced programming including variadic templates, lambdas, async IO, multithreading and thread sync
Jarek Thalor
No ratings yet
Size Mix PDF
No ratings yet
Size Mix PDF
20 pages
Advances in Web Inteligent-2
No ratings yet
Advances in Web Inteligent-2
190 pages
PISA 2024 Science Strategic Vision Proposal
No ratings yet
PISA 2024 Science Strategic Vision Proposal
28 pages
ML Word To PDF
No ratings yet
ML Word To PDF
229 pages
Building Scalable Systems with C: Optimizing Performance and Portability
From Everand
Building Scalable Systems with C: Optimizing Performance and Portability
Larry Jones
No ratings yet
Machine Learning Curriculum Berkley
100% (1)
Machine Learning Curriculum Berkley
12 pages
Deltares Piping Paper Sellmeijer
No ratings yet
Deltares Piping Paper Sellmeijer
6 pages
Coomer - Three Modes of Inquiry
No ratings yet
Coomer - Three Modes of Inquiry
4 pages
Investigación de Operaciones Introducción
No ratings yet
Investigación de Operaciones Introducción
26 pages
Physics Unit 1 Review Solutions
No ratings yet
Physics Unit 1 Review Solutions
2 pages
Machine Learning Models For Forecasting and Estimation of Business Operations
No ratings yet
Machine Learning Models For Forecasting and Estimation of Business Operations
8 pages
GE Digital Twin Overview and Tutorial - RRI v3 PDF
100% (1)
GE Digital Twin Overview and Tutorial - RRI v3 PDF
45 pages
Practical C++ Machine Learning: Hands-on strategies for developing simple machine learning models using C++ data structures and libraries
From Everand
Practical C++ Machine Learning: Hands-on strategies for developing simple machine learning models using C++ data structures and libraries
Anais Sutherland
No ratings yet
Food Safety Risk Assessment Tool
100% (3)
Food Safety Risk Assessment Tool
15 pages
Mastering Dynamic Programming in Python
From Everand
Mastering Dynamic Programming in Python
Ed A Norex
No ratings yet
Industrialisation and Tunnelling: The Italian Experience: Pietro Lunardi, Lunardi's Geoengineering Office, Milan, Italy
No ratings yet
Industrialisation and Tunnelling: The Italian Experience: Pietro Lunardi, Lunardi's Geoengineering Office, Milan, Italy
0 pages
Design and Analysis of Algorithms: 1, #1
From Everand
Design and Analysis of Algorithms: 1, #1
S. R. Jena
No ratings yet
ASP.NET Core 1.0 High Performance
From Everand
ASP.NET Core 1.0 High Performance
James Singleton
No ratings yet
Interpretable Machine Learning PDF
100% (2)
Interpretable Machine Learning PDF
251 pages
Case Study NIKE
No ratings yet
Case Study NIKE
2 pages
Python Machine Learning: Machine Learning Algorithms for Beginners - Data Management and Analytics for Approaching Deep Learning and Neural Networks from Scratch
From Everand
Python Machine Learning: Machine Learning Algorithms for Beginners - Data Management and Analytics for Approaching Deep Learning and Neural Networks from Scratch
Ahmed Ph. Abbasi
No ratings yet
Updated Thesis 1
No ratings yet
Updated Thesis 1
20 pages
Defect Prediction in Software Development & Maintainence
From Everand
Defect Prediction in Software Development & Maintainence
Rudra Kumar
No ratings yet
Introduction to Quantum Computing & Machine Learning Technologies: 1, #1
From Everand
Introduction to Quantum Computing & Machine Learning Technologies: 1, #1
M. Sreedevi
No ratings yet
Multi View Three Dimensional Reconstruction: Advanced Techniques for Spatial Perception in Computer Vision
From Everand
Multi View Three Dimensional Reconstruction: Advanced Techniques for Spatial Perception in Computer Vision
Fouad Sabry
No ratings yet
Computational Geometry: Exploring Geometric Insights for Computer Vision
From Everand
Computational Geometry: Exploring Geometric Insights for Computer Vision
Fouad Sabry
No ratings yet

Improving Online Algorithms Via ML Predictions: Ravi Kumar

Uploaded by

Improving Online Algorithms Via ML Predictions: Ravi Kumar

Uploaded by

Improving Online Algorithms via ML Predictions∗

Ravi Kumar Manish Purohit Zoya Svitkina

2 Ski rental with prediction

2.1 Warmup: A simple consistent, non-robust algorithm

Algorithm 1: A simple 1-consistent algorithm

2.2 A deterministic robust and consistent algorithm

2.3 A randomized robust and consistent algorithm

Algorithm 3: A randomized robust and consistent algorithm

Choose j ∈ {1 . . . k} randomly from the distribution defined by qi ;

Choose j ∈ {1 . . . ℓ} randomly from the distribution defined by ri ;

Theorem 2.3. Algorithm 3 yields a competitive ratio of at most min{ 1−e1+1/b η

3 Non-clairvoyant job scheduling with prediction

3.1 A preferential round-robin algorithm

We next analyze the performance of SPJF.

Substituting back into Equation (2),

using Equation (1) for the last line.

4.2 Non-clairvoyant scheduling

(1 + y)(1 − e−1/λ ) − (1/λ + y) + (1/λ + y)ey−λ ≥ 0.

(1 + y)(1 − e−1/λ ) − (1/λ + y) + (1/λ + y)(1 + y)e−λ ≥ 0. (3)

The LHS of (3) is a quadratic in y, written as:

You might also like