Approximation Algorithm (7): Minimizing Sum of Completion Times

Unweighted Version

Minimizing the sum of completion times is a classic scheduling problem. Suppose there are $n$ jobs to be scheduled. Let $r_1, r_2, \ldots, r_n$ be their release dates and $p_1, p_2, \ldots, p_n$ be their processing times. We schedule all $n$ jobs in some order. Let $C_1, C_2, \ldots, C_n$ be the completion times of the jobs in that order. The goal is to find a schedule that minimizes $\sum\limits_{i=1}^n C_i$. The machine is non-preemptive, meaning a job cannot be interrupted once started.

Now we can do a relaxation at this problem by just changing nonpreemptive machine to preemptive machine which means each work may interrupt other works. If we change it to the preemptive machine, we can solve this problem with an algorithm below.

$\operatorname{while}$ there is any work not finished.

$S \leftarrow \{$ work $i$ which didn't finished yet and $r_i \le$ current date $\}$.
Choose a work $i \in S$ that has the smallest left processing time.
Process work $i$ until work $i$ is done or new $r_j$ release.

This is the Shortest Remaining Processing Time (SRPT) rule. It runs in polynomial time: we maintain remaining processing times in a heap to get the minimum efficiently, and preprocess release dates by sorting to detect when new jobs become available.

However, notice that schedule from preemptive machine can’t be run on the nonpreemptive machine. Therefore, we need to make a strict order for the execution. Therefore, we will reorder the index of works to the order of $C_i^P$ which $C_i^P$ is the termination time of preemptive machine. Now after this reordering, we will just do work in the order of $1,2,\cdots,n$.

After this reordering, it will result in $2$-approximation algorithm. The proof is like follow.

Let $C_i^N$ is the termination time of nonpreemptive machine. Since, schedule of nonpreemptive machine can be run on preemptive machine, $\sum\limits_{i=1}^n C_i^P \le \operatorname{OPT}$ such that $\operatorname{OPT}$ is an optimal solution. Now, there is two simple facts that $C^P_i \ge \max\limits_{k=1,\cdots,i}r_k$ and $C^P_i \ge \sum\limits_{k=1}^i p_k$ because it is reordered in the order of $C^P_i$. Notice that if work $j$ is done then every work $i < j$ should be done from the definition. Now, let’s think about work $i$ in the nonpreemptive machine. Then we can know that $i$ should finished at most $\max\limits_{k=1,2,\cdots,i}r_k$ $+$ $\sum\limits_{k=1}^i p_k$. The reason is that work $i$ can only be idle because work $1, 2, \cdots, i - 1$ are still working.

Therefore, $C^N_i$ $\le$ $\max\limits_{k=1,2,\cdots,i}r_k$ $+$ $\sum\limits_{k=1}^i p_k$ $\le$ $2C^P_i$. Which means $\sum\limits_{i=1}^n C^N_i$ $\le$ $2\sum\limits_{i=1}^n C^P_i$ $\le$ $2\operatorname{OPT}$.

Weighted Version

The weighted version is similar to the unweighted version.

There are $n$ jobs with release dates $r_1, \ldots, r_n$, processing times $p_1, \ldots, p_n$, and weights $w_1, \ldots, w_n \ge 0$. Let $C_1, \ldots, C_n$ be the completion times in the chosen order. The goal is to find a schedule that minimizes $\sum\limits_{i=1}^n w_i C_i$. The machine is non-preemptive.

Unlike the unweighted version, the weighted variant cannot be solved with a preemptive machine in polynomial time unless P $=$ NP. Therefore, we will use a relaxation with linear programming in it. Relaxation of this problem is like below.

Minimize $\sum\limits_{i=1}^n w_i C_i$ such that $C_i \ge r_i + p_i$ $\forall i \in N$ and $\sum\limits_{i \in S} p_iC_i \ge \frac{1}{2}\rho(S)^2$ $\forall S \subset N$ which $N = \{1, 2, \cdots, n$}. Objective function is trivial to be “Minimize $\sum\limits_{i=1}^n w_i C_i$” and first constraint is trivial either that “$C_i \ge r_i + p_i$ $\forall i \in N$” from the meaning. How about the second constraint? If you think about any $S \subset N$, each works in $S$ will be ordered in an optimal solution. Therefore, we can imagine that $S^\star$ is a ordered list in the order of an optimal solution from $S$. Then,

$\sum\limits_{i \in S} p_iC_i$
$=$ $\sum\limits_{i \in S^\star} p_iC_i$
$\ge$ $\sum\limits_{i,j \in S^\star, j \le i} p_ip_j$
$=$ $\frac{1}{2}\sum\limits_{i,j \in S^\star, j \le i} p_ip_j$ + $\frac{1}{2}\sum\limits_{i,j \in S^\star, j \le i} p_ip_j$
$=$ $\frac{1}{2}\sum\limits_{i,j \in S^\star, j \le i} p_ip_j$ + $\frac{1}{2}\sum\limits_{j,i \in S^\star, i \le j} p_jp_i$
$=$ $\frac{1}{2}\sum\limits_{i,j \in S^\star, j \le i} p_ip_j$ + $\frac{1}{2}\sum\limits_{j,i \in S^\star, i < j} p_jp_i$ $+$ $\frac{1}{2}\sum\limits_{i \in S^\star} p_ip_i$
$=$ $\frac{1}{2}\sum\limits_{i,j \in S^\star,} p_ip_j$ + $\frac{1}{2}\sum\limits_{i \in S^\star} p_i^2$
$=$ $\frac{1}{2}(\sum\limits_{i \in S^\star} p_i)^2$ + $\frac{1}{2}\sum\limits_{i \in S^\star} p_i^2$
$\ge$ $\frac{1}{2}(\sum\limits_{i \in S^\star} p_i)^2$.

Now let $\rho(S) = \sum\limits_{i \in S} p_i$ then constraint holds.

Now, if we can solve that linear problem in polynomial time. We can make a strict order by termination time $C_i$ like unweighted version did. Then that solution should be in $3\operatorname{OPT}$.

Proof is like follow. Let’s define $C^L_i$ as a termination time founded by linear programming. However we can sort that works in the order of $C^L_i$ so let’s assume so. Which means $C^L_1$ $\le$ $C^L_2$ $\le$ $C^L_3$ $\le$ $\cdots$ $\le$ $C^L_n$. Let’s define one more solution $C^R_i$ as a termination time of nonpreemptive machine’s work $i$ with processing order in $C^L_i$. Then, we can get 2 facts like unweighted version did and we can know other two facts below With these facts.

$\sum\limits_{i = 1}^nC^L_i \le \operatorname{OPT}$
$C^R_i$ $\le$ $\max\limits_{k=1,2,\cdots,i}r_k$ $+$ $\sum\limits_{k=1}^i p_k$
$\max\limits_{k=1,2,\cdots,i}r_k \le C^L_i$
$\frac{1}{2}\sum\limits_{k=1}^i p_k = \frac{1}{2}\rho(\{1,2,\cdots,i\}) \le C^L_i$

Third fact is a trivial because $C^L_i$ is ordered in $i$ and this means $C^L_i \ge C^L_j$ $\forall 1 \le j \le i$ and from the constraint $C^L_i \ge C^L_j \ge r_j$ $\forall 1 \le j \le i$. Similarly, the fourth fact can be proven as follows. From the constaint, $\frac{1}{2}\rho(\{1,2,\cdots,i\})^2$ $\le$ $\sum\limits_{j = 1}^{i} p_jC^L_j$.
Then, $\frac{1}{2}\rho(\{1,2,\cdots,i\})^2$ $\le$ $\sum\limits_{j = 1}^{i} p_jC^L_j$ $\le$ $\sum\limits_{j = 1}^{i} p_jC^L_i$ $=$ $C^L_i\sum\limits_{j = 1}^{i} p_j$ $=$ $C^L_i\rho(\{1,2,\cdots,i\})$.
As a result, $\frac{1}{2}\rho(\{1,2,\cdots,i\})$ $\le$ $C^L_i$ by dividing $\rho(\{1,2,\cdots,i\})$ for each side of $\frac{1}{2}\rho(\{1,2,\cdots,i\})^2$ $\le$ $C^L_i\rho(\{1,2,\cdots,i\})$.
Therefore, fact 3, 4 has been proven.
Which means $C^R_i$ $\le$ $\max\limits_{k=1,2,\cdots,i}r_k$ $+$ $\sum\limits_{k=1}^i p_k$ $\le$ $C^L_i + 2C^L_i$ $=$ $3C^L_i$.
Therefore, $\sum\limits_{i=1}^n C^R_i$ $\le$ $\sum\limits_{i=1}^n 3C^L_i$ $\le$ $3\operatorname{OPT}$

We have not yet discussed the running time. The number of constraints appears exponential, but the ellipsoid method solves LP in polynomial time regardless of the number of constraints.

To solve this problem with ellipsoid method, we need a seperation oracle that runs in the polynomial time and checks whether a solution is a feasible or not. If we have a seperation oracle, ellipsoid method can solve the linear programming in $O(poly(log u, n))$ which $u$ is bit in use for saving the data.

However, we need a separation oracle that checks feasibility efficiently, since the number of constraints is exponential. Therefore, we claim below.

“Let’s define $S_1 = \{1$}, $S_2 = \{1, 2$}, $\cdots$, $S_n = \{1,2,\cdots,n$}. If every variable satisfies constraints for all $S_i$s then that solution statisfies all constraints.”

Before proving this, we will check brief 2 facts below. Notice that S is an arbitrary set in $N$, $a$ is in $S$ and $b$ isn’t in S.

$\rho(S)^2 - \rho(S - \{a\})^2$ $=$ $2p_a\rho(S - \{a\}) + p_a^2$ $=$ $p_a(2\rho(S - \{a\}) + p_a)$
$\rho(S + \{b\})^2$ - $\rho(S)^2$ $=$ $2p_b\rho(S) + p_b^2$ $=$ $p_b(2\rho(S) + p_b)$

Proof is simple that $\rho(S)^2$ $=$ $\rho(S- \{a\} + \{a\})^2$ $=$ $(\rho(S- \{a\}) + p_a)^2$ $=$ $\rho(S - \{a\})^2 + 2p_a\rho(S - \{a\}) + p_a^2$ and simillar for fact 2.
Now, let’s think about $f(S) = \sum\limits_{i \in S}p_iC_i - \frac{1}{2}\rho(S)$.
Then with the fact above, $f(S - \{a\}) - f(S)$ $=$ $-p_aC_a + \frac{1}{2}(\rho(S)^2 - \rho(S - \{a\})^2)$ $=$ $-p_aC_a + \frac{1}{2}p_a(2\rho(S - \{a\}) + p_a)$ $=$ $p_a(-C_a + \rho(S - \{a\}) + \frac{1}{2}p_a)$. Simillary $f(S + \{b\}) - f(S)$ $=$ $p_bC_b -\frac{1}{2}(\rho(S + \{b\}) - \rho(S))$ $=$ $p_bC_b - \frac{1}{2}p_b(2\rho(S) + p_b)$ $=$ $p_b(C_b - \rho(S) - \frac{1}{2}p_b)$.

As a result, removing $a$ from $S$ will decrease $f(S)$ if $C_a > \rho(S - \{a\}) + \frac{1}{2}p_a$ and adding $b$ to $S$ will decrease $f(S)$ if $C_b < \rho(S) + \frac{1}{2}p_b$.

Now let’s assume that every variable satisfies for all $S_i$s but there is some constraint that doesn’t matches. Which means $\sum\limits_{j \in X}p_jC_j < \frac{1}{2}\rho(X)^2$. Notice that $f(X) < 0$. Now, let’s remove the biggest $j$ in $X$ if removing $j$ decreases $f(X)$. Let’s define $S_h$ as the termination of such an operation. Then, it will stop when $C_h \le \rho(S_h - \{h\}) + \frac{1}{2}p_h$ for the biggest $j = h$ in $X$. However we can add any $1 \le j < h$ to decrease $f(S_h)$ because $C_j$ $\le$ $C_h$ $\le$ $\rho(S_h - \{h\}) + \frac{1}{2}p_h$ $<$ $\rho(S_h - \{h\}) + p_h$ $=$ $\rho(S_h)$ $<$ $\rho(S_h) + \frac{1}{2}p_j$. If we define $S_e=\{1,2,\cdots,h$} then, $f(S_e) \le f(S_h) \le f(X) < 0$. However, it can’t be true because $f(S_e) = \sum\limits_{i = 1}^e p_iC_i - \frac{1}{2}\rho(S_e) \ge 0$ from the algorithm. As a result, we don’t need to see all the contraint but only for $S = \{1,2,\cdots,i$} is enough.