Comp 2673, Spring 2003 - Graph Theory, part IV

Graph Theory, Part V

Dijkstra's Shortest Path Algorithm

Finding the shortest path between two vertices is an important problem in graph theory. In a weighted graph, we're looking for the path for which the sum of the weights of the edges along the path is as small as possible. Because weights often represent distance, we call this the shortest path. In an unweighted graph or directed graph, we're looking for the path with the fewest edges. This problem is important in communications, where information must be sent between nodes in the fastest possible way, and in transportation, where we need to find the most efficient route to transport goods between different locations. It is an example of an optimization problem - we want to find the minimum path length. Optimization problems in the discrete settings of computer science tend to be much more difficult than in the continuous settings you've seen in calculus. For many optimization problems that are simple to state (such as the travelling salesman problem), no efficient solution is known. However, for the shortest-path problem, as long as no edge weight is negative, there turns out to be a good solution - an algorithm called Dijkstra's algorithm.

Say that v₀, v₁, ..., v_n are the vertices in a connected graph. Our goal is to find the shortest path from v₀ to v_i. We'll call the length of this path the distance from v₀ to v_i and write it d(v₀, v_i). As part of this process, we'll end up finding the shortest path from v₀ to many of the other nodes as well (and thus the distance from v₀ to the other nodes).

The algorithm is based on this observation: If the path v₀ → w₁ → w₂ → ... → w_n → v_i is the shortest path from v₀ to v_i, then the path v₀ → w₁ → w₂ → ... → w_n is the shortest path from v₀ to w_n. We will use this idea to build shortest paths starting at v₀ and ending at various vertices. We'll find them in order of each vertex's distance from v₀. First, the distance from v₀ to itself is 0. So v₀ is handled - we'll put it into the list of vertices for which we've found the shortest path. Now find the closest vertex reachable from v₀ with one edge. This means checking for each vertex v_i the following value:

    weight(v₀, v_i)

and choosing the smallest one. Call that vertex w₁. The distance from v₀ to w₁ (written d(v₀, w₁)) is just weight(v₀, w₁). We're now done with w₁, and we move on to the next-closest vertex. The shortest path to it must either pass through w₁ or have no intervening nodes at all. So for each remaining vertex v_i, we'll just check the values

    weight(v₀, v_i)
    d(v₀, w₁) + weight(w₁, v_i)

and pick the smallest. Call this vertex w₂, and we have calculated d(v₀, w₂). We're now done with the vertex w₂. The next step is to find the next-closest vertex. The shortest path to the next-closest vertex has its penultimate (second-to-last) vertex in the list {v₀, w₁, w₂}, the list of done vertices. So we just check, for each remaining vertex v_i, the following values

    weight(v₀, v_i)
    d(v₀, w₁) + weight(w₁, v_i)
    d(v₀, w₂) + weight(w₂, v_i)

and pick the smallest, which we call w₃. Continue this process of choosing the next-closest vertex until you've reached the vertex you're looking for, and you're done.
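The procedure just described can be sketched directly in Python. This is only a minimal sketch, not a definitive implementation: the small graph and the function name shortest_distance are made up for illustration, the graph is stored as a dictionary of dictionaries with graph[u][v] = weight(u, v), and all weights are assumed to be non-negative. At each step it checks d(v₀, w) + weight(w, v_i) for every done vertex w and keeps the smallest value.

```python
import math

# Hypothetical weighted directed graph: graph[u][v] = weight(u, v).
graph = {
    "v0": {"v1": 4, "v2": 1},
    "v1": {"v3": 1},
    "v2": {"v1": 2, "v3": 5},
    "v3": {},
}

def shortest_distance(graph, source, target):
    """Add the next-closest vertex to the done list until target is done."""
    done = {source: 0}  # done vertex -> its final distance from source
    while target not in done:
        best_vertex, best_value = None, math.inf
        # Check d(source, w) + weight(w, v) for each done w and not-done v.
        for w, d_w in done.items():
            for v, weight in graph[w].items():
                if v not in done and d_w + weight < best_value:
                    best_vertex, best_value = v, d_w + weight
        if best_vertex is None:
            return math.inf  # nothing left to reach, so target is unreachable
        done[best_vertex] = best_value
    return done[target]

print(shortest_distance(graph, "v0", "v3"))  # prints 4 (v0 -> v2 -> v1 -> v3)
```

This version rechecks every done vertex at every step, which is simple but wasteful.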

In order to implement this algorithm, we'll have to store some information as we go. There are many ways to organize the information you collect as you proceed, and some of them make the algorithm more efficient. Here is one way. At each step, for each vertex, store both your current best distance to that node and the previous node you should pass through to get to it. At each step, also store your list of vertices that you're done with (those for which you've already found the shortest path). For each vertex, check to see if passing through the latest "done" vertex shortens the path. In other words, if w_latest is the latest "done" vertex, then for each vertex v_i, check d(v₀, w_latest) + weight(w_latest, v_i). If this value is smaller than the current best distance to v_i, then update the distance and previous node for v_i. Finally, go through the list of vertices that have not been handled yet, and choose the closest one. Add it to the list of "done" vertices.
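Here is one way this bookkeeping might look in Python. Treat it as a sketch under stated assumptions rather than the definitive implementation: the function name dijkstra and the dictionary layout are illustrative choices, and weights are assumed non-negative. The figure for the example below isn't available here, so the edge list is an assumption reverse-engineered from the entries in that example's table; the real graph may contain additional edges that never shorten a path.

```python
import math

# ASSUMPTION: edges inferred from the worked table, graph[u][v] = weight(u, v).
graph = {
    "v0": {"v2": 13, "v4": 16, "v5": 8},
    "v1": {}, "v2": {}, "v3": {},
    "v4": {"v3": 5},
    "v5": {"v1": 10, "v3": 17, "v4": 7},
    "v6": {}, "v7": {},
}

def dijkstra(graph, source):
    """Return (dist, prev, done): best distances, previous nodes, done order."""
    dist = {v: math.inf for v in graph}  # current best distance to each vertex
    prev = {v: None for v in graph}      # previous node on the best path so far
    dist[source] = 0
    done = []                            # vertices we are finished with
    latest = source
    while latest is not None:
        done.append(latest)
        # Does passing through the latest done vertex shorten any path?
        for v, weight in graph[latest].items():
            if v not in done and dist[latest] + weight < dist[v]:
                dist[v] = dist[latest] + weight
                prev[v] = latest
        # Choose the closest vertex not handled yet; stop if all are at infinity.
        remaining = [v for v in graph if v not in done and dist[v] < math.inf]
        latest = min(remaining, key=dist.get) if remaining else None
    return dist, prev, done

dist, prev, done = dijkstra(graph, "v0")
print(done)        # ['v0', 'v5', 'v2', 'v4', 'v1', 'v3']
print(dist["v3"])  # 20
```

Scanning the whole vertex list for the closest remaining vertex keeps the code close to the description; a priority queue would be the usual way to speed that step up.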

Here's an example. We'll find the shortest path from v₀ to all other vertices in this graph.

Step  v₀         v₁         v₂         v₃         v₄         v₅         v₆         v₇         Next closest vertex
0     (0, v₀)    (∞, -)     (∞, -)     (∞, -)     (∞, -)     (∞, -)     (∞, -)     (∞, -)     v₀
1                (∞, -)     (13, v₀)   (∞, -)     (16, v₀)   (8, v₀)    (∞, -)     (∞, -)     v₅
2                (18, v₅)   (13, v₀)   (25, v₅)   (15, v₅)              (∞, -)     (∞, -)     v₂
3                (18, v₅)              (25, v₅)   (15, v₅)              (∞, -)     (∞, -)     v₄
4                (18, v₅)              (20, v₄)                         (∞, -)     (∞, -)     v₁
5                                      (20, v₄)                         (∞, -)     (∞, -)     v₃
6                                                                       (∞, -)     (∞, -)

We stop computing when all the remaining vertices have a distance of ∞; none of those vertices is reachable from v₀. The completed table gives the shortest distance from v₀ to every vertex: it is the first number in the last entry of that vertex's column. To find the shortest path itself, we trace backwards through the previous nodes (the second item in each of those entries). Here are the final distances and paths from v₀ to every other vertex (note that the paths are listed in reverse order, from the destination back to v₀):

d(v₀, v₁) = 18, path v₁ → v₅ → v₀
d(v₀, v₂) = 13, path v₂ → v₀
d(v₀, v₃) = 20, path v₃ → v₄ → v₅ → v₀
d(v₀, v₄) = 15, path v₄ → v₅ → v₀
d(v₀, v₅) = 8, path v₅ → v₀
d(v₀, v₆) = ∞, path not available
d(v₀, v₇) = ∞, path not available
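
Tracing a path backwards and then flipping it into forward order is easy to automate. The sketch below assumes the previous-node entries have been copied from the final table into a dictionary; the name path_from_source is hypothetical, and None is used here to mark the vertices that were never reached.

```python
# Previous-node entries read off the final table; None marks a vertex
# that was never reached from v0.
prev = {"v1": "v5", "v2": "v0", "v3": "v4", "v4": "v5", "v5": "v0",
        "v6": None, "v7": None}

def path_from_source(prev, source, target):
    """Follow previous-node links back from target, then reverse the result."""
    path = [target]
    while path[-1] != source:
        step = prev.get(path[-1])
        if step is None:
            return None  # no path from source to target
        path.append(step)
    path.reverse()
    return path

print(path_from_source(prev, "v0", "v3"))  # ['v0', 'v5', 'v4', 'v3']
print(path_from_source(prev, "v0", "v6"))  # None
```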