Computer scienceAlgorithms and Data StructuresAlgorithmsGraph algorithmsShortest path algorithms

Dijkstra's algorithm

13 minutes read

Assume that a weighted graph below represents a map, where the nodes are cities, the edges are roads and the weight of each edge is the distance between two cities:

Suppose we are in the city $A$ and want to take a trip to the city $G$ . To get from $A$ to $G$ as fast as possible, we need to know the shortest path between the cities, that is, the path with the minimum total weight. How can this path be found?

One of the possible approaches is to use Dijkstra's algorithm. It allows us to find the shortest paths from a given node to all other nodes of a graph. Let's consider the algorithm in more detail.

Algorithm description

First, let's agree that we will use the following notations further:

source — the node from which we want to find the shortest paths to all other nodes;
$weight(u, v)$ — the weight of the edge that connects the nodes $u$ and $v$ ;
$dist(v)$ — the current distance from the source to the node $v$ .

Now, Dijkstra's algorithm can be formulated as follows:

Set the current distance to the source to $0$ , for all other nodes assign it to $\infty$ . Mark all nodes as unprocessed.
Find an unprocessed node $u$ with the smallest $dist(u)$ .
For all unprocessed neighbors $v$ of the node $u$ , check whether the distance from $u$ to $v$ is less than the current distance to $v$ . If that is the case, that is $dist(u) + weight(u, v) < dist(v)$ , update the current distance to $v$ to a smaller value.
When all the neighbors of $u$ are considered, mark $u$ as processed.
If there are no unprocessed nodes, the algorithm is finished. Otherwise, go back to the 2nd step.

After all the nodes are processed, each of the current distances $dist(u)$ is the shortest distance from the source to the node $u$ .

Pseudocode

After understanding the core principles of Dijkstra's algorithm, let's explore how the choice of data structure for storing current distances can influence its efficiency. The following pseudocode presents an implementation of Dijkstra's algorithm using a priority queue (binary heap), which as we'll soon explain, provides a more efficient solution for sparse graphs (than the array-based storage for current distances).

// Function to perform Dijkstra's algorithm using a priority queue (binary heap)
function DijkstraPriorityQueue(graph, source):
    // Initialize distances with infinity for all nodes except the source
    distances = {node: INFINITY for node in graph}
    distances = 0

    // So essentially, `distances` stores the following distance values: 
    // distances = {'A': 0, 'B': INFINITY, 'C': INFINITY, ...}
    // Here, node `A` is `source`. 

    // Initialize priority queue with (distance, node) pairs
    priorityQueue = PriorityQueue()
    priorityQueue.enqueue((0, source))

    // Dijkstra's algorithm with priority queue
    while not priorityQueue.isEmpty():
        // Dequeue the node with the smallest current distance
        currentDistance, current = priorityQueue.dequeue()

        // Process the current node
        processNode(current)

        // Update distances to unprocessed neighbors
        for neighbor, weight in graph[current]:
            newDistance = distances[current] + weight
            if newDistance < distances[neighbor]:
                distances[neighbor] = newDistance
                priorityQueue.enqueue((newDistance, neighbor))

// Function to process a node during Dijkstra's algorithm (customize as needed)
function processNode(node):
    // Print or perform any desired operation on the node
    print("Processing node:", node)

While the priority queue-based implementation is efficient for sparse graphs, a different approach using an array can be more suitable for dense graphs. The following pseudocode demonstrates an array-based implementation of Dijkstra's algorithm.

// Function to perform Dijkstra's algorithm on a weighted graph
function Dijkstra(graph, source):
    // Initialize distances with infinity for all nodes except the source
    distances = {node: INFINITY for node in graph}
    distances = 0

    // So essentially, `distances` stores the following distance values: 
    // distances = {'A': 0, 'B': INFINITY, 'C': INFINITY, ...}
    // Here, node `A` is `source`. 

    // Mark all nodes as unprocessed
    unprocessedNodes = set(graph.keys())

    // Dijkstra's algorithm
    while unprocessedNodes is not empty:
        // Find the node with the smallest current distance
        current = findSmallestDistanceNode(unprocessedNodes, distances)

        // Process the current node and mark it as processed
        processNode(current)
        unprocessedNodes.remove(current)

        // Update distances to unprocessed neighbors
        for neighbor, weight in graph[current]:
            if neighbor in unprocessedNodes:
                newDistance = distances[current] + weight
                if newDistance < distances[neighbor]:
                    distances[neighbor] = newDistance

// Function to find the node with the smallest current distance
function findSmallestDistanceNode(nodes, distances):
    smallestNode = null
    smallestDistance = INFINITY

    for node in nodes:
        if distances[node] < smallestDistance:
            smallestNode = node
            smallestDistance = distances[node]

    return smallestNode

// Function to process a node during Dijkstra's algorithm (customize as needed)
function processNode(node):
    // Print or perform any desired operation on the node
    print("Processing node:", node)

Example

Let's apply the algorithm to find the shortest paths from the node $A$ to all other nodes of the graph above. On the figures below, the node that is processed at the current step and the edges incident to the unprocessed neighbors of this node are shown in green. The nodes that are already processed and the edges that correspond to the current shortest paths are shown in blue. The current distances are shown in bold near the nodes.

At the first step, we consider the source node $A$ , since it has the smallest current distance. The node has two neighbors: $B$ and $C$ . The distance from $A$ to $B$ is $dist(A) + weight(A, B) = 0 + 4 = 4$ . Since $4$ is less than $\infty$ , the distance to $B$ is set to $4$ . Similarly, the distance to $C$ is set to $7$ . Then, $A$ is marked as processed.

At the next step, the node $B$ has the smallest distance. The distance from $B$ to $C$ is $dist(B) + weight(B, C) = 4 + 1 = 5$ , which is less than $7$ . So, the distance to $C$ is set to $5$ . Following the same rule, the distances to $D$ and $E$ are set to $7$ and $14$ respectively and then $B$ is marked as processed.

Finding the shortest paths, part 1

Next, the node $C$ has the smallest distance. The distance from $C$ to $D$ is $dist(C) + weight(C, D) = 5 + 8 = 13$ , which is greater than $7$ . So, the distance to $D$ does not need to be updated. The distance from $C$ to $F$ is $10$ , which is less than $\infty$ . So, it is set to $10$ .

At the 4th step, the node $D$ has the smallest distance. The distances from $D$ to $E$ and $F$ are less than the previous distances for these nodes, so they are set to $12$ and $8$ , respectively.

Finding the shortest paths, part 2

Continuing the same process for the remaining nodes, we finally get the graph shown in the figure $8$ .

The resulting tree (shown in blue) is called the shortest path tree. We can use it to reconstruct the shortest path from the source to any other node of the graph. To achieve this, each node should know the previous node on the shortest path from the source to it. The algorithm can be slightly modified to calculate this information.

Once we have obtained the resulting tree, known as the shortest path tree, we can leverage it to reconstruct the shortest path from the source to any other node in the graph. This involves using an additional array, $from$ , which stores the previous node on the shortest path from the source to each node. The algorithm can be slightly modified to calculate this information. The following pseudocode demonstrates how to reconstruct the shortest path using the $from$ array.

// Function to reconstruct the shortest path from the source to a given node
function reconstructShortestPath(from, source, target):
    path = []
    current = target

    // Backtrack from the target to the source using the 'from' array
    while current != source:
        path.add(current)
        current = from[current]

    // Add the source node to complete the path
    path.add(source)

    // Reverse the path to get it in the correct order
    return reverse(path)

// Function to reverse a list
function reverse(lst):
    reversedList = []
    for i from length(lst) - 1 to 0:
        reversedList.add(lst[i])
    return reversedList

In the beginning, we create another array $from$ and fill it with default values. Every time we stay at some vertex $u$ and update the distance to its neighbor $v$ , we should also update $from(v)=u$ , since the current shortest path now comes to $v$ from $u$ .

To get the shortest path from the source to a node $v$ , we start at $v$ , then move to $from(v)$ , then to $from(from(v))$ , and so on, until we get to the source. This gives us the shortest path in the reversed order. For example, the shortest path from $A$ to $G$ is $A \to B \to D \to F \to E \to G.$ The total weight of this path is $4 + 3 + 1 + 3 + 4 = 15$ , which corresponds to $dist(G)$ .

Note that although the graph above is undirected, the same algorithm works for directed graphs as well. Another important detail of Dijkstra's algorithm is that it does not work if a graph contains negative weight edges. The example below illustrates the statement:

A graph containing edges with negative weight

We can see that the final distance from $A$ to $B$ is $2$ , although the path $A \to C \to B$ has a weight of $5 + (-4) = 1$ . The problem is that when the node $C$ was considered, the node $B$ had already been processed and the weight of $B$ could not be updated.

The last thing to mention is that there are a lot of resources that allow visualizing the steps of Dijkstra's algorithm. Those who are interested in a more interactive example of how the algorithm works may check out a visualization.

Complexity analysis

Consider a graph $G$ with $n$ nodes and $m$ edges. The running time of Dijkstra's algorithm for this graph depends on a data structure used for storing the current distances.

One of the possible implementations uses an array $A$ of size $n$ , where $A[i]$ stores the current distance to the node $i$ . In this case, finding the smallest distance requires $O(n)$ operations, since we need to scan the whole array. The update operation can be done in $O(1)$ . Since the total number of searches is $n$ and the total number of updates is no more than $m$ , the overall running time is $O(n^2 + m)$ .

Another approach is to store the distances in a priority queue. If the queue is implemented via a binary heap, searching and updating can be done in $O(\log n)$ . In this case, the running time is $n \cdot \log n + m \cdot \log n = O((n + m) \cdot \log n)$ .

So, which of the implementations is preferable? If $G$ is a dense graph (with lots of edges, $m \approx n^2$ ), then the priority queue based implementation results in $O(n^2 \cdot \log n)$ . In this case, the array-based implementation is a better choice. For sparse graphs, (with few edges, $m \approx n$ ), the priority queue based implementation is more appropriate since for such graphs the running time of the algorithm is $O(n \log n)$ .

111 learners liked this piece of theory. 10 didn't like it. What about you?

Report a typo

Dijkstra's algorithm

Algorithm description

Pseudocode

Example

Complexity analysis

Related topics