
Breadth-First Search

Now that we have reviewed the basic terminology associated with graphs, the first algorithm we will investigate is breadth-first search. This algorithm is used to find the shortest paths (by number of edges) to every reachable vertex from a given one.

Problem: Given a source vertex s, find the shortest paths (in terms of number of edges) to every reachable vertex from s.

Breadth-first search finds all vertices reachable at a distance k from s before discovering any vertex at a distance k+1. Ultimately, the algorithm produces a breadth-first tree with s as the root.

During the execution of the algorithm, each vertex u is assigned a color (denoted by u.color). The color represents the vertex’s current state as follows:

white - the vertex is undiscovered (i.e. currently no path has been found to the vertex)
gray - the vertex has been discovered and is on the frontier, i.e. its adjacency list has not yet been fully scanned, so it may still lead to undiscovered vertices
black - the vertex has been discovered and has been completely searched

The algorithm also uses two additional fields for each vertex u:

u.π - predecessor vertex

u.d - distance when the vertex is first discovered (and is subsequently the shortest distance from the source)

We will employ a queue Q which will track which vertices are currently under discovery. Thus vertices that have not yet been placed in Q will be white, those that are in Q will be gray, and those that have been removed from Q will be black.

BFS Algorithm

The algorithm for breadth-first search is

BFS(G,s)
   for each vertex u ∈ G.V - {s}
      u.color = WHITE
      u.d = INF
      u.π = NIL
   s.color = GRAY
   s.d = 0
   s.π = NIL
   Q = ∅
   ENQUEUE(Q,s)
   while Q ≠ ∅
      u = DEQUEUE(Q)
      for each v ∈ G.Adj[u]
         if v.color == WHITE
            v.color = GRAY
            v.d = u.d + 1
            v.π = u
            ENQUEUE(Q,v)
      u.color = BLACK

Basically the algorithm performs the following operations:

1. Initialize Q with the source vertex s
2. Dequeue the head vertex u from Q
3. Enqueue all white vertices adjacent to u, marking them as gray, setting their distance to u’s distance + 1, and setting their π to u; then mark u as black
4. Repeat steps 2-3 until Q = ∅
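
The operations above can be sketched in Python. This is a minimal illustration under stated assumptions, not a definitive implementation: the graph is assumed to be a dict mapping each vertex to a list of its neighbors, and the names bfs, dist, pred, and the example graph are hypothetical choices made here. Instead of storing colors explicitly, an infinite distance plays the role of white, membership in the queue corresponds to gray, and a vertex is black once its adjacency list has been scanned.

```python
from collections import deque
import math

def bfs(adj, s):
    """Breadth-first search from s over the adjacency-list dict adj.

    Returns (dist, pred): dist[v] is the number of edges on a shortest
    path from s to v (math.inf if v is unreachable), and pred[v] is the
    predecessor of v in the breadth-first tree (None for s and for
    unreachable vertices).
    """
    dist = {u: math.inf for u in adj}   # inf plays the role of WHITE
    pred = {u: None for u in adj}
    dist[s] = 0
    queue = deque([s])                  # vertices in the queue are GRAY
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if dist[v] == math.inf:     # v is still undiscovered
                dist[v] = dist[u] + 1
                pred[v] = u
                queue.append(v)
        # u is now BLACK: its adjacency list has been fully scanned
    return dist, pred

# Hypothetical example graph (undirected, stored as symmetric lists)
graph = {
    "s": ["a", "b"],
    "a": ["s", "c"],
    "b": ["s", "c"],
    "c": ["a", "b", "d"],
    "d": ["c"],
    "e": [],            # not reachable from s
}
dist, pred = bfs(graph, "s")
```

In this example, c is adjacent to both a and b, which are at distance 1; whichever of them is dequeued first becomes c's predecessor, so the breadth-first tree depends on the order of the adjacency lists even though the distances do not.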

Analysis

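No vertex is ever enqueued/dequeued more than once, and each queue operation takes O(1) time ⇒ O(V) for all queue operations

Each adjacency list is only scanned once (when the vertex is dequeued), and the sum of the lengths of all the adjacency lists is proportional to the total number of edges ⇒ O(E) for all scanning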

Initialization overhead ⇒ O(V)

Thus the total run time for BFS is O(V + E), i.e. linear in the size of the adjacency-list representation of G.
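
The accounting behind the O(V + E) bound can be checked on a small example. The sketch below is illustrative only: the function name bfs_counts, the counters, and the graph are assumptions introduced here. It counts the enqueue operations and the adjacency-list entries examined during one search.

```python
from collections import deque

def bfs_counts(adj, s):
    """Run BFS from s and count the work performed.

    Returns (enqueues, scans): the number of ENQUEUE operations and the
    number of adjacency-list entries examined.
    """
    discovered = {s}
    queue = deque([s])
    enqueues, scans = 1, 0
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            scans += 1                  # one adjacency-list entry examined
            if v not in discovered:
                discovered.add(v)
                queue.append(v)
                enqueues += 1
    return enqueues, scans

# Hypothetical undirected graph with 5 vertices and 5 edges
graph = {
    "s": ["a", "b"],
    "a": ["s", "c"],
    "b": ["s", "c"],
    "c": ["a", "b", "d"],
    "d": ["c"],
}
enqueues, scans = bfs_counts(graph, "s")
num_vertices = len(graph)
num_edges = sum(len(nbrs) for nbrs in graph.values()) // 2
```

Here every vertex is reachable, so the number of enqueues equals the number of vertices, and because each undirected edge appears in two adjacency lists, the number of entries scanned is twice the number of edges.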

It can be proven that the algorithm produces the shortest paths (in terms of the minimum number of edges) to all reachable vertices from the source s. These paths can be represented by a breadth-first tree that is given by the predecessor subgraph G_π = (V_π, E_π), where V_π = {v ∈ V : v.π ≠ NIL} ∪ {s} and E_π = {(v.π, v) : v ∈ V_π - {s}}.
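
Following the predecessor fields from a vertex back to s recovers one shortest path. The sketch below assumes the predecessors are stored in a dict; the names bfs_pred, shortest_path, and the example graph are hypothetical and chosen only for illustration.

```python
from collections import deque

def bfs_pred(adj, s):
    """Return the predecessor map of a BFS from s (s maps to None)."""
    pred = {s: None}
    queue = deque([s])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in pred:       # v is undiscovered
                pred[v] = u
                queue.append(v)
    return pred

def shortest_path(pred, v):
    """Walk predecessor links from v back to the source.

    Returns the path as a list from the source to v, or None if v was
    never discovered (i.e. it is unreachable from the source).
    """
    if v not in pred:
        return None
    path = []
    while v is not None:
        path.append(v)
        v = pred[v]
    path.reverse()                  # links were followed from v to the source
    return path

# Hypothetical example graph (undirected, stored as symmetric lists)
graph = {
    "s": ["a", "b"],
    "a": ["s", "c"],
    "b": ["s", "c"],
    "c": ["a", "b", "d"],
    "d": ["c"],
    "e": [],            # not reachable from s
}
pred = bfs_pred(graph, "s")
path_to_d = shortest_path(pred, "d")
path_to_e = shortest_path(pred, "e")
```

Only discovered vertices appear in pred, which mirrors the definition of the breadth-first tree: unreachable vertices have no predecessor and are not part of the tree.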