[Math] Finding connected components in a graph using BFS

algorithmsgraph theory

I'd like to know how do I change the known BFS algorithm in order to find all of the connected components of a given graph. The original algorithm stops whenever we've colored an entire component in black, but how do I change it in order for it to run through all of the components? And how do I distinguish between one component to the other?

Best Answer

Use an integer to keep track of the "colors" that identify each component, as @Joffan mentioned. Start BFS at a vertex $v$. When it finishes, all vertices that are reachable from $v$ are colored (i.e., labeled with a number). Loop through all vertices which are still unlabeled and call BFS on those unlabeled vertices to find other components.

Below is some pseudo-code which initializes all vertices with an unexplored label (an integer 0). It keeps a counter, $componentID$, which vertices are labeled with as they are explored. When a connected component is finished being explored (meaning that the standard BFS has finished), the counter increments. BFS is only called on vertices which belong to a component that has not been explored yet.

// input: graph G
// output: labeling of edges and partition of the vertices of G
LabelAllConnectedComponents(G):
    // initialize all vertices and edges as unexplored (label is 0)
    for all u ∈ G.vertices()
        setLabel(u, UNEXPLORED)
    for all e ∈ G.edges()
        setLabel(e, UNEXPLORED)

    // call BFS on every unlabeled vertex, which results in
    // calling BFS once for each connected component
    componentID = 1
    for all v ∈ G.vertices()
        if getLabel(v) == 0:
            BFS(G, v, componentID++)

// standard breadth-first-search algorithm that works on one component
BFS(G, s, componentID):
    L[0] = new empty sequence
    insert vertex s at the end of L[0]
    setLabel(s, componentID)
    i = 0
    while L[i] is not empty:
        L[i+1] = new empty sequence
        for all v ∈ L[i].elements()
            for all e ∈ G.incidentEdges(v)
                if getLabel(e) == UNEXPLORED
                    w ← opposite(v,e)
                if getLabel(w) == UNEXPLORED
                    setLabel(e, DISCOVERY)
                    setLabel(w, componentID)
                    L[i+1].insertLast(w)
                else
                    setLabel(e, CROSS)
        i = i+1

The total running time is $O(|V| + |E|)$ since each edge and vertex is labeled exactly twice - once to initialize and again when it's visited.

Related Solutions

[Math] How to use BFS or DFS to determine the connectivity in a non-connected graph

Your solution looks good to me!

I have one suggestion if you consider directed graphs. Consider the graph:

A -> B -> C
D -> F

Let's say your algorithm starts arbitrarily at node B. It will traverse the graph through B and C and mark them as connected components - but it won't catch A!

A simple way to solve this problem is to allow the DFS to traverse the graph going both forwards and backwards, as though the graph was non-directed.

This is another way to solve it: Your algorithm produces sets of connected components. Let's say, as above, that the algorithm starts at B and produces the set {B, C}. Then, let's say that the algorithm arbitrarily selects A as its next starting point. When it traverses the graph to B, it should union the new set of connected components {A} with the already-existing set {B, C} instead of simply skipping node B.

In general, if you encounter a node that's already part of a group of connected components during a DFS, mark the set of connected components that the node is in and then skip the node. Then, when you've finished traversing as many nodes as possible without retracing paths found by previous DFS's, union all of the marked sets and the set produced by the most recent DFS into one set of connected components and discard the unioned sets.

[Math] Largest and least amount of connected components of graph with conditions

HINT: I don’t understand your picture: a cycle of length $10$ requires $10$ vertices and $10$ edges, so there are $20$ vertices and $20$ edges remaining.

Find a way to attach the remaining $20$ vertices to the $10$-cycle using exactly $20$ edges, thereby producing a connected graph; this shows that the minimum possible number of components is $1$.
Notice that in any cycle the number of edges is equal to the number of vertices. To maximize the number of components, add as many cycles as possible to the original $10$-cycle. You’ll want to make them as small as you can.

Best Answer

Related Solutions

[Math] How to use BFS or DFS to determine the connectivity in a non-connected graph

[Math] Largest and least amount of connected components of graph with conditions

Related Question