Finding Bridges Online¶
We are given an undirected graph. A bridge is an edge whose removal makes the graph disconnected (or, more precisely, increases the number of connected components). Our task is to find all the bridges in the given graph.
Informally this task can be put as follows: we have to find all the "important" roads on the given road map, i.e. such roads that the removal of any of them will lead to some cities being unreachable from others.
There is already the article Finding Bridges in
More rigorously the statement of the problem is as follows:
Initially the graph is empty and consists of
It is also possible to maintain a list of all bridges as well as explicitly support the 2-edge-connected components.
The algorithm described below works in
Algorithm¶
First let's define a
It is very easy to see, that the bridges partition the graph into 2-edge-connected components. If we compress each of those 2-edge-connected components into vertices and only leave the bridges as edges in the compressed graph, then we obtain an acyclic graph, i.e. a forest.
The algorithm described below maintains this forest explicitly as well as the 2-edge-connected components.
It is clear that initially, when the graph is empty, it contains
When adding the next edge
-
Both vertices
Thus, in this case the number of bridges does not change.
-
The vertices
Thus, in this case the number of bridges increases by one.
-
The vertices
Thus, in this case the number of bridges decreases by two or more.
Consequently the whole task is reduced to the effective implementation of all these operations over the forest of 2-edge-connected components.
Data Structures for storing the forest¶
The only data structure that we need is Disjoint Set Union.
In fact we will make two copies of this structure:
one will be to maintain the connected components, the other to maintain the 2-edge-connected components.
And in addition we store the structure of the trees in the forest of 2-edge-connected components via pointers:
Each 2-edge-connected component will store the index par[]
of its ancestor in the tree.
We will now consistently disassemble every operation that we need to learn to implement:
-
Check whether the two vertices lie in the same connected / 2-edge-connected component. It is done with the usual DSU algorithm, we just find and compare the representatives of the DSUs.
-
Joining two trees for some edge
However the question about the effectiveness of the re-rooting operation arises: in order to re-root the tree with the root
par[]
in the opposite direction, and also change the references to the ancestors in the DSU that is responsible for the connected components.Thus, the cost of re-rooting is
We now apply a standard technique: we re-root the tree that contains fewer vertices. Then it is intuitively clear that the worst case is when two trees of approximately equal sizes are combined, but then the result is a tree of twice the size. This does not allow this situation to happen many times.
In general the total cost can be written in the form of a recurrence:
Thus, the total time spent on all re-rooting operations will be
We will have to maintain the size of each connected component, but the data structure DSU makes this possible without difficulty.
-
Searching for the cycle formed by adding a new edge
After finding the cycle we compress all vertices of the detected cycle into one vertex. This means that we already have a complexity proportional to the cycle length, which means that we also can use any LCA algorithm proportional to the length, and don't have to use any fast one.
Since all information about the structure of the tree is available is the ancestor array
par[]
, the only reasonable LCA algorithm is the following: mark the verticespar[a]
andpar[b]
and mark them, then advance to their ancestors and so on, until we reach an already marked vertex. This vertex is the LCA that we are looking for, and we can find the vertices on the cycle by traversing the path fromIt is obvious that the complexity of this algorithm is proportional to the length of the desired cycle.
-
Compression of the cycle by adding a new edge
We need to create a new 2-edge-connected component, which will consist of all vertices of the detected cycle (also the detected cycle itself could consist of some 2-edge-connected components, but this does not change anything). In addition it is necessary to compress them in such a way that the structure of the tree is not disturbed, and all pointers
par[]
and two DSUs are still correct.The easiest way to achieve this is to compress all the vertices of the cycle to their LCA. In fact the LCA is the highest of the vertices, i.e. its ancestor pointer
par[]
remains unchanged. For all the other vertices of the loop the ancestors do not need to be updated, since these vertices simply cease to exists. But in the DSU of the 2-edge-connected components all these vertices will simply point to the LCA.We will implement the DSU of the 2-edge-connected components without the Union by rank optimization, therefore we will get the complexity
par[]
accordingly.
Implementation¶
Here is the final implementation of the whole algorithm.
As mentioned before, for the sake of simplicity the DSU of the 2-edge-connected components is written without Union by rank, therefore the resulting complexity will be
Also in this implementation the bridges themselves are not stored, only their count bridges
.
However it will not be difficult to create a set
of all bridges.
Initially you call the function init()
, which initializes the two DSUs (creating a separate set for each vertex, and setting the size equal to one), and sets the ancestors par
.
The main function is add_edge(a, b)
, which processes and adds a new edge.
vector<int> par, dsu_2ecc, dsu_cc, dsu_cc_size;
int bridges;
int lca_iteration;
vector<int> last_visit;
void init(int n) {
par.resize(n);
dsu_2ecc.resize(n);
dsu_cc.resize(n);
dsu_cc_size.resize(n);
lca_iteration = 0;
last_visit.assign(n, 0);
for (int i=0; i<n; ++i) {
dsu_2ecc[i] = i;
dsu_cc[i] = i;
dsu_cc_size[i] = 1;
par[i] = -1;
}
bridges = 0;
}
int find_2ecc(int v) {
if (v == -1)
return -1;
return dsu_2ecc[v] == v ? v : dsu_2ecc[v] = find_2ecc(dsu_2ecc[v]);
}
int find_cc(int v) {
v = find_2ecc(v);
return dsu_cc[v] == v ? v : dsu_cc[v] = find_cc(dsu_cc[v]);
}
void make_root(int v) {
int root = v;
int child = -1;
while (v != -1) {
int p = find_2ecc(par[v]);
par[v] = child;
dsu_cc[v] = root;
child = v;
v = p;
}
dsu_cc_size[root] = dsu_cc_size[child];
}
void merge_path (int a, int b) {
++lca_iteration;
vector<int> path_a, path_b;
int lca = -1;
while (lca == -1) {
if (a != -1) {
a = find_2ecc(a);
path_a.push_back(a);
if (last_visit[a] == lca_iteration){
lca = a;
break;
}
last_visit[a] = lca_iteration;
a = par[a];
}
if (b != -1) {
b = find_2ecc(b);
path_b.push_back(b);
if (last_visit[b] == lca_iteration){
lca = b;
break;
}
last_visit[b] = lca_iteration;
b = par[b];
}
}
for (int v : path_a) {
dsu_2ecc[v] = lca;
if (v == lca)
break;
--bridges;
}
for (int v : path_b) {
dsu_2ecc[v] = lca;
if (v == lca)
break;
--bridges;
}
}
void add_edge(int a, int b) {
a = find_2ecc(a);
b = find_2ecc(b);
if (a == b)
return;
int ca = find_cc(a);
int cb = find_cc(b);
if (ca != cb) {
++bridges;
if (dsu_cc_size[ca] > dsu_cc_size[cb]) {
swap(a, b);
swap(ca, cb);
}
make_root(a);
par[a] = dsu_cc[a] = b;
dsu_cc_size[cb] += dsu_cc_size[a];
} else {
merge_path(a, b);
}
}
The DSU for the 2-edge-connected components is stored in the vector dsu_2ecc
, and the function returning the representative is find_2ecc(v)
.
This function is used many times in the rest of the code, since after the compression of several vertices into one all these vertices cease to exist, and instead only the leader has the correct ancestor par
in the forest of 2-edge-connected components.
The DSU for the connected components is stored in the vector dsu_cc
, and there is also an additional vector dsu_cc_size
to store the component sizes.
The function find_cc(v)
returns the leader of the connectivity component (which is actually the root of the tree).
The re-rooting of a tree make_root(v)
works as described above:
if traverses from the vertex par
in the opposite direction.
The link to the representative of the connected component dsu_cc
is also updated, so that it points to the new root vertex.
After re-rooting we have to assign the new root the correct size of the connected component.
Also we have to be careful that we call find_2ecc()
to get the representatives of the 2-edge-connected component, rather than some other vertex that have already been compressed.
The cycle finding and compression function merge_path(a, b)
is also implemented as described above.
It searches for the LCA of path_a
and path_b
, and we use them to walk through them a second time up to the LCA, thereby obtaining all vertices of the cycle.
All the vertices of the cycle get compressed by attaching them to the LCA, hence the average complexity is
Finally the query function add_edge(a, b)
determines the connected components in which the vertices merge_path(a, b)
is called, which will detect the cycle and compress it into one 2-edge-connected component.