Union-Find algorithm for finding components of a Graph #764

deadshotsb · 2020-05-19T06:51:13Z

A DFS approach has been added for finding the number of components of a graph, but an approach using the Union-Find algorithm is more than welcome.

A Union-Find approach using union by rank or path compression could be a better solution.

cclauss · 2020-05-19T09:58:56Z

We want a benchmark that proves that it is a better solution.

ayaankhan98 · 2020-05-19T10:24:18Z

I want to work on this.

coderanant · 2020-05-20T16:19:37Z

I was just wondering how a "Union-Find Approach using rank or path-compression" could be a better approach for finding the number of connected components. It'll take O(n log n), while the DFS approach works in O(n).
A Union-Find structure (DSU) is more generally used for the case where we are given several elements, each of which starts as a separate set. A DSU has an operation to combine any two sets, can tell which set a specific element is in, and can create a set from a new element.
An implementation describing this use of DSU would be more helpful.
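
A minimal sketch of the three operations described above (create a set from a new element, find the set of an element, combine two sets). The class and method names (`DisjointSetUnion`, `make_set`, `find_set`, `union_sets`, `set_count`) are illustrative assumptions, not the code from any PR in this repository, and the sketch assumes union by size with path compression.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Illustrative disjoint-set union (DSU); names are assumptions for this sketch.
class DisjointSetUnion {
 public:
    // Create a new singleton set and return the id of its only element.
    std::size_t make_set() {
        parent_.push_back(parent_.size());  // a new element is its own root
        size_.push_back(1);
        ++set_count_;
        return parent_.size() - 1;
    }

    // Return the representative (root) of the set containing x.
    std::size_t find_set(std::size_t x) {
        assert(x < parent_.size());
        std::size_t root = x;
        while (parent_[root] != root) root = parent_[root];
        // Path compression: point every node on the path directly at the root.
        while (parent_[x] != root) {
            std::size_t next = parent_[x];
            parent_[x] = root;
            x = next;
        }
        return root;
    }

    // Merge the sets containing a and b; returns true if they were distinct.
    bool union_sets(std::size_t a, std::size_t b) {
        a = find_set(a);
        b = find_set(b);
        if (a == b) return false;
        // Union by size: attach the smaller tree under the larger one.
        if (size_[a] < size_[b]) std::swap(a, b);
        parent_[b] = a;
        size_[a] += size_[b];
        --set_count_;
        return true;
    }

    // Number of disjoint sets currently stored.
    std::size_t set_count() const { return set_count_; }

 private:
    std::vector<std::size_t> parent_;
    std::vector<std::size_t> size_;
    std::size_t set_count_ = 0;
};
```

If every vertex is created with `make_set` and every edge is passed to `union_sets`, then `set_count()` is the number of connected components.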

cclauss · 2020-05-20T18:01:40Z

Why write all this theoretical text?? Write up both algorithms and a benchmark instead.

coderanant · 2020-05-21T18:13:27Z

I have added Pull Request #773 regarding this issue.

deadshotsb · 2020-05-21T18:23:17Z

@coderanant DFS will take O(E) and O(E) = [O(V), O(V²)].

coderanant · 2020-05-21T19:31:27Z

@deadshotsb Sorry for not making it clear.
DFS will take O(E) = [O(V), O(V²)]
DSU will take O(E log(V)) = [O(V log(V)), O(V² log(V))]
However, I have added an approach with both union by size and path compression, which might perform very close to the DFS approach in most cases.

cclauss · 2020-05-21T20:35:53Z

Where is the benchmark?

coderanant · 2020-05-22T09:49:32Z

How am I supposed to write a benchmark?
Please pardon me if this is very obvious, as I am kinda new to Open Source.

deadshotsb · 2020-05-23T09:19:24Z

@coderanant The best way to write a benchmark is to test the algorithms on randomly generated graphs and show the relation between running time and graph size for the algorithms under consideration.
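
One possible shape for such a benchmark is sketched below. Everything in it is an assumption made for illustration: the function names (`random_edges`, `count_components_dfs`, `count_components_dsu`, `time_ms`, `run_benchmark`), the edge density of two edges per vertex, and the fixed seed. The DSU here uses union by size with path halving, a common variant of path compression.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <iostream>
#include <numeric>
#include <random>
#include <utility>
#include <vector>

using Edge = std::pair<std::size_t, std::size_t>;

// Generate m random undirected edges on vertices 0..n-1 (duplicates allowed).
std::vector<Edge> random_edges(std::size_t n, std::size_t m, unsigned seed) {
    assert(n > 0);
    std::mt19937 gen(seed);
    std::uniform_int_distribution<std::size_t> pick(0, n - 1);
    std::vector<Edge> edges(m);
    for (auto &e : edges) e = Edge{pick(gen), pick(gen)};
    return edges;
}

// Count components with an iterative DFS over an adjacency list.
std::size_t count_components_dfs(std::size_t n, const std::vector<Edge> &edges) {
    std::vector<std::vector<std::size_t>> adj(n);
    for (const auto &e : edges) {
        adj[e.first].push_back(e.second);
        adj[e.second].push_back(e.first);
    }
    std::vector<char> seen(n, 0);
    std::vector<std::size_t> stack;
    std::size_t components = 0;
    for (std::size_t s = 0; s < n; ++s) {
        if (seen[s]) continue;
        ++components;  // s starts a component not visited before
        seen[s] = 1;
        stack.push_back(s);
        while (!stack.empty()) {
            std::size_t u = stack.back();
            stack.pop_back();
            for (std::size_t v : adj[u]) {
                if (!seen[v]) {
                    seen[v] = 1;
                    stack.push_back(v);
                }
            }
        }
    }
    return components;
}

// Count components with a DSU (union by size, path halving).
std::size_t count_components_dsu(std::size_t n, const std::vector<Edge> &edges) {
    std::vector<std::size_t> parent(n), size(n, 1);
    std::iota(parent.begin(), parent.end(), std::size_t{0});
    auto find = [&parent](std::size_t x) -> std::size_t {
        while (parent[x] != x) {
            parent[x] = parent[parent[x]];  // path halving
            x = parent[x];
        }
        return x;
    };
    std::size_t components = n;
    for (const auto &e : edges) {
        std::size_t a = find(e.first), b = find(e.second);
        if (a == b) continue;
        if (size[a] < size[b]) std::swap(a, b);
        parent[b] = a;
        size[a] += size[b];
        --components;  // each successful merge removes one component
    }
    return components;
}

// Wall-clock time of a callable, in milliseconds.
template <typename F>
double time_ms(F &&f) {
    auto start = std::chrono::steady_clock::now();
    f();
    auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Time both algorithms on graphs of growing size; false if counts ever differ.
bool run_benchmark(const std::vector<std::size_t> &sizes, unsigned seed = 42) {
    bool all_match = true;
    std::cout << "vertices\tedges\tdfs_ms\tdsu_ms\n";
    for (std::size_t n : sizes) {
        const std::size_t m = 2 * n;  // assumed edge density
        const auto edges = random_edges(n, m, seed);
        std::size_t dfs_count = 0, dsu_count = 0;
        double dfs_ms = time_ms([&] { dfs_count = count_components_dfs(n, edges); });
        double dsu_ms = time_ms([&] { dsu_count = count_components_dsu(n, edges); });
        all_match = all_match && dfs_count == dsu_count;
        std::cout << n << '\t' << m << '\t' << dfs_ms << '\t' << dsu_ms << '\n';
    }
    return all_match;
}
```

Calling `run_benchmark({1000, 10000, 100000, 1000000})` from `main` prints one row per graph size. Note that the DFS timing includes building the adjacency list, and a single run per size is noisy, so repeating each measurement and compiling with optimizations would give more trustworthy numbers.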

ayaankhan98 · 2020-05-24T12:17:30Z

What is the current status of this issue? As far as I know, I had merged the PR related to this issue. What is the status of the benchmark, @coderanant?

testitem · 2020-05-29T11:38:59Z

> I was just wondering how a "Union-Find Approach using rank or path-compression" could be a better approach for finding the number of connected components. It'll take O(n log n), while the DFS approach works in O(n).
> A Union-Find structure (DSU) is more generally used for the case where we are given several elements, each of which starts as a separate set. A DSU has an operation to combine any two sets, can tell which set a specific element is in, and can create a set from a new element.
> An implementation describing this use of DSU would be more helpful.

Let's say you are given Q queries. Each query adds an edge to a graph. After each query you are to return the number of connected components in the graph.

DFS is O(NQ), while DSU is O(Q α(N)).
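
A short sketch of this query scenario, assuming the vertices are numbered 0..N-1 and each query is a pair of endpoints. The function name `components_after_each_edge` is hypothetical. The DSU keeps a running component count, so each answer costs only two finds and at most one merge, instead of a fresh traversal.

```cpp
#include <cassert>
#include <cstddef>
#include <numeric>
#include <utility>
#include <vector>

// For each added edge, report the number of connected components so far.
std::vector<std::size_t> components_after_each_edge(
    std::size_t n,
    const std::vector<std::pair<std::size_t, std::size_t>> &queries) {
    std::vector<std::size_t> parent(n), size(n, 1), answers;
    std::iota(parent.begin(), parent.end(), std::size_t{0});
    auto find = [&parent](std::size_t x) -> std::size_t {
        while (parent[x] != x) {
            parent[x] = parent[parent[x]];  // path halving
            x = parent[x];
        }
        return x;
    };
    std::size_t components = n;  // initially every vertex is its own component
    answers.reserve(queries.size());
    for (const auto &q : queries) {
        assert(q.first < n && q.second < n);
        std::size_t a = find(q.first), b = find(q.second);
        if (a != b) {
            if (size[a] < size[b]) std::swap(a, b);  // union by size
            parent[b] = a;
            size[a] += size[b];
            --components;
        }
        answers.push_back(components);
    }
    return answers;
}
```

For example, with N = 5 and the queries (0,1), (2,3), (1,0), (1,2), the answers are 4, 3, 3, 2: the third edge joins two vertices that are already connected, so the count does not change.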

kvedala · 2020-05-29T11:48:44Z

> I was just wondering how a "Union-Find Approach using rank or path-compression" could be a better approach for finding the number of connected components. It'll take O(n log n), while the DFS approach works in O(n).
> A Union-Find structure (DSU) is more generally used for the case where we are given several elements, each of which starts as a separate set. A DSU has an operation to combine any two sets, can tell which set a specific element is in, and can create a set from a new element.
> An implementation describing this use of DSU would be more helpful.
>
> Let's say you are given Q queries. Each query adds an edge to a graph. After each query you are to return the number of connected components in the graph.
>
> DFS is O(NQ), while DSU is O(Q α(N)).

Since this repository is presenting multiple algorithms from an educational perspective, older inefficient algorithm implementations would also be welcome along with faster and more efficient implementations.

testitem · 2020-05-29T11:51:54Z

In that case I am confused as to why cclauss is obsessed with speed benchmarks.

cclauss · 2020-05-29T11:53:50Z

We want to be able to see the theoretical performance (DFS is O(NQ), while DSU is O(Q α(N))) as well as the actual performance (32 sec. vs. 97 sec.).

As your username says, let's test the item. ;-)

coderanant · 2020-05-29T11:58:31Z

> I was just wondering how a "Union-Find Approach using rank or path-compression" could be a better approach for finding the number of connected components. It'll take O(n log n), while the DFS approach works in O(n).
> A Union-Find structure (DSU) is more generally used for the case where we are given several elements, each of which starts as a separate set. A DSU has an operation to combine any two sets, can tell which set a specific element is in, and can create a set from a new element.
> An implementation describing this use of DSU would be more helpful.
>
> Let's say you are given Q queries. Each query adds an edge to a graph. After each query you are to return the number of connected components in the graph.
>
> DFS is O(NQ), while DSU is O(Q α(N)).

Thanks for the clarification.
I didn't consider this use-case.

Chillee · 2020-05-29T12:02:39Z

If the "point" of this repo is to present algorithms from an "educational" perspective, what use are benchmarks?

On a similar note, if the goal is to present algorithms from an "educational" perspective, the repo should do a better job of documenting/explaining each algorithm.

kvedala · 2020-05-29T12:30:34Z

> If the "point" of this repo is to present algorithms from an "educational" perspective, what use are benchmarks?
>
> On a similar note, if the goal is to present algorithms from an "educational" perspective, the repo should do a better job of documenting/explaining each algorithm.

Documentation is being updated. A major re-work in this regard can be found here https://kvedala.github.io/C-Plus-Plus. However, for this documentation to automatically stay up-to-date with the latest commits, the code submitted must be properly structured and documented.

kvedala · 2020-05-29T12:49:36Z

> If the "point" of this repo is to present algorithms from an "educational" perspective, what use are benchmarks?
> On a similar note, if the goal is to present algorithms from an "educational" perspective, the repo should do a better job of documenting/explaining each algorithm.

Documentation is being updated. A major re-work in this regard can be found here https://kvedala.github.io/C-Plus-Plus. However, for this documentation to automatically stay up-to-date with the latest commits, the code submitted must be properly structured and documented.

Also, any help towards this herculean effort would be greatly appreciated 😅

deadshotsb added the enhancement label May 19, 2020

deadshotsb assigned ayaankhan98 May 20, 2020

cclauss assigned coderanant May 20, 2020
