I have to find the best algorithm from already known for parallel computing of connected components of graph.
Here is a brief outline of my data and computer architecture:
I have read about parallel algorithms for computing connected components of graphs. As I have noticed, some of them base on the classical BFS approach for the serialized case. To be honest, I got a bit lost in the number of these algorithms. Could anyone give me some advice, which algorithm would be the best for my purposes?
Ligra is either the state of the art or close to it for single-machine multicore implementations. It should be able to handle your graph no problem.
Connected Components at Scale via Local Contractions, by my colleagues Jakub Łącki, Vahab Mirrokni, and Michał Włodarczyk, is the state of the art (at least, that I know about) for MapReduce algorithms. We've used it on graphs a thousand times bigger than yours.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With