Hi Horace. Great post!
The link you posted at the end:
- Parallelize it across arbitrary number of devices using arbitrary horizontal/vertical parallelism
links to the FLOP counter post, but I can’t find anything on parallelism in there. Is this link correct?
Thanks.