renzume / renzume.
npub16wt…nfqk
2025-02-27 09:10:24

DualPipe is a bidirectional pipeline parallelism algorithm introduced in the DeepSeek-V3 Technical Report. It fully overlaps computation and communication across the forward and backward phases of training, and it reduces pipeline bubbles (the time pipeline stages spend idle). To use it in practice, you need to implement a custom overlapped forward-backward method for each module you want to run through it.
https://github.com/deepseek-ai/DualPipe
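
To make "reduces pipeline bubbles" concrete, here is a rough sketch that compares bubble time for three schedules. The formulas are the ones I recall from the comparison table in the DeepSeek-V3 Technical Report, so check them against the report before relying on them. The function names and the timing values are hypothetical, chosen only for illustration: PP is the number of pipeline stages, F the time of a forward chunk, B the time of a full backward chunk, W the time of a backward-for-weights chunk, and F&B the time of two mutually overlapped forward and backward chunks.

```python
def bubble_1f1b(pp: int, f: float, b: float) -> float:
    """Bubble time of a 1F1B schedule: (PP - 1) * (F + B)."""
    return (pp - 1) * (f + b)


def bubble_zb1p(pp: int, f: float, b: float, w: float) -> float:
    """Bubble time of a ZB1P schedule: (PP - 1) * (F + B - 2W)."""
    return (pp - 1) * (f + b - 2 * w)


def bubble_dualpipe(pp: int, f_and_b: float, b: float, w: float) -> float:
    """Bubble time of DualPipe: (PP/2 - 1) * (F&B + B - 3W).

    Assumes an even number of pipeline stages, since the schedule
    feeds micro-batches from both ends of the pipeline.
    """
    if pp % 2 != 0:
        raise ValueError("pp must be even for a bidirectional schedule")
    return (pp // 2 - 1) * (f_and_b + b - 3 * w)


if __name__ == "__main__":
    # Hypothetical timings in arbitrary units, not measurements.
    pp, f, b, w, f_and_b = 8, 1, 2, 1, 3
    print("1F1B    :", bubble_1f1b(pp, f, b))
    print("ZB1P    :", bubble_zb1p(pp, f, b, w))
    print("DualPipe:", bubble_dualpipe(pp, f_and_b, b, w))
```

With these made-up numbers the bubble shrinks from 1F1B to ZB1P to DualPipe. Note that this only models idle time; as far as I recall, the report also lists a higher parameter memory cost for DualPipe, which the sketch does not capture.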
#machinelearning #parallelism #algorithm #pytorch #deepseek
Author Public Key
npub16wtj5hrk96w2kcw9gpxz7ee5sqpzhyyxp6kh08fltmh4e0n6weqq3hnfqk