Thursday, February 27, 2025

DeepSeek DualPipe - an innovative bidirectional pipeline parallelism algorithm used in V3 and R1 training

DualPipe is an innovative bidirectional pipeline parallelism algorithm introduced in the DeepSeek-V3 Technical Report. It fully overlaps computation and communication across the forward and backward phases, and it also reduces pipeline bubbles (the idle time a pipeline stage spends waiting for work). For detailed information on computation-communication overlap, please refer to the profile data.
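To get a feel for what "reducing pipeline bubbles" means, here is a minimal sketch that evaluates the bubble formulas listed in the DualPipe comparison table for 1F1B, ZB1P and DualPipe. PP is the number of pipeline stages, F and B are the execution times of a forward chunk and a full backward chunk, W is the time of a "backward for weights" chunk, and F&B is the time of two mutually overlapped forward and backward chunks. The function names and the timing values below are hypothetical, chosen only for illustration; they are not measurements from the profile data.

```python
def bubble_1f1b(pp, f, b):
    """Bubble time for 1F1B: (PP - 1) * (F + B)."""
    return (pp - 1) * (f + b)


def bubble_zb1p(pp, f, b, w):
    """Bubble time for ZB1P: (PP - 1) * (F + B - 2W)."""
    return (pp - 1) * (f + b - 2 * w)


def bubble_dualpipe(pp, fb, b, w):
    """Bubble time for DualPipe: (PP/2 - 1) * (F&B + B - 3W)."""
    if pp % 2 != 0:
        # The formula divides the stages into two halves, so PP must be even.
        raise ValueError("pp must be even for DualPipe")
    return (pp // 2 - 1) * (fb + b - 3 * w)


if __name__ == "__main__":
    # Hypothetical timings in arbitrary units, not taken from real profiles.
    pp, f, b, w, fb = 8, 2, 4, 2, 5
    print("1F1B     bubble:", bubble_1f1b(pp, f, b))
    print("ZB1P     bubble:", bubble_zb1p(pp, f, b, w))
    print("DualPipe bubble:", bubble_dualpipe(pp, fb, b, w))
```

With these made-up numbers the DualPipe bubble is the smallest of the three. The actual gain depends on the real chunk timings, and the smaller bubble comes at the cost of keeping two copies of the model parameters.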


Profile Data 






Training - Forward and backward chunks in DualPipe


More details at https://github.com/deepseek-ai/profile-data



