10 Comments
User's avatar
Dr. Ashish Bamania's avatar

Great deep dive and very helpful. Thanks for writing this!

Expand full comment
Cameron R. Wolfe, Ph.D.'s avatar

Thank you so much, and of course!

Expand full comment
DesignREM support's avatar

Amazing article, a true gem. I would be interested to learn more about how DeepSeek used reinforcement learning to such great effect.

Expand full comment
Cameron R. Wolfe, Ph.D.'s avatar

Working on that article now!

Expand full comment
Rahul Saini's avatar

Great in-depth article on MoE LLMs.

Expand full comment
Cameron R. Wolfe, Ph.D.'s avatar

Thank you very much!

Expand full comment
Michael's avatar

I agree with Dr. Bamania- this was very rewarding reading.

Expand full comment
Cameron R. Wolfe, Ph.D.'s avatar

Thank you! I'm glad you found it helpful!

Expand full comment
JustaPlacebo's avatar

Do you expect this leapfrogging to persist for a while longer? The collective response to this tug of war seems overwhelmingly simplified and “gotcha!” meme-based. Public discourse overall appears to lean into a predictable lack of reservation for exponential leaps between companies, unknowns, and established methods.

Expand full comment
Srini Vijay, PhD's avatar

Great article. I couldn't finish it in one go though. I have book marked it and will come back to it later.

Expand full comment