Representative MoE training playbooks by hardware platform and model family. Summarizes rounded throughput bands, parallelism patterns, and common tuning stacks.
Creator's repository · nvidia/skills
License: Apache-2.0