Expert-parallel load balancer using linear programming to optimize MoE workload distribution. Supports Cube, Hypercube, Ring, and Torus topologies. Early research stage successor to EPLB.

Library

GitHub Repository

infrastructuremoeopen-source

Related