Expert Routing Visualizer

Watch how tokens route to different expert subsets in a Mixture-of-Experts layer
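The routing step the visualizer animates can be sketched roughly as follows. This is a minimal illustration, not the page's actual implementation: it assumes softmax gating with top-k selection, and the value `TOP_K = 2`, the helper names `softmax` and `route_token`, and the random router scores are all hypothetical, since the page does not say how many experts each token uses or how scores are computed.

```python
import math
import random

NUM_EXPERTS = 8  # matches the 8 expert networks shown in the visualizer
TOP_K = 2        # assumption: the page does not state how many experts each token uses


def softmax(logits):
    """Convert raw router scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, k=TOP_K):
    """Return [(expert_index, gate_weight), ...] for the k highest-scoring experts.

    Gate weights are renormalized over the selected experts so they sum to 1.
    """
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


# Hypothetical router scores for three tokens. A real router would compute
# these from each token's hidden state, typically with a learned linear layer.
rng = random.Random(0)
tokens = ["The", "cat", "sat"]
routes = {
    tok: route_token([rng.gauss(0, 1) for _ in range(NUM_EXPERTS)])
    for tok in tokens
}

for tok, chosen in routes.items():
    print(tok, [(i, round(w, 2)) for i, w in chosen])
```

Each token ends up with a small subset of experts and a weight per expert; the token's output would be the weighted sum of those experts' outputs, while the remaining experts do no work for that token.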

Input Tokens
Router
8 Expert Networks (22B params each)
Statistics
Active Experts: 0/8
Active Params: 0B
Total Params: 176B
Compute Savings: 0%
Expert Load
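One plausible way the statistics panel could be derived from a set of routing decisions is sketched below. Only the figures 8 experts and 22B parameters per expert come from the page; the function name `routing_stats`, the definition of "Active Experts" as the number of distinct experts that received at least one token, and the savings formula (the fraction of experts left idle) are assumptions. The widget may compute these differently, for example per token rather than per batch.

```python
from collections import Counter

NUM_EXPERTS = 8           # from the panel label
PARAMS_PER_EXPERT_B = 22  # billions of parameters per expert, from the panel label


def routing_stats(assignments):
    """Summarize routing decisions.

    assignments: one list of expert indices per token, e.g. [[1, 3], [1, 6]].
    """
    load = Counter(e for experts in assignments for e in experts)
    active = len(load)  # distinct experts that received at least one token
    # Assumed definition: share of experts that did no work. With nothing
    # routed we report 0.0 to mirror the idle readout shown on the page.
    savings = 1 - active / NUM_EXPERTS if active else 0.0
    return {
        "active_experts": active,
        "active_params_b": active * PARAMS_PER_EXPERT_B,
        "total_params_b": NUM_EXPERTS * PARAMS_PER_EXPERT_B,
        "compute_savings": savings,
        "expert_load": [load.get(i, 0) for i in range(NUM_EXPERTS)],
    }


# Hypothetical example: three tokens, each routed to two experts.
stats = routing_stats([[1, 3], [1, 6], [3, 6]])
print(f"Active Experts: {stats['active_experts']}/{NUM_EXPERTS}")
print(f"Active Params: {stats['active_params_b']}B")
print(f"Total Params: {stats['total_params_b']}B")
print(f"Compute Savings: {stats['compute_savings']:.1%}")
print(f"Expert Load: {stats['expert_load']}")
```

The total of 176B follows directly from 8 experts at 22B parameters each. In this example three experts are touched, so 66B parameters are active and, under the assumed formula, 62.5% of the expert compute is skipped.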