vllm.model_executor.models.dbrx ¶
Classes:
-
DbrxMoE–A tensor-parallel MoE implementation for DBRX.
-
DbrxRouter–A Router implementation for DBRX that returns logits for each expert
DbrxMoE ¶
Bases: Module
A tensor-parallel MoE implementation for DBRX.
Each expert's weights are sharded across all ranks and a fused MoE kernel is used for the forward pass, and finally we reduce the outputs across ranks.
Source code in vllm/model_executor/models/dbrx.py
DbrxRouter ¶
Bases: Module
A Router implementation for DBRX that returns logits for each expert per token.