Pull requests: SemiAnalysisAI/InferenceX

Pull requests list

Add GLM-5 NVFP4 GB200 disagg-mtp TRT-LLM benchmarks via Dynamo
#1800 opened Jun 16, 2026 by xinli-sw Collaborator
[NV]Add GLM-5 NVFP4 GB300 disagg-mtp TRT-LLM benchmarks via Dynamo
#1799 opened Jun 16, 2026 by xinli-sw Collaborator
chore(runners): add TensorWave MI300X docker runners (mi300x-tw)
#1793 opened Jun 16, 2026 by cquil11 Collaborator
[NV] Update MiniMax M3 B300 vLLM serving settings
Label: non-canary-full-sweep-enabled: Run the full sweep without the canary gate (full search space, no trim)
#1781 opened Jun 15, 2026 by jasonlizhengjian Collaborator
[Klaud Cold][Experimental][DNM] minimaxm3-fp8-mi355x-vllm-disagg: day-zero MoRI-IO disagg smoke test (1P TP8 + 1D TP8, conc 1)
Label: non-canary-full-sweep-enabled: Run the full sweep without the canary gate (full search space, no trim)
#1762 opened Jun 14, 2026 by functionstackx Collaborator