Repositories / vllm-project / vllm

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

监控状态：已开启最近同步：2026-06-04 08:41 同步状态：空闲下次计划：2026-06-04 09:41

PR 列表

最近 1 天最近 3 天最近 7 天

更多筛选

排序重要度开始结束

✕ 清空

标签聚合仓库周报

2026-05-30

#43881 [ROCm] cmake: support PYTORCH_FOUND_HIP for torch 2.13 native HIP language support

原始 PR · 作者 nemanjaudovic · 合并时间 2026-05-30 13:16

缺陷修复重要性 2.77 洞察度 3.00

修复 PyTorch 2.13 下 ROCm 构建失败

PR 变更明确且经过审批，可安全合并。内部无需精读，但涉及 CMake 配置维护的工程师可了解此兼容性处理模式。

rocmci/buildbugfix

#44028 [ROCm][CI] Fix failure in the Phi3V pooling test

原始 PR · 作者 AndreasKaratzas · 合并时间 2026-05-30 12:14

缺陷修复重要性 6.21 洞察度 5.00

分离 Phi3V 测试中特殊 token 验证用例

建议接受此 PR。变更清晰、动机明确，且拆分后的测试覆盖更精确。可作为测试分离重构的参考案例。

rocmmulti-modalitytest

#43997 [Refactor] Remove dead current_tool_name_sent assignments from tool parsers

原始 PR · 作者 sfeng33 · 合并时间 2026-05-30 09:45

重构重要性 4.42 洞察度 2.00

移除工具解析器中死代码 current_tool_name_sent

直接合并即可，无需额外审查。这是一个安全且清晰的死代码清理。

refactortool-callingcleanup

#43792 offload prompt_embeds decode in render_prompts_async to avoid blocking

原始 PR · 作者 gagandhakrey · 合并时间 2026-05-30 09:36

性能优化重要性 6.61 洞察度 5.00

修复 render_prompts_async 假异步引起的事件循环阻塞

此 PR 是一次精准的性能修复，值得合并。建议未来添加一个简单的集成测试来验证 `render_prompts_async` 不阻塞事件循环，可作为跟进项。

performancebugfixfrontend

#38445 [PERF]MiniMax-M2 gate kernel

原始 PR · 作者 jeejeelee · 合并时间 2026-05-30 09:28

性能优化重要性 9.36 洞察度 7.00

融合 MiniMax-M2 MoE 门控的 FP32 路由 GEMM 核函数

值得精读，展示了如何为特定模型定制融合 GEMM 并通过分层调度集成到现有 MoE 门控框架。重点可关注 `GateLinear.forward` 的四级调度设计和 `fp32_router_gemm_fake` 的注册模式。

performancekernelmodel

#44033 Revert "[MoE Refactor] Migrate MoeWNA16Method quantization to MK orac…

原始 PR · 作者 bnellnm · 合并时间 2026-05-30 07:45

重构重要性 8.50 洞察度 3.00

回退 WNA16 MoE oracle 迁移重构

该 PR 是修复性回退，值得相关人员了解合并过程中出现的问题，但普通使用者无需深究。关注后续意图为正确合并的重新提交。

refactormoequantization

#43974 [CI] Fix smoke test step key to bypass block gate

原始 PR · 作者 khluu · 合并时间 2026-05-30 07:28

缺陷修复重要性 2.59 洞察度 2.00

修复 CI smoke 测试步骤键名使其绕过手动阻塞门

值得合并，修复了之前 PR 引入的 CI 流程问题。CI 维护者可关注是否有其他步骤键名也需按此命名规范调整。

ci/buildbugfixinfra

#44023 [CI] Remove duplicate Harmony test coverage

原始 PR · 作者 sfeng33 · 合并时间 2026-05-30 06:52

测试重要性 5.85 洞察度 2.00

删除重复的 Harmony 测试覆盖

该 PR 已合并，无直接行动项。建议团队将此作为测试清理的范例，定期审查并移除重复或不必要的测试，保持测试套件精简高效。

testcleanupci/build

第 18 / 269 页 · 共 2148 条

上一页 1 … 16 17 18 19 20 … 269 下一页