Repositories / sgl-project / sglang

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

监控状态：已开启最近同步：2026-06-07 10:11 同步状态：空闲下次计划：2026-06-07 11:11

后台正在同步并分析最近 PR，页面会自动刷新并逐步显示最新结果。

PR 列表

最近 1 天最近 3 天最近 7 天

更多筛选

排序重要度开始结束

✕ 清空

标签聚合仓库周报

2026-06-06

#27413 Add scripted-runtime unit, core integration, and chunked-prefill tests

原始 PR · 作者 fzyzcjy · 合并时间 2026-06-06 09:08

测试重要性 8.15 洞察度 5.00

为scripted-runtime添加单元/集成和chunked-prefill测试

本PR值得精读，特别是对sglang测试基础设施感兴趣的团队成员。`test_scripted_runtime_core.py`展示了如何通过生成器脚本驱动调度器步进测试，这种模式可复用于其他模块的集成测试。`test_scripted_core_1gpu.py`中的生命周期暂停测试设计精巧，覆盖了`pause_generation(mode='retract')`后的waiting_queue行为和输出冻结验证。建议所有scripted-runtime的相关修改都运行这些测试以确保不破坏语义。

testschedulingrefactor

#27412 Add scripted-runtime KV-pool and lock-ref exhauster primitives

原始 PR · 作者 fzyzcjy · 合并时间 2026-06-06 09:07

测试重要性 6.98 洞察度 3.00

为 scripted runtime 添加 KV 池和锁引用耗尽原语

测试团队推荐精读这两个 Exhauster 的实现，后续 chunked-prefill 测试将依赖它们。也可作为如何在 scripted 测试中模拟系统状态的参考模式。

testschedulingkv-cache

#27411 Add scripted-runtime harness core and wire scheduler/IPC hooks

原始 PR · 作者 fzyzcjy · 合并时间 2026-06-06 09:07

测试重要性 8.47 洞察度 6.00

新增 scripted-runtime 测试框架核心与调度器 IPC 钩子

值得对 scripted-runtime 感兴趣或有复杂调度测试需求的工程师阅读，尤其 ScriptedSchedulerHook 的 IPC 分发和 ScriptedHttpServer 的生命周期管理设计。

testschedulinginfra

#27410 Add kv_canary PP self-test fixture and SWA divergence coverage

原始 PR · 作者 fzyzcjy · 合并时间 2026-06-06 09:06

测试重要性 7.00 洞察度 3.00

为 kv_canary 添加 PP 自测夹具和 SWA divergence 测试

值得关注 `CanaryPPFixture` 基类的设计，它为 PP 测试提供了可复用的服务器参数配置和生命周期管理，为后续更多 PP 场景测试提供了模式参考。建议读者精读新增的扰动测试用例，了解 real-kv-hash 扰动的触发条件与断言方法。

testkv-cacheconsistency

#27405 Don't write crash dump on graceful exit

原始 PR · 作者 cctry · 合并时间 2026-06-06 08:06

缺陷修复重要性 3.84 洞察度 2.00

修复优雅退出时误写 crash dump 的问题

本次变更为一次小范围、低风险的 bugfix，逻辑清晰，改动量小。建议快速合并。

bugfix

#25337 [plugin] default device detection fixes for OOT platform plugins

原始 PR · 作者 DevashishLal-CB · 合并时间 2026-06-06 07:55

功能重要性 6.55 洞察度 6.00

OOT平台插件设备检测修复与导入优化

此 PR 是硬件抽象层 RFC 的第一步落地，值得关注其设计取舍。对于平台集成者，建议精读 `device_mixin.py` 和 `device_config.py` 的变更以了解接口约定。对于核心开发者，注意后续需要清理剩余的延迟导入和硬编码检查。

featureinfrarefactor

#27404 Remove DeepSeek V4 release Docker workflow

原始 PR · 作者 Fridge003 · 合并时间 2026-06-06 07:49

基础设施重要性 4.90 洞察度 1.00

删除 DeepSeek V4 发布 Docker 构建工作流

该 PR 简单明确，无需精读。关注点在于确认 DeepSeek V4 的发布是否已完全迁移，避免遗漏。

infradeepseekdocker

#26726 fix(spec-dec): treat `num_nextn_predict_layers=0` the same as absent for EAGLE3 drafts

原始 PR · 作者 thanhhao98 · 合并时间 2026-06-06 07:08

缺陷修复重要性 6.25 洞察度 3.00

修复 EAGLE3 draft num_nextn_predict_layers=0 时层数计算错误

建议尽快合入并发布补丁，该修复解决了 EAGLE3 的一个显式崩溃问题，且风险极低。同时建议在相关测试中增加 num_nextn_predict_layers=0 的边界测试用例。

bugfixspeculative-decodingdeepseek

第 6 / 357 页 · 共 2850 条

上一页 1 … 4 5 6 7 8 … 357 下一页