Repositories / sgl-project / sglang

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

监控状态：已开启最近同步：2026-06-07 10:11 同步状态：空闲下次计划：2026-06-07 11:11

后台正在同步并分析最近 PR，页面会自动刷新并逐步显示最新结果。

PR 列表

最近 1 天最近 3 天最近 7 天

更多筛选

排序重要度开始结束

✕ 清空

标签聚合仓库周报

2026-05-19

#25711 Expose can-run-cuda-graph as a regular property on the embedding result

原始 PR · 作者 fzyzcjy · 合并时间 2026-05-19 09:18

重构重要性 5.07 洞察度 3.00

EmbeddingBatchResult 显式暴露 can_run_cuda_graph 属性

可快速合并，作为重构链中的一环，为未来统一类型处理打下基础。

refactorscheduling

#25710 Remove the dead hasattr fallback around the test-only crash counter

原始 PR · 作者 fzyzcjy · 合并时间 2026-05-19 09:17

重构重要性 4.59 洞察度 2.00

移除测试崩溃计数器的冗余hasattr保护

简单清理，可安全合并。

refactortest

#25709 Refactor batch_result_processor into per-step prefill/decode helpers

原始 PR · 作者 fzyzcjy · 合并时间 2026-05-19 09:17

重构重要性 8.61 洞察度 6.00

将批结果处理器拆分为 per-step 助手方法

建议阅读此 PR，它是大型方法拆分为单一职责助手方法的良好范例。设计决策（如将 logprob 移动与处理分离、统一 spec-v1/v2 路径处理）值得借鉴。

refactorscheduling

#25708 Route streaming output through the accumulator's payload method instead of an inline send

原始 PR · 作者 fzyzcjy · 合并时间 2026-05-19 09:16

重构重要性 7.08 洞察度 5.00

提取 Streaming output 构造到 Accumulator.to_payload

值得精读，展示了如何通过提取方法简化复杂函数、提升模块化。设计上值得借鉴：将条件守卫和数据构建统一封装到 payload 方法中，减少主函数职责。

refactorschedulingencapsulation

#25707 Log per-request time stats in a dedicated tail step

原始 PR · 作者 fzyzcjy · 合并时间 2026-05-19 09:16

重构重要性 5.87 洞察度 2.00

提取时间日志记录为私有方法

这是一个小范围重构，属于代码清理，值得合并但不需精读。

refactor

#25706 Route streaming-accept decisions through the accumulator instead of an inline gate

原始 PR · 作者 fzyzcjy · 合并时间 2026-05-19 09:15

重构重要性 7.72 洞察度 5.00

流式输出接受逻辑封装到累加器

值得精读，作为将内联逻辑封装进数据类方法的范例，展示了如何逐步简化循环并保持行为一致。

refactorscheduling

#25705 Pack scattered output-streamer state into a dedicated accumulator

原始 PR · 作者 fzyzcjy · 合并时间 2026-05-19 09:15

重构重要性 8.61 洞察度 6.00

将流式输出状态封装为专用累加器类

这个 PR 是输出流组件重构链的第一步，值得关注其逐步提炼的设计模式。对于理解 SGLang 流式输出内部机制的人，可以仔细阅读 `_GenerationStreamAccumulator` 的初始化和 `_stream_output_generation` 的改写，体会如何用数据类封装局部状态以简化大型方法。

refactorschedulingencapsulation

#25303 [Spec]: Make Triton standalone spec test deterministic

原始 PR · 作者 Ratish1 · 合并时间 2026-05-19 08:36

测试重要性 3.84 洞察度 5.00

使Triton独立推测解码测试确定化

值得精读：PR body中对非确定性原因的排查思路(逐项检查掩码、位置、KV槽、接受逻辑)和实验对比(纯确定性 vs 仅目标验证统一注意力)展示了系统性的根因分析。配置级修复优先于内核修改的设计决策值得学习。

speculative-decodingtestbugfix

第 104 / 357 页 · 共 2850 条

上一页 1 … 102 103 104 105 106 … 357 下一页

sgl-project/sglang

PR 列表

#25711 Expose can-run-cuda-graph as a regular property on the embedding result

#25710 Remove the dead hasattr fallback around the test-only crash counter

#25709 Refactor batch_result_processor into per-step prefill/decode helpers

#25708 Route streaming output through the accumulator's payload method instead of an inline send

#25707 Log per-request time stats in a dedicated tail step

#25706 Route streaming-accept decisions through the accumulator instead of an inline gate

#25705 Pack scattered output-streamer state into a dedicated accumulator

#25303 [Spec]: Make Triton standalone spec test deterministic

参与讨论