sgl-project/sglang · 标签视图

标签列表

bugfix · 983

run-ci · 859

performance · 764

test · 658

refactor · 618

feature · 572

ci · 402

documentation · 365

scheduling · 352

infra · 331

diffusion · 322

deepseek · 301

kv-cache · 244

speculative-decoding · 235

npu · 214

quant · 200

amd · 193

jit-kernel · 188

moe · 140

consistency · 127

observability · 123

multimodal · 122

hicache · 120

dependencies · 115

sgl-kernel · 77

attention · 70

lora · 62

debugging · 52

intel · 46

docker · 40

blackwell · 39

benchmark · 32

xpu · 30

model-gateway · 25

cpu · 24

mamba · 24

mla · 23

disaggregation · 20

fp8 · 16

mthreads · 13

security · 11

macos · 9

ray · 6

hisparse · 4

cookbook · 3

deployment · 3

encapsulation · 3

cuda · 2

deterministic · 2

gemma4 · 2

memory · 2

mlx · 2

other · 2

swa · 2

accuracy · 1

aiter · 1

arm · 1

conf · 1

configuration · 1

cuda-graph · 1

dependency · 1

dflash · 1

distributed · 1

glm · 1

kernel · 1

kimi_k25 · 1

kubernetes · 1

kvcache · 1

llama · 1

memory-management · 1

modelexpress · 1

multi-tokenizer · 1

nixl · 1

nvidia · 1

piecewise-cuda-graph · 1

pp · 1

pypi · 1

radix-cache · 1

rdma · 1

roc · 1

streaming · 1

tool-call · 1

tpu · 1

triton · 1

ui · 1

unified-radix-tree · 1

vlm · 1

聚合结果

attention 相关 PR

2026-06-07

2026-06-06