-
Notifications
You must be signed in to change notification settings - Fork 830
Pull requests: ml-explore/mlx-lm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix broadcast crash in quantized SDPA with GQA + batched padding mask (batch >= 2)
#1467
opened Jul 4, 2026 by
pinglin
Loading…
Fix NewlineTokenizer registration for transformers >= 5.13
#1465
opened Jul 4, 2026 by
chandukona
Loading…
GLM-5.2: full/shared indexer typing for glm_moe_dsa (DSA schedule + interleaved indexer rope)
#1463
opened Jul 4, 2026 by
machiabeli
•
Draft
Fix AutoTokenizer.register() for transformers 5.13.0+ compatibility
#1459
opened Jul 3, 2026 by
jonpspri
Loading…
qwen3_5: load in-checkpoint MTP head + speculative rollback for hybrid (GDN) caches
#1456
opened Jul 3, 2026 by
pierre427
Loading…
fix(mlx_lm.server): fail fast when --draft-model set with non-trimmable cache
#1455
opened Jul 2, 2026 by
tejkas
Loading…
DeepSeek-V3.2/GLM DSA: fix silent >128k top-k corruption + sparse-gather prefill
#1454
opened Jul 2, 2026 by
aidiffuser
Loading…
Fix DSA indexer LoRA-training crash: stop gradients through sparse-attention top-k indices
#1452
opened Jul 2, 2026 by
trevorgordon981
Loading…
Fix Mistral tool parser dropping parallel/multiple tool calls
#1448
opened Jul 2, 2026 by
DavidObando
Loading…
Fix dropped tool calls for models with empty tool_call_end (Mistral/Devstral)
#1447
opened Jul 1, 2026 by
DavidObando
Loading…
Fix frozen PRNG in categorical_sampling under repeated sampling
#1444
opened Jun 30, 2026 by
utkarshtiwari-24
Loading…
Fix qwen3.5-MoE garbage output: don't double-shift RMSNorm on MTP-retaining checkpoints
#1442
opened Jun 29, 2026 by
embwl0x
Loading…
Make RotatingKVCache trimmable so prefix cache reuse works for sliding-window models
#1437
opened Jun 26, 2026 by
amirarsalan90
Loading…
Fix: pythonic tool parser not auto-detected for LFM2.5 models
#1436
opened Jun 25, 2026 by
grumdahl
Loading…
fix: use FiscalNote/billsum HF dataset path in test_datsets
#1434
opened Jun 25, 2026 by
ttxs69
Loading…
server: add --lazy CLI flag for deferred model loading
#1429
opened Jun 24, 2026 by
cyq1017
Loading…
feat: add batch_generate_same_prompt for SSM/hybrid models
#1422
opened Jun 21, 2026 by
HaoXuAI
Contributor
Loading…
GLM-5.2 (glm_moe_dsa) inference: IndexShare, indexer RoPE/eps, int8 MLA-KV
#1419
opened Jun 21, 2026 by
avlp12
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.