Uh oh!

There was an error while loading. Please reload this page.

ml-explore / mlx-lm Public

Notifications You must be signed in to change notification settings
Fork 830
Star 6.2k

Code
Issues 180
Pull requests 243
Discussions
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security and quality
Insights

Pull requests: ml-explore/mlx-lm

Labels 9 Milestones 0

New pull request New

243 Open 689 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix broadcast crash in quantized SDPA with GQA + batched padding mask (batch >= 2)

#1467 opened Jul 4, 2026 by pinglin

Loading…

Add quant2 / quant2_128 mixed-bit quant recipes

#1466 opened Jul 4, 2026 by dahai80

Loading…

Fix NewlineTokenizer registration for transformers >= 5.13

#1465 opened Jul 4, 2026 by chandukona

Loading…

Add LongCat-2.0

#1464 opened Jul 4, 2026 by kernelpool Contributor

Loading…

GLM-5.2: full/shared indexer typing for glm_moe_dsa (DSA schedule + interleaved indexer rope)

#1463 opened Jul 4, 2026 by machiabeli • Draft

Fix import crash with transformers >= 5.13

#1460 opened Jul 3, 2026 by Lazarus-931

Loading…

Fix AutoTokenizer.register() for transformers 5.13.0+ compatibility

#1459 opened Jul 3, 2026 by jonpspri

Loading…

qwen3_5: load in-checkpoint MTP head + speculative rollback for hybrid (GDN) caches

#1456 opened Jul 3, 2026 by pierre427

Loading…

fix(mlx_lm.server): fail fast when --draft-model set with non-trimmable cache

#1455 opened Jul 2, 2026 by tejkas

Loading…

DeepSeek-V3.2/GLM DSA: fix silent >128k top-k corruption + sparse-gather prefill

#1454 opened Jul 2, 2026 by aidiffuser

Loading…

Fix DSA indexer LoRA-training crash: stop gradients through sparse-attention top-k indices

#1452 opened Jul 2, 2026 by trevorgordon981

Loading…

Fix Mistral tool parser dropping parallel/multiple tool calls

#1448 opened Jul 2, 2026 by DavidObando

Loading…

Fix dropped tool calls for models with empty tool_call_end (Mistral/Devstral)

#1447 opened Jul 1, 2026 by DavidObando

Loading…

Fix frozen PRNG in categorical_sampling under repeated sampling

#1444 opened Jun 30, 2026 by utkarshtiwari-24

Loading…

Fix qwen3.5-MoE garbage output: don't double-shift RMSNorm on MTP-retaining checkpoints

#1442 opened Jun 29, 2026 by embwl0x

Loading…

Feature/layer streaming

#1440 opened Jun 28, 2026 by SashimiSaketoro

Loading…

Make RotatingKVCache trimmable so prefix cache reuse works for sliding-window models

#1437 opened Jun 26, 2026 by amirarsalan90

Loading…

Fix: pythonic tool parser not auto-detected for LFM2.5 models

#1436 opened Jun 25, 2026 by grumdahl

Loading…

fix: use FiscalNote/billsum HF dataset path in test_datsets

#1434 opened Jun 25, 2026 by ttxs69

Loading…

Add dedicated tests for AFM7 model

#1432 opened Jun 24, 2026 by Sreya8

Loading…

server: add --lazy CLI flag for deferred model loading

#1429 opened Jun 24, 2026 by cyq1017

Loading…

Fix speculative decode with full rotating caches

#1427 opened Jun 23, 2026 by cyq1017

Loading…

tests: add dedicated unit tests for Gemma3n model

#1424 opened Jun 23, 2026 by Sreya8

Loading…

feat: add batch_generate_same_prompt for SSM/hybrid models

#1422 opened Jun 21, 2026 by HaoXuAI Contributor

Loading…

GLM-5.2 (glm_moe_dsa) inference: IndexShare, indexer RoPE/eps, int8 MLA-KV

#1419 opened Jun 21, 2026 by avlp12

Loading…

Previous 1 2 3 4 5 … 9 10 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!