Default Branch

137435ff15 · kleidiai : add sme fp16 compute path for q4_0 gemm on aarch64 (#20043) · Updated 2026-03-03 10:40:26 +01:00

Branches

2763dc8b53 · ggml-quants : handle zero amax for MXFP4 · Updated 2025-08-06 22:26:25 +02:00

2096
2

ea5e55d03e · Merge branch 'master' into compilade/imatrix-neutral-prior · Updated 2025-08-05 19:34:40 +02:00

2098
4

2ec70c964b · tests: Fix OPT_STEP_SGD test-backend-ops · Updated 2025-08-05 06:57:14 +02:00

2104
4

145401c9e3 · context : fix logits size overflow for huge batches · Updated 2025-08-05 04:26:46 +02:00

2103
2

342e7014db · imatrix : only warn about suffix when output format is unspecified · Updated 2025-08-04 21:12:27 +02:00

2108
2

e549515cb3 · memory : handle kv_unified for hybrid models · Updated 2025-08-03 06:45:47 +02:00

2117
1

91e67b8583 · imatrix : fix 3d tensor counts · Updated 2025-07-31 17:56:38 +02:00

2145
4

b98f80a6b4 · server : test alternative LRU logic · Updated 2025-07-29 20:19:21 +02:00

2166
1

0591b39e48 · ops: add MUSA · Updated 2025-07-29 11:25:32 +02:00

2172
1

381879e0ac · cont : tmp · Updated 2025-07-29 06:42:55 +02:00

2196
3

fb371c18ec · bench,common : add CPU extra buffer types · Updated 2025-07-28 20:53:18 +02:00

2173
1

e9f7e7cce2 · ops : update BLAS · Updated 2025-07-28 08:42:57 +02:00

2183
1

a5801f408f · sync : ggml · Updated 2025-07-25 13:31:39 +02:00

2202
2

6f4c57236b · server : fix vision test regex · Updated 2025-07-25 10:22:36 +02:00

2224
1

e65aa69402 · context : only sort outputs when needed · Updated 2025-07-24 17:06:34 +02:00

2211
1

a124399f19 · sched : fix multiple evaluations of the same graph with pipeline parallelism · Updated 2025-07-24 16:03:14 +02:00

2211
1

978c88ba0a · cont : add TODO · Updated 2025-07-24 15:31:10 +02:00

2213
2

1ef3cc1a87 · imatrix : use GGUF regardless of the output filename · Updated 2025-07-24 05:22:41 +02:00

2218
2

55cf48de1e · cuda : fix multi-seq, quantized FA · Updated 2025-07-22 19:48:53 +02:00

2260
2

0a0af0dbbd · Vulkan: Fix fprintf format-security warning · Updated 2025-07-19 11:45:31 +02:00

2254
1