Default Branch

c5a778891b · ggml: add GATED_DELTA_NET op (#19504) · Updated 2026-03-07 08:41:10 +01:00

Branches

b9bb4cbe86 · Separate bug and enhancement template + no default title · Updated 2023-10-23 17:59:11 +02:00

6818
1

c0f4d54870 · server : add comment about changing slot_state to bool · Updated 2023-10-22 21:24:39 +02:00

6825
72

cb79f8a2d8 · llama : add SKIP_KQ_KQV option · Updated 2023-10-22 08:58:29 +02:00

6825
3

56ba00b923 · sampling : hide prev behind API and apply #3661 · Updated 2023-10-20 17:53:27 +02:00

6828
6

ad2727d091 · Merge branch 'master' into speculative-tree · Updated 2023-10-18 09:50:58 +02:00

6839
18

932589c0ef · Honor -ngl option for Cuda offloading in llava · Updated 2023-10-14 02:12:10 +02:00

6853
1

5261aee8d8 · sampling : one sequence per sampling context · Updated 2023-10-12 19:36:44 +02:00

6856
1

2fcdf869cd · batched-bench : add mmq CLI arg · Updated 2023-10-11 18:42:33 +02:00

6868
7

ee7456926e · ggml-alloc : fix assert in debug builds · Updated 2023-10-09 14:33:12 +02:00

6877
1

ee268b5446 · llama : no longer perform uninitialized access to the KV cache · Updated 2023-10-08 10:49:38 +02:00

6884
5

acead654d2 · Merge branch 'master' into fix-refact · Updated 2023-10-08 10:25:16 +02:00

6884
4

6b9554a740 · metal : print more GPU info + disable mul_mm for MTLGPUFamiliy < Apple7 · Updated 2023-10-08 08:55:13 +02:00

6891
5

ba44776dc2 · bump version · Updated 2023-10-07 20:47:48 +02:00

6890
6

5ab6c2132a · server-parallel : add "--reverse-prompt" + compiler warning fixes · Updated 2023-10-06 13:32:19 +02:00

6903
4

5418932b71 · llama : fix comments for llama_kv_cache API · Updated 2023-10-03 20:01:52 +02:00

6928
5

c5650ed470 · server : avoid context swaps by shifting the KV cache · Updated 2023-09-28 18:03:36 +02:00

6952
57

72e7ef4e53 · simple : fixes · Updated 2023-09-26 23:19:36 +02:00

6978
48

784d14ed31 · llama : store non-RoPEd K cache (WIP) · Updated 2023-09-17 22:43:07 +02:00

6990
5

92a4f86879 · llama : make starcoder graph build more consistent with others · Updated 2023-09-15 16:57:10 +02:00

7000
20

e7e7b11455 · llama : remove experimental stuff · Updated 2023-09-14 21:52:01 +02:00

7012
3