Default Branch

6fce5c6a7d · opencl: add l2_norm (#20160) · Updated 2026-03-07 03:03:05 +01:00

Branches

47d604fa2d · fix issues · Updated 2023-11-05 13:20:22 +01:00

6745 behind · 3 ahead

3ef358fffd · Revert "cuda : use CUDA memory pool with async memory allocation/deallocation when available (#3903)" · Updated 2023-11-04 21:26:51 +01:00

6749 behind · 2 ahead

46868a499e · metal : multi-simd softmax · Updated 2023-11-01 20:16:34 +01:00

6774 behind · 1 ahead

a8796f9609 · llm : cleanup + comments · Updated 2023-11-01 19:08:02 +01:00

6783 behind · 4 ahead

7420bef83e · wip wip wip · Updated 2023-11-01 07:51:43 +01:00

6783 behind · 1 ahead

afb3929279 · Merge branch 'master' into llama-refactor · Updated 2023-10-31 19:35:31 +01:00

6785 behind · 21 ahead

29fe516913 · wip · Updated 2023-10-31 17:36:37 +01:00

6786 behind · 1 ahead

dab42893c9 · scripts : working curl pipe · Updated 2023-10-31 16:03:56 +01:00

6786 behind · 3 ahead

7923b70cb8 · llama : add llm_build_inp_embd helper · Updated 2023-10-31 15:43:08 +01:00

6791 behind · 37 ahead

4b3cb98d46 · ggml-impl : move extern "C" to start of file · Updated 2023-10-30 18:05:58 +01:00

6787 behind · 7 ahead

lto

bc28aaa8c2 · make : use -lfto=auto to avoid warnings and maintain perf · Updated 2023-10-30 15:00:53 +01:00

6787 behind · 5 ahead

15267192c0 · llama : refactor tensor offloading as callback · Updated 2023-10-29 12:04:36 +01:00

6791 behind · 15 ahead

8a86b95e87 · quantize : --pure option for disabling k-quant mixtures · Updated 2023-10-28 22:37:03 +02:00

6792 behind · 3 ahead

de7e0912b6 · convert : ignore tokens if their IDs are within [0, vocab_size) · Updated 2023-10-28 14:01:36 +02:00

6795 behind · 1 ahead

bbfc62ac2f · sampling : temp == 0.0 -> no probs, temp < 0.0 -> probs · Updated 2023-10-28 13:04:57 +02:00

6803 behind · 3 ahead

cd3e20fb50 · cuda : fix multi-gpu with tensor cores · Updated 2023-10-27 22:11:50 +02:00

6802 behind · 3 ahead

49af767fad · build : add compile option to force use of MMQ kernels · Updated 2023-10-27 12:21:04 +02:00

6804 behind · 7 ahead

d798a17c34 · cuda : add TODO for calling cublas from kernel + using mem pool · Updated 2023-10-24 15:33:24 +02:00

6818 behind · 10 ahead

6966474928 · cuda : play with faster Q4_0 dequantization · Updated 2023-10-24 09:29:40 +02:00

6818 behind · 8 ahead

b9bb4cbe86 · Separate bug and enhancement template + no default title · Updated 2023-10-23 17:59:11 +02:00

6818 behind · 1 ahead