Default Branch

ecd99d6a9a · docs: Fix intel documentation link (#20040) · Updated 2026-03-03 14:50:00 +01:00

Branches

c79d130f74 · make : fix speculative build · Updated 2023-09-04 14:50:04 +02:00

7018
9

847896aba7 · speculative : add --draft CLI arg · Updated 2023-09-03 12:51:07 +02:00

7024
3

8c2b881281 · cuda : poc for norm quants (only -b 1 works) · Updated 2023-08-30 20:42:28 +02:00

7065
3

b4e70822f6 · metal : add poc for normalized Q4_0 and Q4_1 · Updated 2023-08-30 17:47:16 +02:00

7065
7

488e03200e · Merge branch 'master' into gguf-publish-ci · Updated 2023-08-30 10:34:55 +02:00

7070
4

33a5517d87 · llama.cpp : print gguf version · Updated 2023-08-26 23:56:48 +02:00

7112
10

d34472c124 · Fix HellaSwag · Updated 2023-08-26 09:55:39 +02:00

7125
1

0248ca811e · gguf : add notes for tests · Updated 2023-08-25 08:08:05 +02:00

7137
10

977629a34e · Merge branch 'master' into fix-eos · Updated 2023-08-23 21:40:19 +02:00

7153
4

66a66a05a8 · readme : add notice about new file format · Updated 2023-08-21 21:42:14 +02:00

7182
253

6a9e6375b5 · gguf.py : indentation · Updated 2023-08-17 20:53:15 +02:00

7197
205

28046d1e52 · Merge and update · Updated 2023-08-08 23:36:11 +02:00

7250
12

511055722e · undo formatting · Updated 2023-07-28 08:09:14 +02:00

7279
26

af1c9966c8 · gguf : start write tensor info · Updated 2023-07-27 09:32:31 +02:00

7279
15

d273bfd2c9 · allocator: cleanup, more comments · Updated 2023-07-22 15:05:24 +02:00

7350
21

d45c1631bc · metal : rewrite to fit new backend interface correctly (WIP) · Updated 2023-07-20 21:51:19 +02:00

7350
18

0492363137 · mpi : fix after master merge · Updated 2023-07-09 21:23:04 +02:00

7381
21

26cc1bd7a2 · llama : uniform variable names + struct init · Updated 2023-07-05 22:22:17 +02:00

7398
4

ff6e39f138 · use javascript generators as much cleaner API · Updated 2023-07-05 21:03:01 +02:00

7411
20

f46db27ea0 · ci : disable FMA on Mac OS · Updated 2023-07-05 17:29:08 +02:00

7408
5