Default Branch

b2cb4512c5 · Create parameters overview (#1269) · Updated 2026-02-20 07:20:56 +01:00

Branches

adb6b6fb3f · Update GGML_QUANT_SIZES · Updated 2025-04-24 06:06:26 +02:00 · 4211 behind | 3616 ahead
e79f523bcc · BitNet adjustments · Updated 2025-04-22 08:36:32 +02:00 · 4211 behind | 3642 ahead
3d7206e6ea · Support both model names · Updated 2025-04-22 04:47:17 +02:00 · 4211 behind | 3643 ahead
d75c151624 · Attempt fix 13 · Updated 2025-04-21 08:56:05 +02:00 · 4211 behind | 3652 ahead
3e41c56a8a · Minor · Updated 2025-04-16 16:15:05 +02:00 · 4211 behind | 3641 ahead
3164fa3310 · Better gemm/gemv on AVX2 fr q4_0_r8 · Updated 2025-04-15 17:12:22 +02:00 · 4211 behind | 3638 ahead
a164a50a36 · We need also these · Updated 2025-04-15 12:56:29 +02:00 · 4211 behind | 3638 ahead
8bff04c9d6 · Use stripped tensor name, not src0->name · Updated 2025-04-14 18:00:06 +02:00 · 4211 behind | 3638 ahead
4ed6076940 · Add ability to hide imatrix details in llama-quantize · Updated 2025-04-14 15:36:57 +02:00 · 4211 behind | 3635 ahead
4291d7e1e6 · Minor · Updated 2025-04-13 07:34:43 +02:00 · 4211 behind | 3636 ahead
9b24ae7fc6 · Fix KLD precision · Updated 2025-04-12 09:01:20 +02:00 · 4211 behind | 3633 ahead
c5f1a0ad25 · Correct L4 rms_norm · Updated 2025-04-11 10:45:33 +02:00 · 4211 behind | 3632 ahead
b51661bbff · llama4: this seems to be working · Updated 2025-04-09 11:02:22 +02:00 · 4211 behind | 3632 ahead
80846bb2c9 · WIP · Updated 2025-04-08 16:16:16 +02:00 · 4211 behind | 3632 ahead
5ec2cb63ae · Guard against attempts to use MLA for non-MLA models · Updated 2025-04-08 08:45:08 +02:00 · 4211 behind | 3630 ahead
ae7cf9a766 · More · Updated 2025-04-07 16:58:09 +02:00 · 4211 behind | 3629 ahead
8b9be1a048 · Add copyright notices · Updated 2025-04-07 10:19:54 +02:00 · 4211 behind | 3623 ahead
0dbcd57267 · Try not repacking q8_0 for FA computations · Updated 2025-04-06 08:49:59 +02:00 · 4211 behind | 3623 ahead
c2bab6cee5 · We need to synchronize before using device to host async memcpy · Updated 2025-04-05 14:28:20 +02:00 · 4211 behind | 3623 ahead
fe157dee95 · Better iq2_xs quantization · Updated 2025-04-05 10:51:26 +02:00 · 4211 behind | 3623 ahead