ik_llama.cpp

History

Kawrakow 87e4762720 Port mdmd from mainline + Qwen2/2.5-VL support (#798 ) * Add mtmd: the beginning * Add mtmd: mtmd.cpp compiles * Add mtmd: clip initialization compiles * Add mtmd: clip.cpp compiles * Add mtmd: builds successfully * Add CPU implementation for GGML_OP_GLU * Add CUDA implementation for GGML_OP_GLU * Add CPU implementation for GGML_OP_CONV_2D and GGML_OP_CONV_2D_DW * Add CUDA implementation for GGML_OP_CONV_2D and GGML_OP_CONV_2D_DW * Add mtmd: refresh CPU rope * Add mtmd: refresh CUDA rope * Add mtmd: add Qwen2-VL * Add mtmd: Qwen2.5-VL text seems to work with this change * Add mtmd: fix swiglu * Add mtmd: use LOG_TEE so generated tokens show up in terminal * Add mtmd: do not attempt to load a GPU backend if none are available * GLU, not GPU * Fix typo * Fix new/free mismatch * LOG stuff * Add mtmd: this fixes gibberish on second image --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>		2025-09-27 08:45:29 +02:00
..
ggml-alloc.h	Merge mainline llama.cpp (#3 )	2024-07-27 07:55:01 +02:00
ggml-backend.h	Offload only activated experts to the GPU (#698 )	2025-09-04 12:22:30 +02:00
ggml-blas.h	Merge mainline llama.cpp (#3 )	2024-07-27 07:55:01 +02:00
ggml-cann.h	Merge mainline llama.cpp (#3 )	2024-07-27 07:55:01 +02:00
ggml-cpp.h	Port mdmd from mainline + Qwen2/2.5-VL support (#798 )	2025-09-27 08:45:29 +02:00
ggml-cuda.h	Merge mainline - Aug 12 2024 (#17 )	2024-08-12 15:14:32 +02:00
ggml-kompute.h	Merge mainline llama.cpp (#3 )	2024-07-27 07:55:01 +02:00
ggml-metal.h	Merge mainline - Aug 12 2024 (#17 )	2024-08-12 15:14:32 +02:00
ggml-rpc.h	Fix non rpc build error (#506 )	2025-06-08 17:27:00 +03:00
ggml-sycl.h	Merge mainline llama.cpp (#3 )	2024-07-27 07:55:01 +02:00
ggml-vulkan.h	Vulkan: a fresh start (#608 )	2025-07-15 08:03:13 +02:00
ggml.h	Port mdmd from mainline + Qwen2/2.5-VL support (#798 )	2025-09-27 08:45:29 +02:00