llama.cpp/tools/mtmd/models
Xuan-Son Nguyen 21a4933042
mtmd: qwen3 audio support (qwen3-omni and qwen3-asr) (#19441)
* add qwen3a

* wip

* vision ok

* no more deepstack for audio

* convert ASR model ok

* qwen3 asr working

* Apply suggestions from code review

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* nits

* Apply suggestions from code review

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* fix bad merge

* fix multi inheritance

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2026-04-12 23:57:25 +02:00
..
cogvlm.cpp
conformer.cpp
deepseekocr.cpp
dotsocr.cpp
gemma4a.cpp
gemma4v.cpp
glm4v.cpp
hunyuanocr.cpp
internvl.cpp
kimik25.cpp
kimivl.cpp
llama4.cpp
llava.cpp
minicpmv.cpp
mobilenetv5.cpp
models.h
nemotron-v2-vl.cpp
paddleocr.cpp
pixtral.cpp
qwen2vl.cpp
qwen3a.cpp
qwen3vl.cpp
siglip.cpp
step3vl.cpp
whisper-enc.cpp
youtuvl.cpp