mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-03-19 21:51:36 +01:00
This commit refactors the original model embedding script to include a device selection option. Users can now specify the device (cpu, cuda, mps, auto) via command-line arguments. It also refactors the code to be more structured. |
||
|---|---|---|
| .. | ||
| causal | ||
| embedding | ||
| utils | ||