mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-03-18 05:00:48 +01:00
This commit adds a --verbose flag to the run-org-model.py script to enable or disable detailed debug output, such as input and output tensors for each layer. Debug utilities (summarize, debug_hook, setup_rope_debug) have been moved to utils/common.py. The motivation for this is that the detailed debug output can be useful for diagnosing issues with model conversion or execution, but it can also produce a large amount of output that may not always be needed. The script will also be further cleaned/refactored in follow-up commits. |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| check-nmse.py | ||
| common.py | ||
| create-collection-add-model.sh | ||
| curl-embedding-server.sh | ||
| hf-add-model-to-collection.py | ||
| hf-create-collection.py | ||
| hf-create-model.py | ||
| hf-upload-gguf-model.py | ||
| inspect-converted-model.sh | ||
| inspect-org-model.py | ||
| perplexity-gen.sh | ||
| perplexity-run-simple.sh | ||
| perplexity-run.sh | ||
| quantize.sh | ||
| run-embedding-server.sh | ||
| semantic_check.py | ||