mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-03-13 02:30:36 +01:00
* Add paramater buffer pool, batching of submissions, refactor command building/submission * Add header for linux builds * Free staged parameter buffers at once * Format with clang-format * Fix thread-safe implementation * Use device implicit synchronization * Update workflow to use custom release * Remove testing branch workflow * Disable set_rows until it's implemented * Fix potential issue around empty queue submission * Try synchronous submission * Try waiting on all futures explicitly * Add debug * Add more debug messages * Work on getting ssh access for debugging * Debug on failure * Disable other tests * Remove extra if * Try more locking * maybe passes? * test * Some cleanups * Restore build file * Remove extra testing branch ci |
||
|---|---|---|
| .. | ||
| bench.yml.disabled | ||
| build-cmake-pkg.yml | ||
| build-linux-cross.yml | ||
| build.yml | ||
| close-issue.yml | ||
| docker.yml | ||
| editorconfig.yml | ||
| gguf-publish.yml | ||
| labeler.yml | ||
| pre-tokenizer-hashes.yml | ||
| python-check-requirements.yml | ||
| python-lint.yml | ||
| python-type-check.yml | ||
| release.yml | ||
| server.yml | ||
| update-ops-docs.yml | ||
| winget.yml | ||