llama.cpp

mirror of https://github.com/ggerganov/llama.cpp synced 2026-03-02 05:09:23 +01:00

History

Georgi Gerganov f5a77a629b Introduce C-style API (#370 ) * Major refactoring - introduce C-style API * Clean up * Add <cassert> * Add <iterator> * Add <algorithm> .... * Fix timing reporting and accumulation * Measure eval time only for single-token calls * Change llama_tokenize return meaning	2023-03-22 07:32:36 +02:00
..
ggml-vocab.bin	Introduce C-style API (#370 )	2023-03-22 07:32:36 +02:00

Georgi Gerganov f5a77a629b

* Major refactoring - introduce C-style API

* Clean up

* Add <cassert>

* Add <iterator>

* Add <algorithm> ....

* Fix timing reporting and accumulation

* Measure eval time only for single-token calls

* Change llama_tokenize return meaning

2023-03-22 07:32:36 +02:00

ggml-vocab.bin

Introduce C-style API (#370 )

2023-03-22 07:32:36 +02:00