llama.cpp

History

Andrei Betlen 11eae75211 perf: avoid allocating new buffers during sampling		2023-07-07 19:28:53 -04:00
..
server	Add setting to control request interruption	2023-07-07 03:37:23 -04:00
__init__.py	Black formatting	2023-03-24 14:59:29 -04:00
llama.py	perf: avoid allocating new buffers during sampling	2023-07-07 19:28:53 -04:00
llama_cpp.py	Update llama.cpp	2023-07-06 17:57:56 -04:00
llama_types.py	Allow first logprob token to be null to match openai api	2023-05-19 02:04:57 -04:00