llama.cpp/llama_cpp
2023-07-07 19:28:53 -04:00
..
server Add setting to control request interruption 2023-07-07 03:37:23 -04:00
__init__.py Black formatting 2023-03-24 14:59:29 -04:00
llama.py perf: avoid allocating new buffers during sampling 2023-07-07 19:28:53 -04:00
llama_cpp.py Update llama.cpp 2023-07-06 17:57:56 -04:00
llama_types.py Allow first logprob token to be null to match openai api 2023-05-19 02:04:57 -04:00