llama.cpp/llama_cpp
2023-11-10 04:41:19 -05:00
server                Fix: default max_tokens matches openai api (16 for completion, max length for chat completion)  2023-11-10 02:49:27 -05:00
__init__.py           Bump version  2023-11-08 00:54:54 -05:00
_utils.py             Clean up stdout / stderr suppression  2023-11-03 13:02:15 -04:00
llama.py              Potential bugfix for eval  2023-11-10 04:41:19 -05:00
llama_chat_format.py  Fix: add default stop sequence to chatml chat format  2023-11-10 04:24:48 -05:00
llama_cpp.py          Update llama.cpp  2023-11-05 16:57:10 -05:00
llama_grammar.py      Add $ref and $defs support to json schema converter  2023-11-10 02:50:46 -05:00
llama_types.py        Add missing tool_calls finish_reason  2023-11-10 02:51:06 -05:00
llava_cpp.py          Multimodal Support (Llava 1.5) (#821)  2023-11-07 22:48:51 -05:00
py.typed              Add py.typed  2023-08-11 09:58:48 +02:00