llama.cpp/llama_cpp
Andrei ab028cb878
Migrate inference to llama_batch and llama_decode api (#795)
* Add low-level batching notebook

* fix: tokenization of special characters (#850)

It should behave like llama.cpp, where most out-of-the-box usages
treat special characters accordingly

* Update CHANGELOG

* Cleanup

* Fix runner label

* Update notebook

* Use llama_decode and batch api

* Support logits_all parameter

---------

Co-authored-by: Antoine Lizee <antoine.lizee@gmail.com>
2023-11-02 20:13:57 -04:00
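The head commit moves the wrapper's inference loop from the old eval path onto llama.cpp's llama_batch / llama_decode API. As a rough illustration of what that API looks like when driven through the low-level llama_cpp bindings, here is a minimal sketch: the model path is a placeholder, and the exact binding signatures (the NUMA flag to llama_backend_init, the trailing add_bos/special flags to llama_tokenize, the availability of llama_batch_get_one) are assumptions that shift between llama-cpp-python versions, so treat it as an outline rather than the notebook added in the PR.

```python
# Minimal sketch (not the PR's notebook): prompt evaluation with
# llama_batch / llama_decode via the low-level llama_cpp bindings.
# Model path is a placeholder; signatures vary between versions.
import llama_cpp

llama_cpp.llama_backend_init(False)  # assumed signature: single NUMA flag

# Load the model and create a context with default parameters.
model_params = llama_cpp.llama_model_default_params()
model = llama_cpp.llama_load_model_from_file(b"./models/model.gguf", model_params)
ctx_params = llama_cpp.llama_context_default_params()
ctx = llama_cpp.llama_new_context_with_model(model, ctx_params)

# Tokenize the prompt; the trailing flags are add_bos and special
# (the special-token handling that the #850 fix above concerns).
prompt = b"The quick brown fox"
max_tokens = 64
tokens = (llama_cpp.llama_token * max_tokens)()
n_prompt = llama_cpp.llama_tokenize(
    model, prompt, len(prompt), tokens, max_tokens, True, True
)

# Evaluate the whole prompt as one batch. llama_batch_get_one builds a
# batch view over the token array (positions 0..n-1, sequence id 0);
# logits are produced for the last position only.
batch = llama_cpp.llama_batch_get_one(tokens, n_prompt, 0, 0)
if llama_cpp.llama_decode(ctx, batch) != 0:
    raise RuntimeError("llama_decode failed")

# Greedy-pick the next token from the last position's logits.
n_vocab = llama_cpp.llama_n_vocab(model)
logits = llama_cpp.llama_get_logits(ctx)
next_token = max(range(n_vocab), key=lambda i: logits[i])
print("next token id:", next_token)

# Tear down in reverse order of creation.
llama_cpp.llama_free(ctx)
llama_cpp.llama_free_model(model)
llama_cpp.llama_backend_free()
```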
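The "Support logits_all parameter" bullet refers to keeping logits for every evaluated position rather than only the last one, which the high-level API exposes as the logits_all constructor flag (needed, for instance, to return per-token logprobs). A hedged usage sketch, again with a placeholder model path:

```python
from llama_cpp import Llama

# logits_all=True retains logits for every token so per-token logprobs
# can be reported; model path is a placeholder.
llm = Llama(model_path="./models/model.gguf", logits_all=True)
out = llm("Q: Name the planets in order. A:", max_tokens=16, logprobs=5)
# OpenAI-style completion dict; top_logprobs[0] maps candidate tokens
# of the first generated position to their log-probabilities.
print(out["choices"][0]["logprobs"]["top_logprobs"][0])
```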
Name | Last commit message | Last commit date
server | Update llama.cpp | 2023-11-02 14:28:15 -04:00
__init__.py | Bump version | 2023-11-02 15:53:01 -04:00
llama.py | Migrate inference to llama_batch and llama_decode api (#795) | 2023-11-02 20:13:57 -04:00
llama_chat_format.py | Fix repeat greeting (#808) | 2023-10-15 13:52:21 -04:00
llama_cpp.py | Update llama.cpp | 2023-11-02 14:28:15 -04:00
llama_grammar.py | Fix typos in llama_grammar | 2023-08-17 21:00:44 +09:00
llama_types.py | Update llama_types.py (#849) | 2023-11-01 18:50:11 -04:00
py.typed | Add py.typed | 2023-08-11 09:58:48 +02:00
utils.py | Fix name 'open' is not defined exception. Closes #860 | 2023-11-02 15:30:55 -04:00