1292 commits

Author SHA1 Message Date
Radoslav Gerganov 8e44a32075
Add support for running the server with SSL (#994) 2023-12-11 20:47:11 -05:00
Tanner Hobson ef22e478db
Replace logits_to_logprobs implementation with numpy equivalent to llama.cpp (#991)
See #990. This change makes the logits_to_logprobs function equivalent to the version in the llama.cpp repository. It uses numpy so it's much faster than the previous version.
2023-12-11 20:46:27 -05:00
zocainViken ac35f68e4d
Fix UnsupportedOperation: fileno in suppress_stdout_stderr (#961)
* bug fixing

* llava example from the README raised this error: UnsupportedOperation: fileno. Quick fix: check with hasattr first

* multi modal params fix: add logits = True -> to make llava work

---------

Co-authored-by: Andrei <abetlen@gmail.com>
2023-12-11 20:44:51 -05:00
chiensen b938cccf05
Add Pygmalion chat format (#986) 2023-12-11 20:44:04 -05:00
zocainViken 6bbeea07ae
README.md multimodal params fix (#967)
multi modal params fix: add logits = True -> to make llava work
2023-12-11 20:41:38 -05:00
Aniket Maurya c1d92ce680
fix minor typo (#958)
* fix minor typo

* Fix typo

---------

Co-authored-by: Andrei <abetlen@gmail.com>
2023-12-11 20:40:38 -05:00
Andrei Betlen e9bc4c4baf Fix docker build 2023-12-11 10:39:51 -05:00
Andrei Betlen c1e73e73a3 Bump version 2023-12-11 10:26:42 -05:00
Andrei Betlen ec26f364cc Remove f16_kv 2023-12-11 10:25:37 -05:00
Andrei Betlen f1edc66b21 Update llama.cpp 2023-12-11 10:21:35 -05:00
Andrei Betlen f3b844ed0a Update llama.cpp 2023-11-29 05:40:22 -05:00
kddubey b069d06346
Fix #891 (#952) 2023-11-29 05:39:52 -05:00
Andrei Betlen ad963a0961 Bump version 2023-11-28 04:58:20 -05:00
Andrei Betlen e3941d9c67 Make building llava optional 2023-11-28 04:55:21 -05:00
Andrei Betlen 74f1949206 Update llama.cpp 2023-11-28 04:54:51 -05:00
Andrei Betlen fb32f9d438 docs: Update README 2023-11-28 03:15:01 -05:00
Andrei Betlen 43e006a291 docs: Remove divider 2023-11-28 02:41:50 -05:00
Andrei Betlen 2cc6c9ae2f docs: Update README, add FAQ 2023-11-28 02:37:34 -05:00
Andrei Betlen 7f3704b896 Bump version 2023-11-27 19:14:25 -05:00
Andrei Betlen f99b2385ee Update llama.cpp 2023-11-27 19:03:10 -05:00
Andrei Betlen 396dbf0b2b docs: Improve low-level docstrings 2023-11-27 19:03:02 -05:00
Andrei Betlen 9c68b1804a docs: Add api reference links in README 2023-11-27 18:54:07 -05:00
Andrei Betlen 174ef3ddf6 docs: Add headings to API reference 2023-11-27 18:42:15 -05:00
Andrei Betlen 41428244f0 docs: Fix README indentation 2023-11-27 18:29:13 -05:00
Andrei Betlen 1539146a5e docs: Fix README docs link 2023-11-27 18:21:00 -05:00
Andrei Betlen a928893d03 Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-11-26 15:57:13 -05:00
Andrei Betlen 6308f21d5e docs: Update Llama docs 2023-11-26 15:56:40 -05:00
Anton Vice aa5a7a1880
Update README.md (#940)
.ccp >> .cpp
2023-11-26 15:39:38 -05:00
Gardner Bickford c2d63a7148
fix: Typo in the Open Orca chat format #874 (#947) 2023-11-26 15:39:18 -05:00
Andrei Betlen f03a38e62a Update llama.cpp 2023-11-26 15:38:22 -05:00
Andrei Betlen 1a7bf2037b docs: Update openapi endpoint names 2023-11-24 03:39:29 -05:00
Andrei Betlen 4026166e68 docs: Update completion and chat_completion parameter docstrings 2023-11-24 03:24:19 -05:00
Andrei Betlen 945e20fa2c docs: update link 2023-11-24 00:18:32 -05:00
Andrei Betlen e6a36b840e docs: edit function calling docs 2023-11-24 00:17:54 -05:00
Andrei Betlen 8c3aa7858b Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-11-24 00:15:36 -05:00
Andrei Betlen 19e02f1f87 docs: Add link to function calling notebook 2023-11-24 00:15:02 -05:00
Andrei Betlen de2e2bc083 misc fix verbose printing in functionary model 2023-11-23 20:14:23 -05:00
Andrei Betlen 36048d46af Update llama.cpp 2023-11-23 16:26:00 -05:00
mrfakename d68fc07b1b
Add Zephyr format (#937) 2023-11-23 01:20:08 -05:00
caiyesd 4184835078
Add chat format to support baichuan (#938)
Signed-off-by: caiyesd <caiyesd@gmail.com>
2023-11-23 01:19:50 -05:00
Andrei Betlen 4474157949 ci: tag built docker images with current version 2023-11-23 01:06:47 -05:00
Andrei Betlen 21abefa488 docs: Add grammar and types to api reference 2023-11-23 00:27:41 -05:00
Andrei Betlen 6aab77de04 docs: Fix module import bug 2023-11-23 00:27:22 -05:00
Andrei Betlen c647f01609 Add from_json_schema to LlamaGrammar 2023-11-23 00:27:00 -05:00
Andrei Betlen be1f64d569 docs: Add docstrings from llama.cpp 2023-11-23 00:26:26 -05:00
Andrei Betlen 31cf0ec680 docs: Fix mkdocstrings heading level 2023-11-22 23:45:19 -05:00
Andrei Betlen e349f314b4 docs: Fix API Reference page 2023-11-22 23:45:02 -05:00
Andrei Betlen b6bb7ac76a docs: Add Llama class example 2023-11-22 23:10:04 -05:00
Andrei Betlen c5173b0fb3 docs: Configure mkdocstrings 2023-11-22 23:09:42 -05:00
Andrei Betlen 3303ebe92b docs: Add dark mode and pymarkdown extensions 2023-11-22 22:47:22 -05:00
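
Note on ef22e478db (#991): the commit message says logits_to_logprobs was rewritten with numpy to match llama.cpp. The actual implementation is not shown in this log, so the following is only a minimal sketch of a numerically stable log-softmax in numpy, which is the standard way to turn logits into log-probabilities. The function name `logits_to_logprobs_sketch` and its `axis` parameter are assumptions made for illustration and are not taken from the repository.

```python
import numpy as np


def logits_to_logprobs_sketch(logits, axis=-1):
    """Illustrative log-softmax; NOT the code from commit ef22e478db."""
    logits = np.asarray(logits, dtype=np.float64)
    # Subtract the maximum so that exp() cannot overflow for large logits.
    shifted = logits - np.max(logits, axis=axis, keepdims=True)
    # log(sum(exp(x))) over the chosen axis, computed on the shifted values.
    log_sum_exp = np.log(np.sum(np.exp(shifted), axis=axis, keepdims=True))
    # log softmax(x) = x - logsumexp(x); the shift cancels out.
    return shifted - log_sum_exp


if __name__ == "__main__":
    logprobs = logits_to_logprobs_sketch([1.0, 2.0, 3.0])
    # Exponentiating log-probabilities should give values that sum to 1.
    print(logprobs, np.exp(logprobs).sum())
```

Working on whole arrays at once, instead of looping over vocabulary entries in Python, is the usual reason a numpy version is faster, which is consistent with the speed-up the commit message describes.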