17 Commits (436e56193199a1625f8c561069f702e8840a9e08)

Author SHA1 Message Date
Stephan Walter 436e561931
all : be more strict about converting float to double (#458)
* Be more strict about converting float to double

* Test equivalence of round, SILU implementations

Test module is commented out in CMakeLists.txt because the tests may
take a long time, depending on how much the compiler optimizes.

* Fix softmax in perplexity.cpp

* all : prefer float over double where appropriate

* perplexity : add <cmath>

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
1 year ago
Stephan Walter c1f885067c
ggml : introduce structs for the q4 data blocks (#356)
* Introduce structs for the q4 data blocks

* ggml : rename quant struct variables + fix ARM_NEON

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
1 year ago
anzz1 7b8dbcb78b
main.cpp fixes, refactoring (#571)
- main: entering empty line passes back control without new input in interactive/instruct modes
- instruct mode: keep prompt fix
- instruct mode: duplicate instruct prompt fix
- refactor: move common console code from main->common
1 year ago
Marco Matthies 7e5395575a
Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542)
1 year ago
Stephan Walter b391579db9
Update README and comments for standalone perplexity tool (#525)
1 year ago
anzz1 7a87d31f4f
[main] fix infinite generation (-n == -1) (#523)
1 year ago
Harald Fernengel 33e35b8fe8
Exit from interactive mode if input stream is bad (#491)
Allow exiting the interactive prompt also with CTRL-D on Unix and CTRL-Z
on Windows.
1 year ago
anzz1 34ab526843
(Windows) Set console to UTF-8 on init (#420)
Sets console codepage to 65001 (CP_UTF8) on start for both input and output, should fix problems with UTF-8 characters.
1 year ago
Georgi Gerganov c2b25b6912
Fix colors enabling on WIN32
1 year ago
Georgi Gerganov 79b2b266db
If n_predict == -1, generate forever
1 year ago
Georgi Gerganov e2d490dafd
Infinite generation via context swapping (#71)
1 year ago
Georgi Gerganov 03f7e33560
Cleanup STL headers + fix embedding examples + minor stuff
1 year ago
Georgi Gerganov 55ad42af84
Move chat scripts into "./examples"
1 year ago
Georgi Gerganov a316a425d0
Overhaul the examples structure
- main -> examples
- utils -> examples (renamed to "common")
- quantize -> examples
- separate tools for "perplexity" and "embedding"

Hope I didn't break something !
1 year ago
Georgi Gerganov 04c6f5ed6f
Immediately start processing the prompt before user input has been provided (#476)
1 year ago
Mathieu Nayrolles 3f9c6135e4
fix typo in chatLLaMa (#368)
The prompt contains a typo where 'alound' is used instead of 'aloud'.
1 year ago
Jean-Christophe Hoelt 3ab3e6582f
Add chatLLaMa script (#198)
* Add chatLLaMa script

* Fix shellcheck errors and do some cleanup

* Move chatLLaMa script to `examples` directory

* Reduce chatLLaMa context size to 2048

Ref d7def1a752

* Include n_predict to 2048 in examples/chatLLaMa
1 year ago