llama.cpp

Commit Graph

Author	SHA1	Message	Date
Pavol Rusnak	8b679987cd	Fix whitespace, add .editorconfig, add GitHub workflow (#883)	1 year ago
qouoq	a0caa34b16	Add BAIR's Koala to supported models (#877)	1 year ago
Pavol Rusnak	d2beca95dc	Make docker instructions more explicit (#785)	1 year ago
Georgi Gerganov	3416298929	Update README.md	1 year ago
Georgi Gerganov	8d10406d6e	readme : change logo + add bindings + add uis + add wiki	1 year ago
Adithya Balaji	594cc95fab	readme : update with CMake and windows example (#748) * README: Update with CMake and windows example * README: update with code-review for cmake build	1 year ago
Thatcher Chamberlin	d8d4e865cd	Add a missing step to the gpt4all instructions (#690) `migrate-ggml-2023-03-30-pr613.py` is needed to get gpt4all running.	1 year ago
rimoliga	d0a7f742e7	readme: replace termux links with homepage, play store is deprecated (#680)	1 year ago
Pavol Rusnak	9733104be5	drop quantize.py (now that models are using a single file)	1 year ago
Georgi Gerganov	3df890aef4	readme : update supported models	1 year ago
Georgi Gerganov	b467702b87	readme : fix typos	1 year ago
Georgi Gerganov	516d88e75c	readme : add GPT4All instructions (close #588)	1 year ago
Stephan Walter	b391579db9	Update README and comments for standalone perplexity tool (#525)	1 year ago
Georgi Gerganov	348d6926ee	Add logo to README.md	1 year ago
Georgi Gerganov	55ad42af84	Move chat scripts into "./examples"	1 year ago
Georgi Gerganov	4a7129acd2	Remove obsolete information from README	1 year ago
Gary Mulder	f4f5362edb	Update README.md (#444) Added explicit bolded instructions clarifying that people need to request access to models from Facebook and never through this repo.	1 year ago
Georgi Gerganov	b6b268d441	Add link to Roadmap discussion	1 year ago
Stephan Walter	a50e39c6fe	Revert "Delete SHA256SUMS for now" (#429) * Revert "Delete SHA256SUMS for now (#416)" This reverts commit `8eea5ae0e5`. * Remove ggml files until they can be verified * Remove alpaca json * Add also model/tokenizer.model to SHA256SUMS + update README --------- Co-authored-by: Pavol Rusnak <pavol@rusnak.io>	1 year ago
Gary Mulder	8a3e5ef801	Move model section from issue template to README.md (#421) * Update custom.md * Removed Model section as it is better placed in README.md * Updates to README.md model section * Inserted text that was removed from issue template about obtaining models from FB and links to papers describing the various models * Removed IPF down links for the Alpaca 7B models as these look to be in the old data format and probably shouldn't be directly linked to, anyway * Updated the perplexity section to point at Perplexity scores #406 discussion	1 year ago
Georgi Gerganov	93208cfb92	Adjust repetition penalty ..	1 year ago
Georgi Gerganov	03ace14cfd	Add link to recent podcast about whisper.cpp and llama.cpp	1 year ago
Gary Linscott	40ea807a97	Add details on perplexity to README.md (#395)	1 year ago
Georgi Gerganov	56817b1f88	Remove temporary notice and update hot topics	1 year ago
Gary Mulder	da0e9fe90c	Add SHA256SUMS file and instructions to README how to obtain and verify the downloads Hashes created using: sha256sum models/B/.pth models/[7136]B/ggml-model-f16.bin models/[7136]B/ggml-model-q4_0.bin > SHA256SUMS	1 year ago
Georgi Gerganov	3366853e41	Add notice about pending change	1 year ago
Georgi Gerganov	1daf4dd712	Minor style changes	1 year ago
Georgi Gerganov	dc6a845b85	Add chat.sh script	1 year ago
Georgi Gerganov	3bfa3b43b7	Fix convert script, warnings alpaca instructions, default params	1 year ago
Kevin Kwok	e0ffc861fa	Update IPFS links to quantized alpaca with new tokenizer format (#352)	1 year ago
Mack Straight	074bea2eb1	sentencepiece bpe compatible tokenizer (#252) * potential out of bounds read * fix quantize * style * Update convert-pth-to-ggml.py * mild cleanup * don't need the space-prefixing here rn since main.cpp already does it * new file magic + version header field * readme notice * missing newlines Co-authored-by: slaren <2141330+slaren@users.noreply.github.com>	1 year ago
Suaj Carrot	7392f1cd2c	Improved quantize script (#222 ) * Improved quantize script I improved the quantize script by adding error handling and allowing to select many models for quantization at once in the command line. I also converted it to Python for generalization as well as extensibility. * Fixes and improvements based on Matt's observations Fixed and improved many things in the script based on the reviews made by @mattsta. The parallelization suggestion is still to be revised, but code for it was still added (commented). * Small fixes to the previous commit * Corrected to use the original glob pattern The original Bash script uses a glob pattern to match files that have endings such as ...bin.0, ...bin.1, etc. That has been translated correctly to Python now. * Added support for Windows and updated README to use this script New code to set the name of the quantize script binary depending on the platform has been added (quantize.exe if working on Windows) and the README.md file has been updated to use this script instead of the Bash one. * Fixed a typo and removed shell=True in the subprocess.run call Fixed a typo regarding the new filenames of the quantized models and removed the shell=True parameter in the subprocess.run call as it was conflicting with the list of parameters. * Corrected previous commit * Small tweak: changed the name of the program in argparse This was making the automatic help message to be suggesting the program's usage as being literally "$ Quantization Script [arguments]". It should now be something like "$ python3 quantize.py [arguments]".	1 year ago
Georgi Gerganov	160bfb217d	Update hot topics to mention Alpaca support	1 year ago
Georgi Gerganov	a4e63b73df	Add instruction for using Alpaca (#240)	1 year ago
Pavol Rusnak	6f61c18ec9	Fix typo in readme	1 year ago
Pavol Rusnak	1e5a6d088d	Add note about Python 3.11 to readme	1 year ago
Pavol Rusnak	554b541521	Add memory/disk requirements to readme	1 year ago
Georgi Gerganov	e81b9c81c1	Update Contributing section	1 year ago
Stephan Walter	367946c668	Don't tell users to use a bad number of threads (#243) The readme tells people to use the command line option "-t 8", causing 8 threads to be started. On systems with fewer than 8 cores, this causes a significant slowdown. Remove the option from the example command lines and use /proc/cpuinfo on Linux to determine a sensible default.	1 year ago
Bernat Vadell	2af23d3043	🚀 Dockerize llamacpp (#132) * feat: dockerize llamacpp * feat: split build & runtime stages * split dockerfile into main & tools * add quantize into tool docker image * Update .devops/tools.sh Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * add docker action pipeline * change CI to publish at github docker registry * fix name runs-on macOS-latest is macos-latest (lowercase) * include docker versioned images * fix github action docker * fix docker.yml * feat: include all-in-one command tool & update readme.md --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	1 year ago
Georgi Gerganov	721311070e	Update README.md	1 year ago
Georgi Gerganov	ac15de7895	Expand "Contributing" section	1 year ago
Georgi Gerganov	273abc47ff	Update hot topics - RMSnorm	1 year ago
moritzbrantner	27944c4206	fixed typo (#178)	1 year ago
Musab Gultekin	977295c700	Fix potential licensing issue (#126) * Update README.md * Update README.md remove facebook	1 year ago
Radoslav Gerganov	60f819a2b1	Add section to README on how to run the project on Android (#130)	1 year ago
Georgi Gerganov	97ab2b2578	Add Misc section + update hot topics + minor fixes	1 year ago
Georgi Gerganov	7ec903d3c1	Update contribution section, hot topics, limitations, etc.	1 year ago
Pavol Rusnak	d1f224712d	Add quantize script for batch quantization (#92) * Add quantize script for batch quantization * Indentation * README for new quantize.sh * Fix script name * Fix file list on Mac OS --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	1 year ago
Georgi Gerganov	1808ee0500	Add initial contribution guidelines	1 year ago


78 Commits (8b679987cdce292ff36bd741f6715e4927e26f9b)