bjoernpl / llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
☆48Updated last year
Alternatives and similar repositories for llama_gradio_interface:
Users that are interested in llama_gradio_interface are comparing it to the libraries listed below
- Inference code for facebook LLaMA models with Wrapyfi support☆130Updated last year
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- Merge LLM that are split in to parts☆26Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated 9 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Instruct-tuning LLaMA on consumer hardware☆66Updated last year
- ☆31Updated last year
- Image Diffusion block merging technique applied to transformers based Language Models.☆54Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆12Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 10 months ago
- Inference code for LLaMA models☆46Updated last year
- Framework agnostic python runtime for RWKV models☆145Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- Inference code for LLaMA 2 models☆31Updated 7 months ago
- 8-bit CUDA functions for PyTorch☆44Updated last year
- Conversational Language model toolkit for training against human preferences.☆41Updated 10 months ago
- ☆27Updated last year
- Genertaes control vectors for use with llama.cpp in GGUF format.☆18Updated 5 months ago
- Pressure testing the context window of open LLMs☆22Updated 5 months ago
- Instruct-tune LLaMA on consumer hardware☆73Updated last year
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆64Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆115Updated last year
- ☆52Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆139Updated last year
- Yet Another LLaMA/ALPACA Discord Bot☆72Updated last year
- ☆62Updated 6 months ago
- Tune MPTs☆84Updated last year