Ronsor / llama-tools
Tools for the LLaMA language model
☆12Updated last year
Alternatives and similar repositories for llama-tools:
Users that are interested in llama-tools are comparing it to the libraries listed below
- An unsupervised model merging algorithm for Transformers-based language models.☆102Updated 9 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- ☆27Updated last year
- ☆22Updated last year
- A repository to store helpful information and emerging insights in regard to LLMs☆20Updated last year
- Experimental sampler to make LLMs more creative☆30Updated last year
- GPT-2 small trained on phi-like data☆65Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆38Updated 8 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆65Updated last year
- The first AI artist☆32Updated last year
- GRDN.AI app for garden optimization☆70Updated 11 months ago
- Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven☆13Updated last year
- Flexible Python package for managing and extending LLM based agents☆25Updated last year
- LLM finetuning☆42Updated last year
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆156Updated last year
- Demo of an "always-on" AI assistant.☆23Updated 11 months ago
- ☆60Updated last year
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆28Updated last year
- ☆31Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Demo of ConversationEntityMemory in Streamlit.☆52Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models☆167Updated 8 months ago
- 4 bits quantization of SantaCoder using GPTQ☆53Updated last year
- ☆74Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆232Updated 8 months ago