xaedes / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for llama.cpp
- QuIP quantization☆46Updated 7 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 5 months ago
- ☆27Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆22Updated 9 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆42Updated 7 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated 10 months ago
- ☆36Updated 3 months ago
- Collection of autoregressive model implementation☆66Updated last week
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 7 months ago
- Training Models Daily☆17Updated 10 months ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆73Updated 3 weeks ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Training hybrid models for dummies.☆15Updated 2 weeks ago
- Experimental sampler to make LLMs more creative☆30Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆54Updated 2 months ago
- Simple LLM inference server☆17Updated 5 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated this week
- ☆43Updated 3 months ago
- ☆49Updated 8 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆21Updated 4 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆30Updated 3 months ago
- ☆40Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- ☆55Updated 11 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- look how they massacred my boy☆54Updated 3 weeks ago
- ☆64Updated 5 months ago
- ☆40Updated last week
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year