bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆22 · Updated 9 months ago
Related projects
Alternatives and complementary repositories for exllamav2-openai-server
- ☆20 · Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆20 · Updated 9 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆42 · Updated 8 months ago
- ☆27 · Updated last year
- entropix style sampling + GUI ☆25 · Updated 3 weeks ago
- Experimental sampler to make LLMs more creative ☆30 · Updated last year
- ☆49 · Updated 8 months ago
- ☆53 · Updated 5 months ago
- GPT-2 small trained on phi-like data ☆65 · Updated 9 months ago
- QuIP quantization ☆46 · Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated 8 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 · Updated 10 months ago
- Using multiple LLMs for ensemble Forecasting ☆16 · Updated 10 months ago
- Simple examples using Argilla tools to build AI ☆42 · Updated this week
- Demonstration that finetuning a RoPE model on sequences longer than those used in pre-training adapts the model's context limit ☆63 · Updated last year
- ☆41 · Updated 2 weeks ago
- ☆37 · Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆62 · Updated 3 weeks ago
- Model REVOLVER, a human in the loop model mixing system. ☆33 · Updated last year
- An unsupervised model merging algorithm for Transformers-based language models. ☆100 · Updated 6 months ago
- ☆72 · Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 · Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Updated 7 months ago
- ☆21 · Updated 5 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated 8 months ago
- Full finetuning of large language models without large memory requirements ☆93 · Updated 10 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆55 · Updated 3 months ago
- Train Llama Loras Easily ☆29 · Updated last year