aneeshjoy / vllm-windows
Docker compose to run vLLM on Windows
☆28Updated 8 months ago
Related projects: ⓘ
- ☆50Updated 3 months ago
- A pipeline parallel training script for LLMs.☆79Updated last month
- Low-Rank adapter extraction for fine-tuned transformers model☆154Updated 4 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/☆77Updated last month
- An unsupervised model merging algorithm for Transformers-based language models.☆96Updated 4 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated last month
- ☆71Updated last year
- A fast batching API to serve LLM models☆172Updated 4 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆53Updated 3 weeks ago
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- Evaling and unaligning Chinese LLM censorship☆23Updated 3 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆74Updated 5 months ago
- ☆53Updated this week
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆139Updated 11 months ago
- ☆101Updated 6 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆139Updated 7 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆101Updated 2 months ago
- All the world is a play, we are but actors in it.☆46Updated 2 months ago
- ☆82Updated 3 weeks ago
- ☆51Updated last month
- automatically quant GGUF models☆119Updated this week
- A python package for developing AI applications with local LLMs.☆137Updated 2 months ago
- run ollama & gguf easily with a single command☆46Updated 4 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆85Updated this week
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆84Updated 2 weeks ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally)☆64Updated this week
- ☆73Updated 8 months ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆68Updated 3 weeks ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3☆33Updated last week
- ☆21Updated this week