NickCrews / llama-cpp-server-pythonLinks
Bootstrap a server from llama-cpp in a few lines of python
☆12Updated last year
Alternatives and similar repositories for llama-cpp-server-python
Users that are interested in llama-cpp-server-python are comparing it to the libraries listed below
Sorting:
- ☆51Updated last year
- ☆141Updated 5 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Enhancing Translation with RAG-Powered Large Language Models☆89Updated last month
- Low-Rank adapter extraction for fine-tuned transformers models☆180Updated last year
- Code for the MTEB Arena☆24Updated 7 months ago
- ☆242Updated 4 months ago
- entropix style sampling + GUI☆27Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 10 months ago
- ☆109Updated 5 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 9 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- A pipeline parallel training script for LLMs.☆166Updated 9 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Sentence Transformers API: An OpenAI compatible embedding API server☆70Updated last year
- Function Calling Benchmark & Testing☆92Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Work with your business data using natural language☆19Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- Easily view and modify JSON datasets for large language models☆87Updated 8 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Updated last year
- A pipeline for LLM knowledge distillation☆112Updated 10 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- ☆101Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆157Updated last year
- unsloth-5090-multiple☆60Updated 8 months ago
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆246Updated 2 years ago
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆67Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆167Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆110Updated 8 months ago