mesolitica / transformers-openai-apiLinks
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆23Updated 2 months ago
Alternatives and similar repositories for transformers-openai-api
Users that are interested in transformers-openai-api are comparing it to the libraries listed below
Sorting:
- Easy to use, High Performant Knowledge Distillation for LLMs☆85Updated last month
- ☆53Updated last year
- entropix style sampling + GUI☆26Updated 7 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 6 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 9 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated last year
- ☆43Updated 3 months ago
- Simple examples using Argilla tools to build AI☆53Updated 6 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- Modified Beam Search with periodical restart☆12Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- Lego for GRPO☆28Updated last week
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated 11 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆70Updated 7 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆55Updated 8 months ago
- LLM inference in C/C++☆21Updated 2 months ago
- ☆114Updated 5 months ago
- ☆17Updated 5 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- ☆24Updated 4 months ago
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆36Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 8 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆40Updated 3 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 9 months ago
- ☆66Updated last year