mesolitica / transformers-openai-apiLinks

Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.

☆24

Alternatives and similar repositories for transformers-openai-api

Users that are interested in transformers-openai-api are comparing it to the libraries listed below

Sorting:

agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆86Updated last month
nyunAI / PruneGPT
☆53Updated last year
FishiaTee / Tumera
Yet another frontend for LLM, written using .NET and WinUI 3
☆10Updated 7 months ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12Updated 9 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 7 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 7 months ago
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆62Updated 10 months ago
l4b4r4b4b4 / AIDocks
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27Updated last year
the-crypt-keeper / tcurtsni
Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?
☆22Updated last year
severian42 / Proteus-The-Genesis-LLM
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆22Updated 6 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 7 months ago
lightblue-tech / lb-reranker
☆23Updated 4 months ago
rodrigobaron / anthill
☆24Updated 5 months ago
keeeeenw / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆11Updated last year
brittlewis12 / autogguf
Easily convert HuggingFace models to GGUF-format for llama.cpp
☆21Updated 10 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆20Updated 8 months ago
latent-variable / r1_reasoning_effort
Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.
☆19Updated 4 months ago
bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆25Updated last year
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 7 months ago
JakeFurtaw / Chat-RAG
Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…
☆22Updated last month
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆49Updated 4 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆20Updated 6 months ago
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆31Updated 2 months ago
huggingface / feel
☆11Updated 2 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated 3 weeks ago
mkurman / grpo-llm-evaluator
Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…
☆44Updated last month
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16Updated last year
cognitivecomputations / kraken
☆66Updated last year
teknium1 / ShareGPT-Builder
☆114Updated 6 months ago