malaysia-ai / transformers-openai-api

Lightweight continuous batching with OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper.

☆20

Alternatives and similar repositories for transformers-openai-api:

Users who are interested in transformers-openai-api are comparing it to the libraries listed below.

nyunAI / PruneGPT
☆53 Updated 9 months ago
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆54 Updated this week
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆67 Updated 4 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆21 Updated 3 months ago
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆59 Updated 7 months ago
FishiaT / Tumera
Yet another frontend for LLM, written using .NET and WinUI 3
☆10 Updated 4 months ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12 Updated 6 months ago
cognitivecomputations / SystemChat
☆30 Updated 8 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆64 Updated 4 months ago
bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆25 Updated last year
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆17 Updated 5 months ago
chimezie / mlx-tuning-fork
Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.
☆37 Updated 3 weeks ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53 Updated 4 months ago
arcee-ai / DAM
☆48 Updated 4 months ago
EdwardDali / EntropixLab
entropix style sampling + GUI
☆25 Updated 4 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆19 Updated 3 months ago
cognitivecomputations / kraken
☆65 Updated 9 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆44 Updated 10 months ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆36 Updated last year
the-crypt-keeper / tcurtsni
Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?
☆21 Updated 8 months ago
teknium1 / ShareGPT-Builder
☆111 Updated 3 months ago
nexusflowai / nexusraven-pip
☆38 Updated last year
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆41 Updated 2 months ago
Glavin001 / Data2AITextbook
🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)
☆26 Updated last year