mesolitica / transformers-openai-api
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆22Updated last month
Alternatives and similar repositories for transformers-openai-api:
Users that are interested in transformers-openai-api are comparing it to the libraries listed below
- ☆53Updated 10 months ago
- entropix style sampling + GUI☆25Updated 5 months ago
- Modified Beam Search with periodical restart☆12Updated 7 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 6 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 4 months ago
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 5 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆41Updated last month
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆66Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- Very minimal (and stateless) agent framework☆42Updated 3 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 8 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated 11 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆60Updated this week
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated 2 weeks ago
- ☆41Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 11 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆21Updated last month
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- ☆14Updated this week
- BH hackathon☆14Updated last year
- Lego for GRPO☆27Updated 3 weeks ago
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆112Updated 4 months ago