malaysia-ai / transformers-openai-api
Lightweight, continuous-batching, OpenAI-compatible API built on HuggingFace Transformers, including T5 and Whisper.
☆19 Updated 2 months ago
Alternatives and similar repositories for transformers-openai-api:
Users who are interested in transformers-openai-api are comparing it to the libraries listed below:
- ☆52 Updated 8 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆22 Updated 7 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆46 Updated last month
- Using multiple LLMs for ensemble Forecasting ☆16 Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 Updated 2 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆17 Updated 4 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 Updated 3 months ago
- Set of scripts to finetune LLMs ☆36 Updated 10 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆26 Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ☆58 Updated 5 months ago
- entropix style sampling + GUI ☆25 Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 9 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- One Line To Build Zero-Data Classifiers in Minutes ☆36 Updated 4 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆64 Updated 3 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆59 Updated 3 months ago
- Modified Beam Search with periodical restart ☆12 Updated 5 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re… ☆18 Updated 5 months ago
- Simple examples using Argilla tools to build AI ☆53 Updated 3 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc… ☆21 Updated 11 months ago
- Experimental sampler to make LLMs more creative ☆30 Updated last year
- ☆38 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆39 Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 7 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆42 Updated 6 months ago
- ☆30 Updated 7 months ago
- ☆65 Updated 8 months ago