plaggy / fast-whisper-server
ASR + diarization model server with speculative decoding
☆60Updated 10 months ago
Alternatives and similar repositories for fast-whisper-server:
Users that are interested in fast-whisper-server are comparing it to the libraries listed below
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 7 months ago
- ☆51Updated 8 months ago
- Scripts to create your own moe models using mlx☆89Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆74Updated last year
- All the world is a play, we are but actors in it.☆49Updated this week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆52Updated 5 months ago
- Universal text classifier for generative models☆23Updated 8 months ago
- ☆30Updated 9 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆119Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆22Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆21Updated 3 weeks ago
- Simple examples using Argilla tools to build AI☆52Updated 4 months ago
- ☆99Updated 7 months ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆167Updated last year
- ☆112Updated 3 months ago
- ☆171Updated 7 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆105Updated this week
- auto fine tune of models with synthetic data☆75Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆34Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆23Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- huggingface chat-ui integration with mlx-lm server☆60Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆66Updated 5 months ago