plaggy / fast-whisper-server
ASR + diarization model server with speculative decoding
☆53Updated 7 months ago
Alternatives and similar repositories for fast-whisper-server:
Users that are interested in fast-whisper-server are comparing it to the libraries listed below
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆20Updated 10 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 4 months ago
- Scripts to create your own moe models using mlx☆85Updated 10 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- ☆96Updated 4 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆118Updated 11 months ago
- ☆107Updated 3 weeks ago
- All the world is a play, we are but actors in it.☆47Updated this week
- A framework for evaluating function calls made by LLMs☆36Updated 5 months ago
- Simple examples using Argilla tools to build AI☆51Updated last month
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 2 months ago
- ☆51Updated 5 months ago
- ☆13Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- auto fine tune of models with synthetic data☆74Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- ☆77Updated 10 months ago
- Data Questionnaire Agent Chatbot☆63Updated this week
- Routing on Random Forest (RoRF)☆98Updated 3 months ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated 11 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated last week
- Simple Graph Memory for AI applications☆81Updated 5 months ago
- VideoDB Python SDK☆63Updated this week
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆66Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆38Updated last week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- A pipeline parallel training script for LLMs.☆116Updated this week