kroko-ai / kroko-onnxLinks
Kroko ASR - Speech-to-text
☆130Updated 3 months ago
Alternatives and similar repositories for kroko-onnx
Users that are interested in kroko-onnx are comparing it to the libraries listed below
Sorting:
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆131Updated last week
- A web application that converts speech to speech 100% private☆82Updated 7 months ago
- Fast local speech-to-text for any app using faster-whisper☆145Updated 4 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆103Updated last year
- ☆19Updated 6 months ago
- A simple tool to anonymize LLM prompts.☆66Updated last year
- A bot that checks your grammar and phrasing using LLM of choice☆32Updated 11 months ago
- A novel media player that allows you to navigate by speaker☆85Updated last month
- ☆49Updated 11 months ago
- This is the backend for the entire Amurex project.☆145Updated 9 months ago
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.☆57Updated 2 weeks ago
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆229Updated 5 months ago
- A simple to use python library for creating podcasts with support for many LLM and TTS providers☆102Updated 3 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- OLLama IMage CAtegorizer☆70Updated last year
- A real-time shared memory layer for multi-agent LLM systems.☆53Updated 2 weeks ago
- ☆29Updated 9 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆168Updated last week
- A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit.☆45Updated this week
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆269Updated last month
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆84Updated 6 months ago
- ☆51Updated 3 months ago
- A MCP server allowing LLM agents to easily connect and retrieve data from any database☆99Updated 5 months ago
- ☆54Updated 8 months ago
- On-device streaming text-to-speech engine powered by deep learning☆127Updated last week
- The PyVisionAI Official Repo☆111Updated 6 months ago
- ☆15Updated 11 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆127Updated 4 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆90Updated 11 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆105Updated 2 months ago