Audio transcription using MLX Whisper and VAD silence processing
☆17Oct 14, 2024Updated last year
Alternatives and similar repositories for mlx_speech2text
Users that are interested in mlx_speech2text are comparing it to the libraries listed below
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Oct 30, 2024Updated last year
- Transform your CapsLock into an AI key! This AutoHotkey app puts powerful AI capabilities right at your fingertips, supercharging your Wi…☆21Oct 31, 2025Updated 4 months ago
- Bluetooth plugin for Flutter☆10Dec 19, 2022Updated 3 years ago
- Plugin QGIS☆10Jan 16, 2023Updated 3 years ago
- Bugtracker of novel-ebook.com☆12Aug 11, 2021Updated 4 years ago
- A bot app that, when a raid arrives during a Twitch stream, automatically runs the official "/shoutout <raiding user's name>" command and posts a specified message in the chat☆10Jun 7, 2025Updated 8 months ago
- OPI5 open micro desk design.☆13Mar 6, 2023Updated 2 years ago
- The framework for creating a new platform (like game engine).☆10Jan 11, 2026Updated last month
- MiniGPT-Pancreas: Multimodal Large language Model for Pancreas Cancer Classification and Detection☆11Sep 19, 2025Updated 5 months ago
- ISDB-S3 fork☆10Dec 13, 2024Updated last year
- This is a frontend to the Inkscape command line feature to allow the user to perform batch conversions of SVG files.☆15Dec 10, 2013Updated 12 years ago
- Project-agnostic, composable configuration system for AI-assisted development workflows. Single source of truth for agentic tools (Claude…☆23Feb 24, 2026Updated last week
- Convert your PDF files into Word documents or different image formats locally, without uploading them to unknown servers.☆18Jun 2, 2023Updated 2 years ago
- ☆26Feb 26, 2026Updated last week
- MPEG-2 TS packet check☆12Jun 3, 2024Updated last year
- ☆31Updated this week
- Easily spot, fix and detect badly written code in your project without breaking a sweat. Your GPT4 and LLM powered programming AI realtim…☆21Mar 2, 2025Updated last year
- A TV viewing and recording environment using Mirakurun + EDCB + KonomiTV, built with Docker☆15Jan 18, 2026Updated last month
- An application to display the text of the Hebrew Bible (Leningrad codex) along with an English translation (1917 JPS) and an audio record…☆13Jul 17, 2015Updated 10 years ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆11Apr 29, 2024Updated last year
- Template repository for SillyTavern extensions using React and Webpack.☆15Updated this week
- ☆10Jun 24, 2022Updated 3 years ago
- Collected Latin files from the Perseus Digital Library☆13Jun 21, 2017Updated 8 years ago
- ATSC 3.0 to MPEG-2 TS Converter☆21Sep 11, 2025Updated 5 months ago
- The speech synthesis engine of VOICEVOX, a free, medium-quality text-to-speech software☆10Jan 30, 2023Updated 3 years ago
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆16Feb 24, 2025Updated last year
- A semi print-in-place hand for human-like manipulation, designed to be built by anyone.☆17Jan 5, 2026Updated 2 months ago
- ☆13Oct 4, 2024Updated last year
- ☆10Aug 27, 2025Updated 6 months ago
- Concurrent data extraction from unstructured text and images using AI models.☆18Aug 10, 2025Updated 6 months ago
- A collection of build scripts for personal use☆10Updated this week
- A tool for automatic English to Katakana conversion☆15Nov 26, 2025Updated 3 months ago
- OData Browser for the iPhone☆26Aug 7, 2010Updated 15 years ago
- Claude Code plugin for code review skills and verification workflows. Python, Go, React, FastAPI, BubbleTea, and AI frameworks (Pydantic …☆31Feb 13, 2026Updated 2 weeks ago
- ☆22Jan 27, 2026Updated last month
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- A set of AI-powered slash commands for Claude Code and OpenSkills (support Cursor, Windsurf and Gemini CLI) that help you understand any …☆29Jan 14, 2026Updated last month
- Give Claude a variety of iconic Starcraft, WarCraft and AoE unit sounds! Also manages audio and system notifications☆38Feb 13, 2026Updated 2 weeks ago
- Tokenizer for Text to Speech (TTS) models☆13Jan 16, 2025Updated last year