Hugo-Dz / on-device-transcription
A ready-to-use, minimal app that converts any speech into text.
☆377 Updated last year
Alternatives and similar repositories for on-device-transcription
Users who are interested in on-device-transcription are comparing it to the repositories listed below.
- AI-powered dictation tool ☆464 Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆212 Updated 3 months ago
- Filter X content using LLM API requests, configurable, based on Groq API ☆132 Updated last year
- ☆209 Updated 11 months ago
- ☆285 Updated 10 months ago
- A simple and fast backend API, based on Hono, that can search for relevant content on the internet using keywords and convert it into a f… ☆249 Updated last year
- HTML to Markdown converter and crawler. ☆608 Updated 2 years ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console ☆161 Updated last week
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3 ☆500 Updated 5 months ago
- Open-source browser extension that leverages the power of AI to generate engaging replies for social media growth. ☆242 Updated last year
- Yet another open source Perplexity ☆460 Updated last year
- A class for generating realistic audio (TTS) for podcasts and dialogues. ☆64 Updated last year
- Full-stack AI chat platform built on Cloudflare using Workers, Durable Objects, KV, and AI Gateway. Features AI chat, Text-to-Speech (TTS… ☆108 Updated 7 months ago
- podcastfy.ai gradio demo app ☆332 Updated last year
- Integrate LLMs into your OS. For any issues or ideas, message us in the discord server below! ☆146 Updated 8 months ago
- ☆151 Updated last year
- ☆60 Updated last year
- Tell a story and get a live feed of images. ☆139 Updated last year
- Semantic Search on Wikipedia with Upstash Vector ☆472 Updated last month
- ☆253 Updated 11 months ago
- Examples for Cerebrium Serverless GPUs ☆515 Updated last week
- ML-powered speech synthesis directly in your browser ☆171 Updated 11 months ago
- ☆170 Updated last year
- Chat with any website on your local machine ☆85 Updated last year
- Example UI implementing the RTVI web client ☆475 Updated last year
- This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, Sear… ☆231 Updated last year
- Blazing fast whisper turbo for ASR (speech-to-text) tasks ☆217 Updated 2 months ago
- A real-time Agent framework for audio and video. ☆169 Updated last week
- Open Source AI Math Notes ☆497 Updated last year
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now) ☆319 Updated 4 months ago