Hugo-Dz / on-device-transcriptionLinks
A ready-to-use, minimal app that converts any speech into text.
☆369Updated 11 months ago
Alternatives and similar repositories for on-device-transcription
Users that are interested in on-device-transcription are comparing it to the libraries listed below
Sorting:
- AI-powered dictation tool☆405Updated 7 months ago
- HTML to Markdown converter and crawler.☆571Updated last year
- Tell a story and get a live feed of images.☆136Updated last year
- ☆206Updated 4 months ago
- ☆275Updated 4 months ago
- Filter X content using LLM API requests, configurable, based on Groq API☆131Updated 10 months ago
- A simple and fast backend API, based on Hono, that can search for relevant content on the internet using keywords and convert it into a f…☆247Updated last year
- podcastfy.ai gradio demo app☆334Updated 6 months ago
- Full-stack AI chat platform built on Cloudflare using Workers, Durable Objects, KV, and AI Gateway. Features AI chat, Text-to-Speech (TTS…☆97Updated last month
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆158Updated 8 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆290Updated this week
- ☆171Updated 10 months ago
- Chat with any website on your local machine☆80Updated 11 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆206Updated 6 months ago
- A real-time Agent framework for audio and video.☆137Updated last week
- Real-Time Voice Inference Web SDK☆246Updated last week
- Semantic Search on Wikipedia with Upstash Vector☆467Updated 2 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆210Updated last year
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆491Updated 5 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆211Updated 8 months ago
- an AI Github Rlease Tracker Powered by Cloudflare☆55Updated 3 months ago
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆237Updated last year
- A class for generating realistic audio (TTS) for podcasts and dialogues.☆60Updated 6 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆243Updated 9 months ago
- ☆156Updated 7 months ago
- ☆274Updated 7 months ago
- The simplest open-source implementation of perplexity.ai☆313Updated 5 months ago
- Self-hosted voice chat with LLMs☆432Updated 3 months ago
- Ortlin - OpenAI User Interface☆57Updated 7 months ago
- Example UI implementing the RTVI web client☆477Updated 6 months ago