Hugo-Dz / on-device-transcriptionLinks
A ready-to-use, minimal app that converts any speech into text.
☆375Updated last year
Alternatives and similar repositories for on-device-transcription
Users that are interested in on-device-transcription are comparing it to the libraries listed below
Sorting:
- AI-powered dictation tool☆439Updated 9 months ago
- ☆282Updated 6 months ago
- Filter X content using LLM API requests, configurable, based on Groq API☆131Updated last year
- ☆209Updated 7 months ago
- Tell a story and get a live feed of images.☆138Updated last year
- HTML to Markdown converter and crawler.☆589Updated last year
- Integrate LLM's into your OS. For any issues or ideas, message us in the discord server below!☆145Updated 4 months ago
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆239Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆209Updated 8 months ago
- Yet another open source Perplexity☆451Updated 10 months ago
- Chat with any website on your local machine☆85Updated last year
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆499Updated last month
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆215Updated 10 months ago
- ☆149Updated last year
- Semantic Search on Wikipedia with Upstash Vector☆472Updated 5 months ago
- A simple and fast backend API, based on Hono, that can search for relevant content on the internet using keywords and convert it into a f…☆246Updated last year
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆161Updated 11 months ago
- ☆170Updated last year
- Example UI implementing the RTVI web client☆477Updated 9 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆556Updated 3 months ago
- podcastfy.ai gradio demo app☆334Updated 9 months ago
- Safely deploy OpenAI's Realtime APIs in less than 5 minutes!☆158Updated 11 months ago
- Use OpenAI's realtime API for a chatting with your documents☆330Updated 11 months ago
- A real-time Agent framework for audio and video.☆149Updated 3 months ago
- ☆249Updated 7 months ago
- Full-stack AI chat platform built on Cloudflare using Workers, Durable Objects, KV, and AI Gateway. Features AI chat, Text-to-Speech (TTS…☆103Updated 3 months ago
- Generate accurate transcripts using Apple's MLX framework☆436Updated 4 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆305Updated last month
- an AI Github Rlease Tracker Powered by Cloudflare☆57Updated 6 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year