gladiaio / gladia-samples
☆26Updated 3 months ago
Related projects:
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆153Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆188Updated 2 months ago
- ☆59Updated last year
- Enable AI-powered semantic search on your documents, easy to install, simple to configure, handles multiple file formats (txt, csv, pdf, do…☆75Updated 2 months ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆64Updated last month
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆134Updated 3 weeks ago
- WIP exploration using Twilio Media Streams and Generative AI☆34Updated 7 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆94Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆56Updated 4 months ago
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization☆24Updated last year
- Transcription with speaker diarization pipeline☆81Updated last year
- Self-hosted AI voice agent☆50Updated 3 weeks ago
- Speaker Diarization with Transformers☆57Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆34Updated last week
- An autonomous agent for software testing: concept☆15Updated last month
- Streaming speech-to-text server using Whisper☆75Updated last year
- Audio to summary with OpenAI Whisper & GPT 3.5/4 using Streamlit☆62Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆43Updated last week
- Official Python SDK for Deepgram's automated speech recognition APIs.☆209Updated this week
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models☆65Updated last year
- A python library to find differences between audio and transcriptions☆14Updated 10 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆35Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆132Updated 4 months ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆44Updated last year
- Create an LJSpeech structured voice dataset on wave input☆16Updated 2 months ago
- ☆64Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆103Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆132Updated last year
- AI real estate agent☆31Updated 7 months ago