AssemblyAI-Community / assemblyai-and-python-in-5-minutes
Repo for hosting tutorial code associated with the "AssemblyAI and Python in 5 Minutes" blog by AssemblyAI
☆12 Updated last year
Alternatives and similar repositories for assemblyai-and-python-in-5-minutes
Users who are interested in assemblyai-and-python-in-5-minutes are comparing it to the repositories listed below.
- Repository contains code to fine-tune WhisperASR model ☆23 Updated 2 years ago
- Demo FastAPI WebSocket Audio ☆40 Updated 4 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated 3 weeks ago
- Streaming AI assistant with ChatGPT, FastAPI, WebSockets and React ✨🤖🚀 ☆26 Updated last year
- Mirror of hf.co/pyannote/speaker-diarization-3.1 ☆23 Updated last year
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper ☆38 Updated 2 years ago
- Zero-shot Audio Classification using Whisper ☆79 Updated 2 years ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆20 Updated 8 months ago
- Create an LJSpeech structured voice dataset on wave input ☆30 Updated 8 months ago
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models ☆66 Updated 2 years ago
- ☆20 Updated 2 months ago
- Runpod WhisperX Docker Container Repo ☆15 Updated last year
- ☆64 Updated 2 years ago
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 Updated last year
- On-device speaker recognition engine powered by deep learning ☆36 Updated this week
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆149 Updated last year
- Speaker diarization service ☆23 Updated 2 months ago
- Transcription and diarization (speaker identification) ☆33 Updated 2 years ago
- Audio search using Azure Cognitive Search ☆23 Updated last year
- Open TTS models, built for streaming on the edge ☆43 Updated 3 months ago
- Create a dataset from a list of YouTube links easily ☆19 Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated last year
- Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolotl,.. ☆64 Updated last year
- Retrieval Augmented Generation (RAG) on audio data with LangChain ☆14 Updated last year
- A collection of simple transformer-based chatbots. ☆18 Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated last year
- A streaming Whisper server for on-prem transcription ☆20 Updated 10 months ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆146 Updated last year
- Tunable pipelines ☆34 Updated 4 months ago
- Real-time web-based Speech-to-Text app with Streamlit ☆250 Updated 2 years ago