daanzu / py-webrtcvad-wheels
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
☆15Updated last month
Alternatives and similar repositories for py-webrtcvad-wheels:
Users that are interested in py-webrtcvad-wheels are comparing it to the libraries listed below
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated last year
- Scripts to parse arxiv documents for NLP tasks☆17Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Faster Whisper ASR transcription with CTranslate2☆19Updated 2 months ago
- An open, comprehensive catalog of scholarship, connecting papers, authors, institutions, and journals.☆10Updated last year
- Experiments with Hugging Face 🔬 🤗☆45Updated 5 months ago
- A fork of https://people.csail.mit.edu/hubert/git/pyaudio.git. Last synchronized on 20231119.☆31Updated 6 months ago
- Development repository for Integrated Speech Corpus Analaysis (ISCAN)☆9Updated 2 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated last year
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- ☆13Updated this week
- Simple implementation of a GPT (training and inference) in PyTorch.☆10Updated last year
- Cleaning discord data for NLP☆27Updated 3 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam) and LookAhead in one codebase☆11Updated 3 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆37Updated last month
- Finite-state script normalization and processing utilities☆38Updated this week
- Reproducible experimental protocols for multimedia (audio, video, text) database☆93Updated this week
- Fast Neural Machine Translation in C++ - development repository☆19Updated 8 months ago
- Efficiently computing & storing token n-grams from large corpora☆17Updated 3 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated 2 weeks ago
- arXiv plain text extraction☆41Updated 2 years ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆21Updated 3 months ago
- ☆24Updated 2 years ago
- ☆28Updated last month
- Automatically generate and overlay subtitles for any video using OpenAi Whisper☆16Updated 2 years ago
- LLM access to models by Anthropic, including the Claude series☆13Updated last month
- CLI for llama.cpp with various commands to guide, edit, and regenerate tokens on the fly.☆11Updated last week
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago