skrbnv / javadLinks
β65Updated 11 months ago
Alternatives and similar repositories for javad
Users that are interested in javad are comparing it to the libraries listed below
Sorting:
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β132Updated 5 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximationβ143Updated 7 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.β127Updated 5 months ago
- β348Updated 3 months ago
- β385Updated last year
- Very fast, accurate speaker diarizationβ205Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β104Updated last year
- A simple, hackable text-to-speech system in PyTorch and MLXβ184Updated 5 months ago
- Collection of Open Source Speech Dataβ164Updated 3 months ago
- Official implementation of the TTS model Lina-Speechβ175Updated last year
- β319Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.β138Updated 3 months ago
- β158Updated last month
- Open TTS models, built for streaming on the edgeβ44Updated 9 months ago
- Speaker Diarization with Transformersβ69Updated 7 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.β47Updated 4 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on Onβ¦β227Updated 7 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restorationβ194Updated 8 months ago
- A lightweight Python package for Automatic Speech Recognition using ONNX modelsβ214Updated 2 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β69Updated 2 months ago
- β54Updated last week
- python bindings for symphonia/opus - read various audio formats from python and write opus filesβ72Updated last week
- Audio tokenization, in the fastest way possible!β53Updated last year
- Drax: Speech Recognition with Discrete Flow Matchingβ73Updated 3 months ago
- Fast audio super resolution from 16khz to 48khz.β177Updated last week
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLMβ291Updated 8 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.β86Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.β179Updated last year
- Real-time Speech-Text Foundation Model Toolkit (wip)β249Updated 9 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β412Updated last year