An open-source NLP-as-a-service project focused on making state-of-the-art systems easy to use. Training and inference run via simple Docker commands.
☆20 · Sep 17, 2024 · Updated last year
Alternatives and similar repositories for atra
Users who are interested in atra are comparing it to the libraries listed below.
- WeNet speech-to-text for React Native ☆10 · Nov 1, 2022 · Updated 3 years ago
- ☆26 · Nov 3, 2025 · Updated 4 months ago
- Launch your speech synthesis within one minute. ☆12 · May 6, 2024 · Updated last year
- ☆16 · Sep 12, 2019 · Updated 6 years ago
- ☆17 · Apr 14, 2023 · Updated 2 years ago
- ☆17 · Jul 22, 2024 · Updated last year
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions). ☆36 · Aug 8, 2023 · Updated 2 years ago
- Detect emotion from audio ☆13 · Nov 20, 2018 · Updated 7 years ago
- ☆18 · Sep 19, 2023 · Updated 2 years ago
- Chinese and English bilingual G2P ☆22 · Jul 16, 2023 · Updated 2 years ago
- ☆23 · Oct 17, 2024 · Updated last year
- Persian Grapheme-to-Phoneme (G2P) converter ☆21 · Dec 15, 2020 · Updated 5 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆27 · Aug 1, 2023 · Updated 2 years ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" ☆21 · Jun 23, 2022 · Updated 3 years ago
- (Semi) Grapheme-to-Phoneme (G2P): seq2seq model using PyTorch for Korean ☆23 · Dec 17, 2017 · Updated 8 years ago
- Decoders from Kaldi using OpenFst ☆34 · Jan 29, 2026 · Updated last month
- ☆30 · Apr 8, 2024 · Updated last year
- Faster inference ☆28 · Jan 20, 2025 · Updated last year
- An echo cancellation library for browsers using DTLN-aec ☆26 · Oct 18, 2023 · Updated 2 years ago
- Pronunciation-assisted Subword Modeling ☆31 · May 30, 2019 · Updated 6 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆36 · Jun 6, 2024 · Updated last year
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference) ☆30 · Feb 21, 2026 · Updated last week
- Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings ☆33 · Mar 16, 2023 · Updated 2 years ago
- Convert any speaker's voice timbre into thousands of different timbres ☆32 · Jun 29, 2023 · Updated 2 years ago
- ☆30 · Jun 12, 2025 · Updated 8 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆37 · Aug 29, 2023 · Updated 2 years ago
- A FreeSWITCH module to interface to your speech recognition server over websocket ☆38 · Jun 25, 2025 · Updated 8 months ago
- A hand-gesture recognition system using the Doppler effect of ultrasound. ☆11 · Mar 2, 2019 · Updated 7 years ago
- Real-time face swap and one-click video deepfake with only a single image ☆12 · Sep 13, 2024 · Updated last year
- A tool that makes it easy to run Intel OpenVINO demos and samples. ☆11 · Jan 31, 2023 · Updated 3 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆10 · Nov 6, 2024 · Updated last year
- A Python command-line utility to auto-generate a subtitle file (using the free Vosk speech recognition API) and a translated subtitle file… ☆11 · May 5, 2024 · Updated last year
- CLIP-based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-modal Hashing Retrieval ☆10 · Mar 18, 2024 · Updated last year
- A Kivy tutorial for PyOhio 2013 ☆14 · Apr 30, 2014 · Updated 11 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming ☆15 · Aug 20, 2024 · Updated last year
- [ACM MobiSys 2024 Demo] Image-based Indoor Localization using Object Detection and LSTM ☆12 · Feb 12, 2026 · Updated 2 weeks ago
- PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k ☆11 · Mar 14, 2024 · Updated last year
- Deploy a stripped-down version of yolov5 on HiSilicon devices ☆13 · Nov 22, 2021 · Updated 4 years ago
- Translation software for Windows. Provides select-to-translate, screenshot translation, AI translation, and other features ☆12 · Apr 24, 2025 · Updated 10 months ago