☆18 · Apr 28, 2021 · Updated 5 years ago
Alternatives and similar repositories for data-acquisition-pipeline
Users who are interested in data-acquisition-pipeline are comparing it to the libraries listed below.
- ☆11 · Aug 3, 2021 · Updated 4 years ago
- ☆45 · Dec 15, 2022 · Updated 3 years ago
- Generate large textual corpora for almost any language by crawling the web ☆13 · Feb 17, 2024 · Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline ☆32 · Feb 15, 2023 · Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Mar 6, 2023 · Updated 3 years ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆25 · Dec 21, 2023 · Updated 2 years ago
- ☆37 · Mar 26, 2024 · Updated 2 years ago
- Official source for Catalan Language Models and resources made within Aina project. ☆26 · Jul 28, 2023 · Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models. ☆88 · Sep 22, 2022 · Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202… ☆27 · May 17, 2023 · Updated 3 years ago
- Implementation of vocoders empowered with pytorch lightning ☆18 · Jan 27, 2024 · Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆21 · Jan 24, 2022 · Updated 4 years ago
- Synthetically generate random text document images with ground-truth ☆12 · Jul 20, 2021 · Updated 4 years ago
- Deploy Kaldi models using grpc for bidirectional streaming. ☆17 · Sep 30, 2024 · Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- Phonetically-Oriented Word Error Rate ☆36 · May 4, 2019 · Updated 7 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- Parse Searchable Electoral Rolls ☆12 · Apr 20, 2025 · Updated last year
- Finally, some decent sample sentences ☆23 · Dec 3, 2023 · Updated 2 years ago
- Open TTS models, built for streaming on the edge ☆45 · Mar 16, 2025 · Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆13 · Mar 9, 2022 · Updated 4 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- Websockets <-> Riva proxy service. Audiocodes compatible. ☆20 · Mar 31, 2023 · Updated 3 years ago
- Python library to write, read, and verify transparency metadata in audio files for AI transparency compliance. ☆18 · Aug 17, 2025 · Updated 9 months ago
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- ☆13 · May 1, 2026 · Updated 3 weeks ago
- ☆13 · Dec 15, 2022 · Updated 3 years ago
- ☆33 · Nov 27, 2021 · Updated 4 years ago
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- Python library for audio augmentation ☆85 · Jul 6, 2023 · Updated 2 years ago
- Using OpenVINO to speed up MeloTTS inference ☆15 · Nov 1, 2024 · Updated last year
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021 ☆11 · Jun 13, 2021 · Updated 4 years ago
- ☆14 · Feb 27, 2021 · Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- Assistance component base for Dicio assistant components ☆13 · Apr 23, 2026 · Updated last month
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its… ☆18 · Jan 15, 2026 · Updated 4 months ago
- Deepspeech ASR Model for the Catalan Language ☆17 · Feb 15, 2021 · Updated 5 years ago