☆18Apr 28, 2021Updated 4 years ago
Alternatives and similar repositories for data-acquisition-pipeline
Users that are interested in data-acquisition-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Aug 3, 2021Updated 4 years ago
- ☆45Dec 15, 2022Updated 3 years ago
- Generate large textual corpora for almost any language by crawling the web☆13Feb 17, 2024Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Feb 15, 2023Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- ☆37Mar 26, 2024Updated 2 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆26Jul 28, 2023Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆88Sep 22, 2022Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- A open-source flutter based app to help keep track of your repertoir!☆23Updated this week
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Synthetically generate random text document images with ground-truth☆12Jul 20, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language