gweltou / anaouder-cliLinks
Anaouder mouezh e Brezhoneg gant Vosk
☆16Updated 2 months ago
Alternatives and similar repositories for anaouder-cli
Users that are interested in anaouder-cli are comparing it to the libraries listed below
Sorting:
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated 2 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Updated 3 weeks ago
- Evaluation of STT models for german language☆15Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated last year
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Updated last year
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated 2 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Updated 7 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 7 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Updated 3 years ago
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Updated 3 years ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Updated last year
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Updated 2 years ago
- Whisper finetuning☆15Updated 10 months ago
- Machine learning tools and framework for automatic music transcription.☆36Updated last year
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆22Updated last year
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Updated 5 years ago
- Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20…☆15Updated last year
- Wenet speech to text for react native☆10Updated 3 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- ☆11Updated 5 months ago
- Implementation of the paper "Can Large Language Models Predict Audio Effects Parameters from Natural Language?"☆26Updated 8 months ago
- Sisyphus recipies for ASR☆18Updated this week
- Simple Kaldi recipe for forced alignment☆11Updated 2 years ago