☆10Mar 29, 2021Updated 4 years ago
Alternatives and similar repositories for wav2vec2
Users that are interested in wav2vec2 are comparing it to the libraries listed below
Sorting:
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Apr 21, 2021Updated 4 years ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- The official repo of our research work "Interactive Editing for Text Summarization".☆23Jun 3, 2023Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆46Jul 3, 2025Updated 8 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- Zabanshenas is a solution for identifying the most likely language of a piece of written text. Demo (👇 )☆19Aug 2, 2021Updated 4 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆36Jan 5, 2026Updated last month
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- Hugging Face Download (Cache) Manager☆22Aug 7, 2022Updated 3 years ago
- ☆13Dec 12, 2021Updated 4 years ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels"☆30Nov 9, 2020Updated 5 years ago
- A Pytorch Implementations for Various Vector Quantization Methods☆34Sep 14, 2021Updated 4 years ago
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- Chat with your data while uploading a pdf file and using a local LLM.☆11Mar 19, 2024Updated last year
- ProbPy is a comprehensive repository dedicated to providing an extensive collection of probability puzzles, riddles, and solutions typica…☆11Jun 27, 2023Updated 2 years ago
- A complete pipeline for fine-tuning YOLOv8 pose models with custom datasets. Supports automatic and semi-automatic annotation for efficie…☆15Feb 9, 2025Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆42Jun 29, 2023Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated 11 months ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)☆11Nov 29, 2024Updated last year
- Itzuli® Machine Translation Engine API libraries☆11Feb 2, 2026Updated last month
- ☆10Oct 27, 2023Updated 2 years ago
- ☆15Sep 24, 2022Updated 3 years ago
- This is a plugin for ImageJ2 for multifractal analysis of 2D and 3D images. Cite: MULTIFRAC: An ImageJ plugin for multiscale characteriza…☆12Aug 28, 2020Updated 5 years ago
- audio time-stretching and pitch-shifting library and utility program☆10Dec 27, 2016Updated 9 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Pytorch implementation of Google TCAV☆10Jan 11, 2019Updated 7 years ago
- Example of bazel python cpp binding☆10May 27, 2023Updated 2 years ago
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…☆10Mar 8, 2022Updated 3 years ago
- ☆10May 2, 2020Updated 5 years ago
- Fancylit is a python module that contains pre-packaged Streamlit code to render fancy visualizations, run modeling tasks, and data explor…☆11Oct 19, 2021Updated 4 years ago
- 2021 Line Webtoon Year-in-Review Project :: Animation Production☆10Feb 21, 2023Updated 3 years ago
- Phonetically balanced text to speech sentences☆10Aug 16, 2021Updated 4 years ago
- BigBlueButton API for .NET☆11Sep 12, 2022Updated 3 years ago
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45May 25, 2021Updated 4 years ago