lu-wo/whisbert

lu-wo / whisbert

babyLM WhisBERT code

☆19

Alternatives and similar repositories for whisbert

Users who are interested in whisbert are comparing it to the repositories listed below.

AI-S2-Lab / GPT-Talker
[ACMMM'2024] Generative Expressive Conversational Speech Synthesis
☆45 · Oct 28, 2024 · Updated last year
nguyenvulebinh / AV-HuBERT-S2S
Huggingface Implementation of AV-HuBERT on the MuAViC Dataset
☆19 · Mar 6, 2025 · Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 · Apr 10, 2025 · Updated last year
yanghaha0908 / FastHuBERT
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆100 · Nov 20, 2024 · Updated last year
savasy / TC32
Text Classification Dataset for Turkish Language
☆10 · Nov 16, 2021 · Updated 4 years ago
lancopku / Augmented_Data_for_FST
The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).
☆12 · May 14, 2020 · Updated 6 years ago
google-research-datasets / LLAMA1-Test-Set
We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…
☆23 · Mar 14, 2024 · Updated 2 years ago
boun-tabi / SQuAD-TR
☆11 · Jun 8, 2024 · Updated 2 years ago
AgentCooper2002 / EDMSound
Codebase and project page for EDMSound
☆35 · Nov 20, 2023 · Updated 2 years ago
taehong-moon / ee-diffusion
Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'
☆20 · Jul 24, 2024 · Updated 2 years ago
keonlee9420 / evaluate-zero-shot-tts
Evaluation Protocol for Large-Scale Zero-Shot TTS Literature
☆97 · Mar 12, 2025 · Updated last year
cpdu / unicats
☆63 · Jan 15, 2024 · Updated 2 years ago
Alexander-H-Liu / dinosr
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
☆53 · Jan 18, 2024 · Updated 2 years ago
nicholasmfraser / rbiorxiv
R client for the bioRxiv API (https://api.biorxiv.org/)
☆15 · May 18, 2021 · Updated 5 years ago
NathanDuran / MRDA-Corpus
Utilities for Processing the Meeting Recorder Dialogue Act Corpus
☆38 · Jan 24, 2021 · Updated 5 years ago
petezh / OpenD5
Tasks for describing differences between text distributions.
☆17 · Aug 9, 2024 · Updated last year
SALT-NLP / Impressions
Dataset for the investigation of visual semiotics, and how specific visual features and design choices can elicit specific emotions, thou…
☆11 · Dec 13, 2023 · Updated 2 years ago
ljos / navnkjenner
Named-Entity Recognition for Norwegian Bokmål and Nynorsk
☆12 · Aug 5, 2019 · Updated 6 years ago
abrahamnunes / fitr
Tools for computational psychiatry research.
☆12 · Dec 8, 2024 · Updated last year
outerbounds / tutorials
☆13 · Jun 7, 2024 · Updated 2 years ago
sb-b / BOUN-PARS
☆15 · Jan 10, 2022 · Updated 4 years ago
AlexDoumas / BrPong_1
☆10 · Feb 25, 2026 · Updated 5 months ago
stephenc222 / example-ocr-with-multi-modal-llms
An example project demonstrating how to perform OCR with multi-modal LLMs
☆10 · Mar 14, 2024 · Updated 2 years ago
sungnyun / avsr-temporal-dynamics
(SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition
☆13 · Oct 22, 2024 · Updated last year
eloimoliner / CQT_pytorch
Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters
☆36 · Jul 7, 2026 · Updated 2 weeks ago
thuhcsi / DiffVar
☆30 · Aug 12, 2023 · Updated 2 years ago
DistrictDataLabs / dedupe-examples
Examples for using the dedupe library
☆10 · Feb 22, 2016 · Updated 10 years ago
YannDubs / Mini_Decodable_Information_Bottleneck
Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.
☆12 · Oct 20, 2020 · Updated 5 years ago
monatis / tqp
Dataset and pretrained model for question paraphrasing in Turkish
☆15 · Jun 7, 2021 · Updated 5 years ago
reppy4620 / vocoders
My vocoder experiments
☆31 · Jul 26, 2025 · Updated last year
yl4579 / AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
☆127 · Jun 16, 2022 · Updated 4 years ago
WangHelin1997 / SSR-Speech
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis
☆154 · Jan 1, 2025 · Updated last year
msaddler / pitchnet
Code to accompany "Deep neural network models reveal interplay of peripheral coding and stimulus statistics in pitch perception" (Saddler…
☆12 · Oct 24, 2021 · Updated 4 years ago
MorenoLaQuatra / vad
Simple voice activity detection (VAD) algorithm in Python
☆15 · Aug 10, 2023 · Updated 2 years ago
mk2299 / MultimodalEncoding
☆14 · Apr 9, 2021 · Updated 5 years ago
JaesikKim / HiG2Vec
☆23 · Mar 20, 2023 · Updated 3 years ago
MiyainNYC / Visual-Memorability-through-Caffe
CNN, Caffe, LaMem, Azure
☆19 · Apr 30, 2016 · Updated 10 years ago
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆38 · Jan 17, 2024 · Updated 2 years ago
mbforbes / physical-commonsense
Do Neural Language Representations Learn Physical Commonsense?
☆22 · Dec 28, 2021 · Updated 4 years ago