bofenghuang / community-eventsLinks

Place where folks can contribute to 🤗 community events

☆9

Alternatives and similar repositories for community-events

Users that are interested in community-events are comparing it to the libraries listed below

Sorting:

FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆84Updated last year
pyannote / pyannote-pipeline
Tunable pipelines
☆34Updated 4 months ago
CouncilDataProject / speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
☆57Updated 6 months ago
revdotcom / speech-datasets
Various speech datasets made available to the public
☆122Updated 6 months ago
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆80Updated last year
huggingface / open_asr_leaderboard
☆104Updated last month
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆102Updated 4 months ago
nvidia-riva / riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆91Updated 4 months ago
kingabzpro / WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
☆17Updated 3 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
HHousen / speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
☆52Updated last month
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆115Updated 2 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆21Updated last year
ccoreilly / wav2vec2-service
☆38Updated 3 years ago
RuABraun / texterrors
☆37Updated 2 months ago
diego-fustes / asr-rescoring
Rescoring methods for end-to-end Automatic Speech Recognition
☆27Updated 4 years ago
besacier / ASR2022
☆56Updated 2 years ago
german-asr / megs
A merged version of multiple open-source German speech datasets.
☆31Updated last year
sanchit-gandhi / whisper-flash-attention
☆20Updated 2 years ago
fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆57Updated last year
ftyers / commonvoice-utils
Linguistic processing for Common Voice
☆55Updated last year
imdatceleste / m-ailabs-dataset
This is the M-AILABS Speech Dataset
☆67Updated 6 months ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
xinjli / asr2k
asr2k
☆50Updated last year
muskang48 / Speaker-Diarization
This project is about performing Speaker diarization for Hindi Language.
☆50Updated 4 years ago
backspacetg / simul_whisper
Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
☆63Updated 2 months ago
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
☆152Updated last week
msalhab96 / MultiSpeech
pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper
☆21Updated 3 years ago
Lhx94As / E2E-language-diarization
Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>
☆18Updated 3 years ago
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆42Updated last year