bofenghuang / community-eventsLinks
Place where folks can contribute to π€ community events
β9Updated 2 years ago
Alternatives and similar repositories for community-events
Users that are interested in community-events are comparing it to the libraries listed below
Sorting:
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.β84Updated last year
- Tunable pipelinesβ34Updated 4 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.β57Updated 6 months ago
- Various speech datasets made available to the publicβ122Updated 6 months ago
- Clustering-based methods for overlapping diarizationβ80Updated last year
- β104Updated last month
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ102Updated 4 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Rivaβ91Updated 4 months ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.β17Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Speaker change detection using SincNet and an LSTM/Transformerβ52Updated last month
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ115Updated 2 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech represβ¦β21Updated last year
- β38Updated 3 years ago
- β37Updated 2 months ago
- Rescoring methods for end-to-end Automatic Speech Recognitionβ27Updated 4 years ago
- β56Updated 2 years ago
- A merged version of multiple open-source German speech datasets.β31Updated last year
- β20Updated 2 years ago
- Fine-Tune Whisper with Transformers and PEFTβ57Updated last year
- Linguistic processing for Common Voiceβ55Updated last year
- This is the M-AILABS Speech Datasetβ67Updated 6 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decodingβ75Updated 3 years ago
- asr2kβ50Updated last year
- This project is about performing Speaker diarization for Hindi Language.β50Updated 4 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detectionβ63Updated 2 months ago
- Predicts the level of noise and reverberation on your audiofilesβ152Updated last week
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paperβ21Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>β18Updated 3 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasksβ42Updated last year