Leveraging BERT to Improve Spoken Language Identification
☆17Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for BERT-LID
Users who are interested in BERT-LID are comparing it to the libraries listed below.
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- Dialect classification, PyTorch☆44Sep 25, 2018Updated 7 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- ☆18Mar 13, 2024Updated 2 years ago
- PyTorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Jul 24, 2023Updated 2 years ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Aug 31, 2023Updated 2 years ago
- ☆11Jun 14, 2024Updated last year
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf☆12Dec 2, 2024Updated last year
- ☆14Dec 21, 2024Updated last year
- PyTorch implementation of ECAPA-TDNN-based speaker recognition for the CN-Celeb dataset☆13Apr 3, 2023Updated 3 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆14Jan 7, 2023Updated 3 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- ☆11Oct 20, 2022Updated 3 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆59May 29, 2023Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- ☆12Oct 17, 2024Updated last year
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…☆59Aug 8, 2025Updated 8 months ago
- Repository containing experiments with Extreme Learning Machines and Reservoir Computing (ELMARC).☆20May 1, 2018Updated 8 years ago
- Code for building and experimenting on saliency maps for RL agents.☆12Feb 13, 2020Updated 6 years ago
- ☆11Nov 5, 2025Updated 6 months ago
- Official implementation of the paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- Phoneme recognizer based on long temporal context (with ALIZE VAD command added)☆17Apr 7, 2012Updated 14 years ago
- Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)☆12Jun 13, 2023Updated 2 years ago
- Ship remote sensing dataset☆12Jun 28, 2022Updated 3 years ago
- A TensorFlow-based spoken language identification system☆99Mar 22, 2023Updated 3 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆146Aug 5, 2022Updated 3 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- ☆18Jul 22, 2024Updated last year
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- ☆19Dec 29, 2024Updated last year
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago