NAR-BERT-ASR
☆10Sep 27, 2021Updated 4 years ago
Alternatives and similar repositories for NAR-BERT-ASR
Users that are interested in NAR-BERT-ASR are comparing it to the libraries listed below
Sorting:
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆11Aug 10, 2022Updated 3 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- ☆17Jul 22, 2024Updated last year
- A light webserver for monitoring RAM and GPU usage on multiple servers.☆21Mar 31, 2021Updated 4 years ago
- The enhanced version of ZEN, larger and more powerful.☆31Jul 22, 2022Updated 3 years ago
- 一個透過Google App Script發送台科公佈欄資訊的機器人☆23Sep 22, 2022Updated 3 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- ☆24Sep 20, 2024Updated last year
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆24Oct 11, 2024Updated last year
- Code for Findings of ACL 2022 Paper "Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors"☆26Jun 15, 2022Updated 3 years ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 2 years ago
- ☆30Jun 12, 2025Updated 8 months ago
- ☆31Dec 2, 2020Updated 5 years ago
- real time face swap and one-click video deepfake with only a single image☆12Sep 13, 2024Updated last year
- ☆37Mar 30, 2021Updated 4 years ago
- A hand-gesture recognition system using Doppler effect of ultrasonic.☆11Mar 2, 2019Updated 7 years ago
- Wind Turbine Blade Image Dateset☆13May 23, 2019Updated 6 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- Create short vertical videos for TikTok, YouTube Shorts, and Instagram Reels using AI. Fully automated pipeline with traceability. 🚀🎥☆22Updated this week
- [ACM MobiSys 2024 Demo] Image-based Indoor Localization using Object Detection and LSTM☆12Feb 12, 2026Updated 3 weeks ago
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated 3 weeks ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- Ship remote sensing dataset☆12Jun 28, 2022Updated 3 years ago
- Pre-trained Wav2vec2.0 for Mandarin☆43Oct 30, 2022Updated 3 years ago
- Fast text chunking algorithms for Python☆12Oct 7, 2020Updated 5 years ago
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- ☆10Oct 16, 2025Updated 4 months ago
- Provides a python script that can patch executables compiled with Intel compiler or Intel MKL, for better performance on AMD processors☆12Jun 5, 2022Updated 3 years ago
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- Pythonic Nvidia Codec Library☆17Feb 23, 2026Updated last week
- ☆13Oct 17, 2020Updated 5 years ago