☆31Dec 2, 2020Updated 5 years ago
Alternatives and similar repositories for End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
Users that are interested in End-to-end-E2E-Named-Entity-Recognition-from-English-Speech are comparing it to the libraries listed below
Sorting:
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 3 years ago
- ☆11Aug 10, 2022Updated 3 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Multi-Scale Attention for Audio Question Answering☆28Jul 19, 2023Updated 2 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- ☆12Mar 31, 2020Updated 5 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year
- provide SPHERE-formatted output as well as RIFF, AU, AIFF and raw☆14Dec 18, 2021Updated 4 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Jul 29, 2021Updated 4 years ago
- ☆37May 20, 2022Updated 3 years ago
- ☆86Jul 31, 2025Updated 7 months ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- Official Implementation of Mockingjay in Pytorch☆56Jul 6, 2023Updated 2 years ago
- ☆24Sep 20, 2024Updated last year
- Repository for SLURP paper☆109Apr 20, 2022Updated 3 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- A multimodal fine-grained correlation fusion network with attention mechanisms for visual-textual sentiment analysis☆10Jan 13, 2024Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- [ARCHIVED] ✨ Full-stack school homepage / TypeScript, Remix (React), Prisma, CI/CD and more☆12Sep 26, 2022Updated 3 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34May 5, 2018Updated 7 years ago
- Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling☆37Apr 14, 2022Updated 3 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆81Mar 12, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- Object-Oriented Programming II☆12Jul 23, 2021Updated 4 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 5 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 10 months ago
- Parallel processing with sequential output, respecting order of input☆10Feb 20, 2023Updated 3 years ago
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Mar 31, 2023Updated 2 years ago
- Run code-llama with 50k tokens using flash attention and better transformer☆12Nov 21, 2023Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated 3 weeks ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆39Jan 6, 2024Updated 2 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆41Aug 29, 2024Updated last year
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Archive and make discoverable data and links with schema.org metadata.☆38Nov 4, 2014Updated 11 years ago
- ☆13Oct 17, 2020Updated 5 years ago