A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.
☆28Nov 8, 2025Updated 4 months ago
Alternatives and similar repositories for Awesome-Large-Speech-Model
Users that are interested in Awesome-Large-Speech-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).☆21Jul 2, 2024Updated last year
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆28Nov 20, 2025Updated 4 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last month
- We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).☆88Jun 2, 2021Updated 4 years ago
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- A Pytorch-Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆52Dec 21, 2023Updated 2 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆22Dec 17, 2025Updated 3 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated 2 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- 中文原生工业测评基准☆15Mar 21, 2024Updated 2 years ago
- fastNLP reimplementation of the paper "A Novel Cascade Binary Tagging Framework for Relational Triple Extraction"☆11Dec 11, 2020Updated 5 years ago
- [ICANN 2023] Anomaly-Based Insider Threat Detection via Hierarchical Information Fusion☆18Nov 20, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [IEEE, TASLP, 2023] The code of the paper "Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition".☆19Sep 27, 2024Updated last year
- Multilingual Translations of "Foundations of Large Language Models" and NLPBook.☆252Sep 18, 2025Updated 6 months ago
- Paper List☆18Jul 2, 2025Updated 8 months ago
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆31Dec 6, 2023Updated 2 years ago
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting☆36Nov 14, 2023Updated 2 years ago
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆26Aug 24, 2025Updated 7 months ago
- ☆11Aug 10, 2022Updated 3 years ago
- ☆31Apr 22, 2024Updated last year
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆16Nov 5, 2018Updated 7 years ago
- ☆15Sep 13, 2022Updated 3 years ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University …☆163Jul 17, 2024Updated last year
- [ICME 2021 Oral] Official implementation for "FGF-GAN: A Lightweight Generative Adversarial Network for Pansharpening via Fast Guided Fil…☆10Mar 29, 2022Updated 4 years ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- [Lab] lab website☆11Mar 23, 2026Updated last week
- Code for "Leveraging Knowledge of Modality Experts for Incomplete Multimodal Learning" accepted by ACM Multimedia 2024☆43Jan 15, 2025Updated last year
- The official implement of Freeze-Omni.☆15Jul 10, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆34Apr 22, 2024Updated last year
- 基于树形条件随机场的高阶句法分析☆16Apr 28, 2022Updated 3 years ago
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 3 years ago
- ☆15May 29, 2021Updated 4 years ago
- Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。☆282Mar 19, 2026Updated last week
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆28Jun 30, 2025Updated 9 months ago
- Semantic Map Learning of Traffic Light to Lane Assignment based on Motion Data☆11Mar 30, 2024Updated 2 years ago