A repository that organizes content related to large speech (audio) models, including papers, data, applications, and tools.
☆28Nov 8, 2025Updated 5 months ago
Alternatives and similar repositories for Awesome-Large-Speech-Model
Users who are interested in Awesome-Large-Speech-Model are comparing it to the libraries listed below.
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last month
- A project for speech translation☆12Sep 28, 2023Updated 2 years ago
- A PyTorch Lightning implementation of the Transformer network☆11Oct 22, 2020Updated 5 years ago
- Hybrid f0 estimation using Convolutional Neural Network☆12Apr 29, 2019Updated 6 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated 2 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- A native Chinese industrial evaluation benchmark☆15Mar 21, 2024Updated 2 years ago
- [ICANN 2023] Anomaly-Based Insider Threat Detection via Hierarchical Information Fusion☆18Nov 20, 2023Updated 2 years ago
- [IEEE, TASLP, 2023] The code of the paper "Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition".☆19Sep 27, 2024Updated last year
- Dataset [ACL 2026]☆32Jul 31, 2025Updated 8 months ago
- Paper List☆18Jul 2, 2025Updated 9 months ago
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor☆10Jul 10, 2023Updated 2 years ago
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆31Dec 6, 2023Updated 2 years ago
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting☆36Nov 14, 2023Updated 2 years ago
- ☆11Aug 10, 2022Updated 3 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- ☆15Sep 13, 2022Updated 3 years ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)☆10Mar 9, 2024Updated 2 years ago
- [ICME 2021 Oral] Official implementation for "FGF-GAN: A Lightweight Generative Adversarial Network for Pansharpening via Fast Guided Fil…☆10Mar 29, 2022Updated 4 years ago
- A self-hosted application that generates illustrations from a novel by highlighting certain sentences☆13Oct 12, 2025Updated 6 months ago
- [Lab] lab website☆11Mar 23, 2026Updated 3 weeks ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated last year
- The official implementation of Freeze-Omni.☆15Jul 10, 2025Updated 9 months ago
- Code for "Leveraging Knowledge of Modality Experts for Incomplete Multimodal Learning" accepted by ACM Multimedia 2024☆44Jan 15, 2025Updated last year
- Higher-order syntactic parsing based on tree-structured conditional random fields☆16Apr 28, 2022Updated 3 years ago
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 3 years ago
- ☆15May 29, 2021Updated 4 years ago
- Your faithful, impartial partner for audio evaluation: authentic evaluation, know yourself, know your rivals.☆289Apr 8, 2026Updated last week
- Semantic Map Learning of Traffic Light to Lane Assignment based on Motion Data☆11Mar 30, 2024Updated 2 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
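- 2021 MathorCup University Mathematical Modeling Challenge, Big Data Competition, Problem B: remote sensing land parcel segmentation (national first prize)☆12May 1, 2021Updated 4 years ago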
- ☆15Jan 25, 2024Updated 2 years ago
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- Optimized Analysis of Semantic Segmentation of Remote Sensing Images Based on FCN☆13Nov 4, 2022Updated 3 years ago
- Android TF-lite App☆16Apr 22, 2020Updated 5 years ago
- A Django-based network drive modeled on the UNIX file management system☆14Sep 1, 2018Updated 7 years ago