A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.
☆28Nov 8, 2025Updated 7 months ago
Alternatives and similar repositories for Awesome-Large-Speech-Model
Users that are interested in Awesome-Large-Speech-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).☆21Jul 2, 2024Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 4 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Hybrid f0 estimation using Convolutional Neural Network☆12Apr 29, 2019Updated 7 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆23Dec 17, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated 2 years ago
- 中文原生工业测评基准☆17Mar 21, 2024Updated 2 years ago
- [ICANN 2023] Anomaly-Based Insider Threat Detection via Hierarchical Information Fusion☆18Nov 20, 2023Updated 2 years ago
- Repo for the FB AI Speech team.☆26Aug 24, 2021Updated 4 years ago
- Paper List☆18Jul 2, 2025Updated 11 months ago
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor☆11Jul 10, 2023Updated 2 years ago
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆30Dec 6, 2023Updated 2 years ago
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting☆36Nov 14, 2023Updated 2 years ago
- Understanding deep networks and large models.☆29Jan 23, 2026Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Aug 10, 2022Updated 3 years ago
- ☆31Apr 22, 2024Updated 2 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- ☆15Sep 13, 2022Updated 3 years ago
- ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)☆10Mar 9, 2024Updated 2 years ago
- [ICME 2021 Oral] Official implementation for "FGF-GAN: A Lightweight Generative Adversarial Network for Pansharpening via Fast Guided Fil…☆11Mar 29, 2022Updated 4 years ago
- Self-host application can generate illustration from a novel by highlighting certain sentences☆13Oct 12, 2025Updated 8 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [Lab] lab website☆12May 29, 2026Updated last month
- ☆21Aug 25, 2021Updated 4 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated 2 years ago
- The official implement of Freeze-Omni.☆16Jul 10, 2025Updated 11 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆29Mar 5, 2024Updated 2 years ago
- 《人工智能程序设计》大作业:吃豆人(成品)☆11Jul 12, 2022Updated 3 years ago
- 基于树形条件随机场的高阶句法分析☆16Apr 28, 2022Updated 4 years ago
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 3 years ago
- 第二届计图人工智能挑战赛,基于Jittor的草图风景图像生成大赛☆10Jan 28, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆15May 29, 2021Updated 5 years ago
- Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。☆304Jun 10, 2026Updated 2 weeks ago
- Semantic Map Learning of Traffic Light to Lane Assignment based on Motion Data☆11Mar 30, 2024Updated 2 years ago
- A library of speech gadgets.☆15Oct 15, 2022Updated 3 years ago
- Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval…☆21Feb 1, 2023Updated 3 years ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- Optimized Analysis of Semantic Segmentation of Remote Sensing Images Based on FCN☆14Nov 4, 2022Updated 3 years ago