xieh97 / dcase2023-audio-retrieval
Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge
☆10Aug 8, 2023Updated 2 years ago
Alternatives and similar repositories for dcase2023-audio-retrieval
Users who are interested in dcase2023-audio-retrieval are comparing it to the libraries listed below.
- ☆14Mar 25, 2023Updated 2 years ago
- NanoGPT (124M) in 5 minutes☆14Feb 14, 2025Updated last year
- Code for CVSSP submission to DCASE 2021 Task 6☆36Nov 22, 2022Updated 3 years ago
- Tools for the evaluation of audio captioning.☆18May 23, 2020Updated 5 years ago
- Single Channel Speech Enhancement Methods and Toolbox☆33Mar 2, 2025Updated 11 months ago
- (WIP) Long-form speech generation☆31Apr 2, 2025Updated 10 months ago
- Voxtral: Convert Mistral into an end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆42Mar 7, 2025Updated 11 months ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 4 months ago
- Text-To-Speech for NotebookLM☆37Jul 20, 2025Updated 6 months ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- Lightweight Vue 3 component library to accompany @volverjs/style.☆12Feb 4, 2026Updated last week
- Configuration Space Exploration Framework☆17Oct 13, 2020Updated 5 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- Standardizing Error.captureStackTrace☆13Jan 20, 2026Updated 3 weeks ago
- Sound Angle Estimation by Fusion of Gaussian Mixture Model and Multiple Signal Classification☆12Jan 31, 2019Updated 7 years ago
- Repository of files shared during OpenPlanetary Data Cafés☆11Sep 15, 2022Updated 3 years ago
- A browser extension to dynamically make fun of your movies☆11Jan 22, 2026Updated 3 weeks ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆14May 20, 2022Updated 3 years ago
- ☆13Apr 16, 2022Updated 3 years ago
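- This project organizes the open-source MOSS SFT data and converts it into the mnbvc multi-turn dialogue format. MOSS-003 covers three aspects (helpfulness, faithfulness, and harmlessness) with 3.53 million samples in total; MOSS-003 includes finer-grained helpfulness category labels, broader harmlessness data, and longer dialogues, with 6.3 million samples in total.☆12Dec 3, 2023Updated 2 years ago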
- Merit Award solution for the 2020 Xiamen International Bank "数创金融杯" (Data Innovation Finance Cup) modeling competition☆11Feb 2, 2021Updated 5 years ago
- ☆15May 11, 2025Updated 9 months ago
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook☆25Dec 22, 2025Updated last month
- A project that enables Dota 2 to interact with Logitech G910 and G410, as well as Corsair keyboards.☆10Jul 4, 2016Updated 9 years ago
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- Final-stage solution for the first China ECG Intelligence Challenge (public version). Competition website: http://mdi.ids.tsinghua.edu.cn/☆10Aug 21, 2019Updated 6 years ago
- LINE Notify with GitHub Actions☆13Feb 10, 2022Updated 4 years ago
- Quirc Decoder QR in the form of WASM☆14Jan 5, 2023Updated 3 years ago
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- Intermediate Java workshop on variables, abstraction, and design patterns ☕☆10Sep 7, 2017Updated 8 years ago
- ☆10Aug 3, 2020Updated 5 years ago
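- A knowledge base built on native browser bookmarks: it syncs bookmarks across browsers with GitHub Gist, uses AI to generate summaries, tags, and cover images for them, and provides a clean web browsing experience.☆30Jan 5, 2026Updated last month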
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten…☆12Jan 30, 2021Updated 5 years ago
- A real-time radio station driven by multiple agents☆30Feb 8, 2026Updated last week
- Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆12Jan 29, 2025Updated last year
- Construction of the GoGPT Chinese instruction dataset☆10Jan 29, 2024Updated 2 years ago