We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose SoundMind, a rule-based reinforcement learning (RL) algorithm tailored to endow audio language models (ALMs) with deep bimodal reasoning abilities.
☆1,105Nov 26, 2025Updated 3 months ago
Alternatives and similar repositories for SoundMind
Users that are interested in SoundMind are comparing it to the libraries listed below
Sorting:
- ☆682Jan 20, 2026Updated last month
- ☆279Apr 29, 2025Updated 10 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆313Nov 26, 2025Updated 3 months ago
- ☆297Sep 14, 2025Updated 5 months ago
- [T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv…☆448Dec 13, 2024Updated last year
- Dataset approched by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection☆1,004Apr 3, 2025Updated 10 months ago
- 数据流引擎是一款面向数据集成、数据同步、数据交换、数据共享、任务配置、任务调度的底层数据驱动引擎。数据流引擎采用管执分离、多流层、插件库等体系应对大规模数据任务、数据高频上报、数据高频采集、异构数据兼容的实际数据问题。☆692Updated this week
- ☆371Sep 6, 2025Updated 5 months ago
- 数据标注是一款专门对文本数据进行处理和标注的工具,通过简化快捷的文本标注流程和动态的算法反馈,支持用户快速标注关键词并能通过算法持续减少人工标注的成本和时间。数据标注的过程先由人工标注构建基础,再由自动标注反哺人工标注,最后由人工标注进行纠偏,从而大幅度提高标注的精准度和高…☆695Jun 23, 2025Updated 8 months ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆533Updated this week
- AppPlatform 是一个前沿的大模型应用工程,旨在通过集成的声明式编程和低代码配置工具,简化和优化大模型的训练与推理应用的开发过程。本工程为软件工程师和产品经理提供一个强大的、可扩展的环境,以支持从概念到部署的全流程 AI 应用开发。☆1,421Feb 12, 2026Updated 2 weeks ago
- GENERator: A Long-Context Generative Genomic Foundation Model☆444Feb 10, 2026Updated 2 weeks ago
- AI Group is a powerful mobile intelligent assistant application that integrates multiple large language models (LLMs) and AI services, pr…☆1,097Sep 10, 2025Updated 5 months ago
- ☆839Jul 7, 2025Updated 7 months ago
- 数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色、数据目录、安全控制等 功能构建的统一且安全的管理支撑平台。数字底座基于三员管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸…☆2,574Updated this week
- FIT: 企业级AI开发框架,提供多语言函数引擎(FIT)、流式编排引擎(WaterFlow)及Java生态的LangChain替代方案(FEL)。原生/Spring双模运行,支持插件热插拔与智能聚散部署,无缝统一大模型与业务系统。☆2,102Updated this week
- [CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.1…☆615May 22, 2025Updated 9 months ago
- Some tools for cloud developers☆407Aug 30, 2024Updated last year
- 电子邮件是一款简化的具备邮件服务器的企业邮箱,支持在将其他主流邮箱的邮件进行导入后自主控制邮件数据安全。电子邮件具备较为简洁的界面风格,以其简洁精确的功能和小巧安全的架构便于企业和政府根据业务要求进行二次开发。电子邮件需要依赖开源的数字底座进行人员岗位管控。☆366Updated this week
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆431Jan 18, 2026Updated last month
- A Speech-to-Text Input Method For Windows☆474Nov 29, 2025Updated 3 months ago
- ☆592Oct 11, 2025Updated 4 months ago
- Fullstack engineer's checklist for your cybersecurity.☆381Jul 11, 2024Updated last year
- ☆418Aug 24, 2024Updated last year
- 网络硬盘是通过存储、分类、检索、分享、协作、下发、回收、展示等方式管理文档、文件、图片、音频、视频等资料的工具。网络硬盘擅长在国产的私有化环境中管控文档权限、存储空间分配、安全加密、链接分享,同时支持一定轻量级的文件任务收发。网络硬盘需要依赖开源的数字底座进行人员岗位管控。☆353Updated this week
- GENERanno: A Genomic Foundation Model for Metagenomic Annotation☆306Updated this week
- ⭐ Dynamically generate stats SVG from your Github, LeetCode, Steam, and more in #Cyberpunk style :)☆629Aug 31, 2025Updated 6 months ago
- ☆344Jul 4, 2025Updated 7 months ago
- ☆406Aug 31, 2022Updated 3 years ago
- A Python library for converting images into FPGA-displayable pixel art.☆395Jan 3, 2025Updated last year
- 工作流引擎对内提供单位/机关流程管理规则和内部业务流程的数字化落地实践;对外提供自动化地第三方业务驱动、接口接入和算法单元驱动能力。工作流引擎在提供底层驱动引擎的同时对全局透明监控、安全防御和国产化特色功能进行充分考虑,是内部流程管理和业务算法驱动的不二之选。☆857Updated this week
- Uncommon Objects in 3D dataset☆1,312Nov 13, 2025Updated 3 months ago
- ☆391May 5, 2025Updated 9 months ago
- Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …☆207Dec 15, 2025Updated 2 months ago
- Completed this competition in collaboration with Jiang Yan(https://github.com/jy1993) and Guan Shuicheng(https://github.com/guanshuicheng…☆362Nov 6, 2024Updated last year
- Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms…☆445Aug 16, 2024Updated last year
- Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/☆400May 27, 2024Updated last year
- https://freechat.fun☆552Updated this week
- 💰唯一正版💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水 矿池代理 矿 池中转 矿池抽…☆3,882Feb 2, 2026Updated 3 weeks ago