We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose SoundMind, a rule-based reinforcement learning (RL) algorithm tailored to endow audio language models (ALMs) with deep bimodal reasoning abilities.
☆1,106Nov 26, 2025Updated 3 months ago
Alternatives and similar repositories for SoundMind
Users who are interested in SoundMind are comparing it to the libraries listed below
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆314Nov 26, 2025Updated 3 months ago
- ☆682Jan 20, 2026Updated 2 months ago
- ☆279Apr 29, 2025Updated 10 months ago
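- 数字底座 (Digital Base) is a unified, secure management support platform for the digital transformation of large government bodies and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, and security controls. It follows the three-administrator management model, offers microservices, multi-tenancy, containerization, and support for domestic (Chinese) technology stacks, lets users quickly build their own business applications with a code generator, and can also link to…☆2,579Updated this week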
- ☆297Sep 14, 2025Updated 6 months ago
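- 数据流引擎 (Data Flow Engine) is a low-level data-driven engine for data integration, data synchronization, data exchange, data sharing, task configuration, and task scheduling. It uses separation of management and execution, multiple flow layers, and a plugin library to handle practical data problems: large-scale data tasks, high-frequency data reporting, high-frequency data collection, and heterogeneous data compatibility.☆693Mar 12, 2026Updated last week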
- [T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv…☆448Dec 13, 2024Updated last year
- Dataset proposed in "A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection"☆1,004Apr 3, 2025Updated 11 months ago
- AI Group is a powerful mobile intelligent assistant application that integrates multiple large language models (LLMs) and AI services, pr…☆1,099Sep 10, 2025Updated 6 months ago
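- 数据标注 (Data Labeling) is a tool dedicated to processing and annotating text data. Through a simplified, fast annotation workflow and dynamic algorithmic feedback, it lets users label keywords quickly while the algorithm continuously reduces the cost and time of manual annotation. The process starts with manual annotation to build a foundation, then automatic annotation feeds back into manual annotation, and finally manual annotation corrects errors, substantially improving annotation accuracy and…☆696Jun 23, 2025Updated 8 months ago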
- ☆371Sep 6, 2025Updated 6 months ago
- ☆839Jul 7, 2025Updated 8 months ago
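- FIT: an enterprise-grade AI development framework providing a multi-language function engine (FIT), a streaming orchestration engine (WaterFlow), and a LangChain alternative for the Java ecosystem (FEL). It runs in native or Spring mode, supports hot-swappable plugins and intelligent aggregated or distributed deployment, and seamlessly unifies large models with business systems.☆2,105Mar 13, 2026Updated last week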
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆533Feb 24, 2026Updated 3 weeks ago
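- AppPlatform is a cutting-edge large-model application engineering project that aims to simplify and optimize the development of large-model training and inference applications through integrated declarative programming and low-code configuration tools. It gives software engineers and product managers a powerful, extensible environment that supports end-to-end AI application development, from concept to deployment.☆1,424Mar 13, 2026Updated last week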
- Some tools for cloud developers☆408Aug 30, 2024Updated last year
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆432Jan 18, 2026Updated 2 months ago
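- 电子邮件 (Email) is a streamlined enterprise mailbox with a built-in mail server; after importing mail from other mainstream mailboxes, organizations can control the security of their mail data themselves. It has a fairly clean interface, and its concise, precise features and small, secure architecture make it easy for enterprises and government bodies to customize it to their business requirements. It depends on the open-source 数字底座 (Digital Base) for personnel and position management.☆368Updated this week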
- [CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.1…☆619May 22, 2025Updated 10 months ago
- GENERator: A Long-Context Generative Genomic Foundation Model☆447Feb 10, 2026Updated last month
- Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale☆5,665Mar 14, 2026Updated last week
- A Speech-to-Text Input Method For Windows☆474Nov 29, 2025Updated 3 months ago
- 💰The only genuine version💰 minerproxy: mining-pool fee skimming, mining-pool proxy, mining-pool relay, mining-pool fee…☆3,881Updated this week
- ☆592Oct 11, 2025Updated 5 months ago
- Uncommon Objects in 3D dataset☆1,315Nov 13, 2025Updated 4 months ago
- ☆343Jul 4, 2025Updated 8 months ago
- 🔥minerproxy: mining-pool fee skimming, mining-pool relay, built for mining-farm operations and maintenance☆3,416Mar 13, 2026Updated last week
- Align Anything: Training All-modality Model with Feedback☆4,634Nov 27, 2025Updated 3 months ago
- The next-generation deep reinforcement learning toolkit☆3,462Jun 16, 2023Updated 2 years ago
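- 工作流引擎 (Workflow Engine): internally, it provides a digital implementation of an organization's or agency's process-management rules and internal business processes; externally, it provides automated third-party business driving, interface integration, and algorithm-unit driving. Beyond the underlying engine, it gives full consideration to global transparent monitoring, security defense, and features for domestic (Chinese) technology stacks, making it a strong choice for internal process management and business algorithm driving.☆858Mar 12, 2026Updated last week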
- GENERanno: A Genomic Foundation Model for Metagenomic Annotation☆309Feb 27, 2026Updated 3 weeks ago
- ⭐ Dynamically generate stats SVG from your GitHub, LeetCode, Steam, and more in #Cyberpunk style :)☆629Aug 31, 2025Updated 6 months ago
- ☆405Aug 31, 2022Updated 3 years ago
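- 网络硬盘 (Network Drive) is a tool for managing documents, files, images, audio, video, and other materials through storage, classification, search, sharing, collaboration, distribution, recycling, and display. It is well suited to controlling document permissions, storage-space allocation, security encryption, and link sharing in domestic private deployments, and also supports lightweight sending and receiving of file tasks. It depends on the open-source 数字底座 (Digital Base) for personnel and position management.☆353Updated this week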
- Fullstack engineer's checklist for your cybersecurity.☆382Jul 11, 2024Updated last year
- ☆418Aug 24, 2024Updated last year
- Junk-code obfuscation for Android repackaged (clone) apps published on Google Play☆1,266Jul 16, 2025Updated 8 months ago
- Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms…☆445Aug 16, 2024Updated last year
- A Python library for converting images into FPGA-displayable pixel art.☆395Jan 3, 2025Updated last year