xid32 / SoundMindLinks
We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose SoundMind, a rule-based reinforcement learning (RL) algorithm tailored to endow audio language models (ALMs) with deep bimodal reasoning abilities.
☆1,101Updated 3 weeks ago
Alternatives and similar repositories for SoundMind
Users that are interested in SoundMind are comparing it to the libraries listed below
Sorting:
- DeepWism R2 is a next-generation AGI system built on the T3CEDS framework (Thin-Thick-Thin Crowd Entropy Dynamics System), which redefine…☆1,019Updated 5 months ago
- Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media via LLama- Based Modeling☆822Updated 2 months ago
- ☆515Updated 9 months ago
- ☆530Updated 10 months ago
- ☆709Updated 6 months ago
- ☆600Updated last month
- Liang - Non functional requirements should be part of function interfaces☆1,012Updated 4 years ago
- UpTop is a BNB Chain-based liquidity protocol that allows users to unilaterally add BNB to liquidity pools, earn high yields, and support…☆75Updated 6 months ago
- This is a database project.☆1,020Updated last month
- 日历软件重写☆453Updated 8 months ago
- ☆895Updated 2 months ago
- ☆497Updated 3 months ago
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆557Updated this week
- ☆946Updated 3 months ago
- Framework that enables fine-tuning of vision-language grounding models on custom datasets☆601Updated 8 months ago
- AI Group is a powerful mobile intelligent assistant application that integrates multiple large language models (LLMs) and AI services, pr…☆1,098Updated 3 months ago
- ☆841Updated 5 months ago
- Translate PDF, Word, PowerPoint, etc. | zotero翻译插件,微信扫码注册,新用户可免费翻译25万汉字或100万个英文字母。超能文献官网:suppr.wilddata.cn;☆658Updated last month
- Vexa is a decentralized AI agent platform built on BNB Chain.☆348Updated 8 months ago
- ☆41Updated 4 months ago
- 职星学院企业培训系统是一套基于点播、直播、考试、培训、面授等功能完善的在线教育系统,开源版是基于商业版精简实现的一个企业员工培训系统,致力于打造一个各行业都适用的在线培训系统、企业培训平台、员工培训系统、企业内部培训系统。☆534Updated 6 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆312Updated 3 weeks ago
- AI-powered document analysis platform built with Next.js, LangChain, PostgreSQL + pgvector. Upload, organize, and chat with documents. In…☆698Updated this week
- Dataset approched by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection☆1,003Updated 8 months ago
- The source code for the ICDE 2026 paper☆59Updated 2 weeks ago
- ☆60Updated 4 months ago
- A public good tool to help users verify Safe (Gnosis Safe) transactions before signing or execution.☆524Updated 7 months ago
- Spring Boot framework for implementing distributed transactions using reliable messaging with RabbitMQ☆413Updated 9 months ago
- A curated list of Model Context Protocol (MCP) servers☆506Updated 2 weeks ago
- OmniAgent Framework is an advanced, modular AI orchestration system that transforms Web3 development by seamlessly integrating artificial…☆320Updated 11 months ago