xid32 / SoundMindLinks
We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose SoundMind, a rule-based reinforcement learning (RL) algorithm tailored to endow audio language models (ALMs) with deep bimodal reasoning abilities.
☆1,097Updated last month
Alternatives and similar repositories for SoundMind
Users that are interested in SoundMind are comparing it to the libraries listed below
Sorting:
- DeepWism R2 is a next-generation AGI system built on the T3CEDS framework (Thin-Thick-Thin Crowd Entropy Dynamics System), which redefine…☆1,020Updated 3 months ago
- ☆601Updated last year
- Framework that enables fine-tuning of vision-language grounding models on custom datasets☆600Updated 5 months ago
- ☆514Updated 7 months ago
- ☆529Updated 8 months ago
- 新数据洞察方式☆1,006Updated 3 months ago
- This is a database project.☆1,018Updated 2 weeks ago
- Liang - Non functional requirements should be part of function interfaces☆1,012Updated 3 years ago
- 日历软件重写☆452Updated 6 months ago
- AI-powered tool for efficient abstract and PDF screening in systematic reviews.☆1,300Updated 4 months ago
- Dataset approched by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection☆1,002Updated 6 months ago
- ☆649Updated last week
- Chat with your past.☆609Updated this week
- UpTop is a BNB Chain-based liquidity protocol that allows users to unilaterally add BNB to liquidity pools, earn high yields, and support…☆75Updated 3 months ago
- ☆414Updated 3 months ago
- AI Group is a powerful mobile intelligent assistant application that integrates multiple large language models (LLMs) and AI services, pr…☆964Updated 3 weeks ago
- AI Integrated Professional Document Reader☆619Updated last week
- A public good tool to help users verify Safe (Gnosis Safe) transactions before signing or execution.☆524Updated 4 months ago
- OmniAgent Framework is an advanced, modular AI orchestration system that transforms Web3 development by seamlessly integrating artificial…☆319Updated 8 months ago
- PVPAI LLM 🔥The First Open-Source DeFAI Large Language Model Powered by DeepSeek.☆302Updated 8 months ago
- Spring Boot framework for implementing distributed transactions using reliable messaging with RabbitMQ☆414Updated 6 months ago
- Open-Tax is an AI-powered cloud platform transforming tax compliance through automated data integration, real-time anomaly detection, and…☆405Updated 8 months ago
- A Trusted Human-Multi-Agent Reinforcement Learning Interaction Framework☆504Updated 2 months ago
- Vexa is a decentralized AI agent platform built on BNB Chain.☆349Updated 6 months ago
- Res-SAM Framework for GPR Underground Hazard Detection☆1,026Updated last week
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆310Updated 8 months ago
- ☆53Updated last month
- AML end to end system☆971Updated 9 months ago
- Firmware for a 100W DC Electronic Load based on STM32F405 and LVGL (Keil MDK Project).☆499Updated 3 months ago
- django vue3 ts admin vben fastapi langchain 寻找远程/全职 Python 岗位机会 WX JUN765462425☆630Updated this week