xid32 / SoundMindLinks
We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose SoundMind, a rule-based reinforcement learning (RL) algorithm tailored to endow audio language models (ALMs) with deep bimodal reasoning abilities.
☆589Updated 2 weeks ago
Alternatives and similar repositories for SoundMind
Users that are interested in SoundMind are comparing it to the libraries listed below
Sorting:
- ☆690Updated 3 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆309Updated 5 months ago
- 新数据洞察方式☆342Updated 2 weeks ago
- AIFlow is an AI agentic framework designed to scale digital AI agents on BNB Chain.☆238Updated 4 months ago
- ☆414Updated 3 weeks ago
- H2HDB is a comprehensive database for organising and managing H@H comic collections.☆203Updated last month
- A Trusted Human-Multi-Agent Reinforcement Learning Interaction Framework☆505Updated last month
- OmniAgent Framework is an advanced, modular AI orchestration system that transforms Web3 development by seamlessly integrating artificial…☆320Updated 5 months ago
- ☆310Updated 3 months ago
- A L4 innovative AGI System Empowering miRNA Drug Discovery☆332Updated last week
- AIGC Creative Suite☆202Updated 2 months ago
- cheper hcaptcha、recaptcha、recaptchav3、turnstile、5s solver bypass☆394Updated last week
- Vexa is a decentralized AI agent platform built on BNB Chain.☆351Updated 3 months ago
- ☆213Updated last month
- Integration repo between Mind Network and eliza☆185Updated 5 months ago
- Open-Tax is an AI-powered cloud platform transforming tax compliance through automated data integration, real-time anomaly detection, and…☆404Updated 5 months ago
- Tokenize The Virtual Agents Onchain☆241Updated last month
- A Speech-to-Text Input Method For Windows☆474Updated last month
- Welcome to BlockSeek's official documentation. BlockSeek combines state-of-the-art AI with blockchain technology to revolutionize cryptoc…☆310Updated 5 months ago
- UpTop is a BNB Chain-based liquidity protocol that allows users to unilaterally add BNB to liquidity pools, earn high yields, and support…☆75Updated last month
- Rust SDK and CLI for Swarm Framework with Multi-Agent Orchestration☆145Updated 5 months ago
- PVPAI LLM 🔥The First Open-Source DeFAI Large Language Model Powered by DeepSeek.☆304Updated 5 months ago
- Yet another vine copula package, using PyTorch.☆242Updated 2 weeks ago
- Launching the "Agent Creation Toolkit", providing developers with an intuitive and efficient Development Environment, supporting the rapi…☆202Updated 3 months ago
- ☆207Updated 3 months ago
- Viby vibes everything.☆167Updated this week
- Gobi is a lightweight Go BI engine for easy charting and analytics in your projects.☆217Updated this week
- Enhanced Benchmark Creation Tool: Automates dataset profiling, model benchmarking, and performance visualization for streamlined evaluati…☆111Updated 2 months ago
- ☆124Updated 4 months ago
- This is a repository aimed at accelerating the training of MoE models, offering a more efficient scheduling method.☆177Updated 4 months ago