MemVerse: Multimodal Memory for Lifelong Learning Agents
☆137Mar 17, 2026Updated 3 weeks ago
Alternatives and similar repositories for MemVerse
Users that are interested in MemVerse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆22Oct 28, 2025Updated 5 months ago
- Public Evaluation Result Archieve for BFCL☆29Dec 17, 2025Updated 3 months ago
- Hybrid Deep Retrieval-Augmented Generation across Heterogeneous Data Stores☆40Oct 21, 2025Updated 5 months ago
- [ICML'24] Creative Text-to-Audio Generation via Synthesizer Programming☆39Sep 26, 2024Updated last year
- ☆33Sep 19, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Feb 26, 2024Updated 2 years ago
- ☆12Jun 19, 2024Updated last year
- A construction kit for reinforcement learning environment management.☆408Updated this week
- 国科大部分课程期末复习资料☆18Oct 31, 2022Updated 3 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated last month
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆14Oct 17, 2025Updated 5 months ago
- VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding☆59Mar 24, 2026Updated 2 weeks ago
- Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion☆118Mar 12, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval☆36Sep 12, 2025Updated 7 months ago
- A web app for annotating Freesound loops, and the tools to analyse the dataset created.☆20Jul 6, 2023Updated 2 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 6 months ago
- ☆15Aug 12, 2022Updated 3 years ago
- ☆15Nov 5, 2024Updated last year
- ICCV 2025: Official Implematation of "Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced L…☆71Oct 25, 2025Updated 5 months ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 4 months ago
- A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning☆37Mar 12, 2026Updated last month
- Latent Space Sound Design Tool based on the VAE of stable-audio-open☆15Aug 23, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆56Aug 13, 2024Updated last year
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆15Aug 20, 2025Updated 7 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated last month
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- Code of "A Geometric Perspective on Variational Autoencoders" (NeurIPS 2022)☆15Nov 19, 2024Updated last year
- FinSight: Towards Real-World Financial Deep Research. 🎯One ticker, one click, one publication-ready report.☆160Apr 5, 2026Updated last week
- [EMNLP 2024] A Video Chat Agent with Temporal Prior☆32Mar 2, 2025Updated last year
- An autohotkey's script that makes your capslock more powerful☆13Aug 3, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆21Sep 7, 2023Updated 2 years ago
- The companion repository of Saraga collections, with a companion website, a dump of the dataset, documentation, utility scripts and pytho…☆18Jul 4, 2025Updated 9 months ago
- A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization☆18Dec 15, 2025Updated 3 months ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆20Feb 14, 2025Updated last year
- ☆17Oct 12, 2024Updated last year
- A marker-based augmented reality camera app for the web, powered by AR.js.☆16Dec 4, 2022Updated 3 years ago
- General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models.☆45Apr 3, 2026Updated last week