☆22Mar 21, 2025Updated 11 months ago
Alternatives and similar repositories for OmniRL
Users that are interested in OmniRL are comparing it to the libraries listed below
Sorting:
- ☆18Jun 10, 2025Updated 8 months ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"☆18Oct 7, 2025Updated 5 months ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆19Mar 2, 2025Updated last year
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆19Oct 20, 2025Updated 4 months ago
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆19Apr 22, 2025Updated 10 months ago
- ☆22Feb 13, 2026Updated 3 weeks ago
- ☆11Sep 1, 2025Updated 6 months ago
- ☆14May 20, 2025Updated 9 months ago
- ☆13Oct 3, 2023Updated 2 years ago
- ☆20Apr 15, 2025Updated 10 months ago
- ☆10Oct 5, 2023Updated 2 years ago
- Official Implementation of Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations☆18Jan 5, 2025Updated last year
- 🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning☆26Dec 11, 2025Updated 2 months ago
- [ACL 2025] Can MLLMs Understand the Deep Implication Behind Chinese Images?☆20Oct 20, 2025Updated 4 months ago
- [ICML'25] Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models☆21Sep 7, 2025Updated 6 months ago
- A Paper List for Geo-localization Research☆16Sep 2, 2024Updated last year
- ☆15Jul 24, 2024Updated last year
- ☆34Feb 4, 2026Updated last month
- Training A Small Emotional Vision Language Model for Visual Art Comprehension☆14Jul 26, 2024Updated last year
- Official implementation of UNAD: Universal Anatomy-initialized Noise Distribution Learning Framework Towards Low-dose CT Denoising☆14Mar 19, 2024Updated last year
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆22Dec 8, 2024Updated last year
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated 9 months ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding☆15Nov 10, 2025Updated 3 months ago
- Official Implementation of Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval☆26Jul 14, 2025Updated 7 months ago
- Image2Points: A 3D Point-based Context Clusters GAN for High-Quality PET Image Reconstruction (ICASSP 2024)☆14Jun 16, 2024Updated last year
- ☆17May 2, 2024Updated last year
- [ICCV2025] WikiAutoGen offical page☆24Feb 6, 2026Updated last month
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…☆15Aug 12, 2024Updated last year
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆28Nov 24, 2025Updated 3 months ago
- Official Implementation of CODE☆17Sep 26, 2024Updated last year
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…☆20Sep 26, 2024Updated last year
- The official code respository for "Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding" (ICLR 2024)☆27Mar 8, 2025Updated last year
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆57Oct 14, 2025Updated 4 months ago
- Towards a general language-audio model for computational paralinguistic tasks☆24Dec 14, 2024Updated last year
- Official code implementation of the paper: QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmente…☆38Jan 10, 2026Updated last month
- ☆69Updated this week
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 4 months ago