RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.
☆59Mar 18, 2026Updated last month
Alternatives and similar repositories for RLLaVA
Users that are interested in RLLaVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Framework for Collaboration of Experts from Benchmark☆13Apr 27, 2025Updated last year
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging☆39Nov 4, 2025Updated 6 months ago
- 3DSlicer plugin for inpainting lung nodules in 3D chest CT data.☆11Dec 2, 2024Updated last year
- Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.…☆48Apr 8, 2026Updated 3 weeks ago
- Repo for paper "Agentic-R: Learning to Retrieve for Agentic Search" (ACL 2026 Findings)☆79Apr 9, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Jun 6, 2023Updated 2 years ago
- 飞书Clawdbot配置指南☆42Jan 30, 2026Updated 3 months ago
- [AAAI 2026] Official repository of Circulant Attention☆47Jan 12, 2026Updated 3 months ago
- Learning Matchable Image Transformations☆13Sep 10, 2019Updated 6 years ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- Code for Semantic Adversarial Attacks☆11Oct 12, 2021Updated 4 years ago
- CVPR 2025: 'ZoomLDM: Latent Diffusion Model for multi-scale image generation'☆32Dec 15, 2025Updated 4 months ago
- Official code repository of Shuffle-R1☆25Feb 23, 2026Updated 2 months ago
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe☆161Mar 30, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- This repository contains all the source code needed to reproduce the experiments or review the results obtained in the research paper "…☆13Dec 9, 2023Updated 2 years ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆58Nov 5, 2025Updated 6 months ago
- Code of paper "AdvReverb: AdvReverb: Rethinking the Stealthiness of Audio Adversarial Examples to Human Perception"☆20Nov 26, 2023Updated 2 years ago
- Data-enriching GAN for retrieving Representative Samples from aTrained Classifier☆14Sep 2, 2020Updated 5 years ago
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)☆19Oct 22, 2024Updated last year
- ICIAP2022 - Learning Semantics for Visual Place Recognition through Multi-Scale Attention☆16May 10, 2022Updated 3 years ago
- This repositorie es the code of the paper Optimizing Reusable Knowledge for Continual Learning via Metalearning.☆11Oct 12, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 5 months ago
- ☆14Jun 22, 2022Updated 3 years ago
- ☆17Mar 25, 2025Updated last year
- ☆12Dec 4, 2024Updated last year
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning☆115Dec 24, 2025Updated 4 months ago
- 存放我的“信息内容安全”实验作业代码☆11May 11, 2019Updated 6 years ago
- ☆49Apr 20, 2026Updated 2 weeks ago
- ☆21Sep 17, 2024Updated last year
- Code for Unsupervised Multi-Target Domain Adaptation: An Information Theoretic Approach☆14Jul 19, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)☆13Sep 2, 2024Updated last year
- ☆44Apr 27, 2026Updated last week
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- Awesome list of Mixture-of-Experts (MoE)☆28Jun 11, 2024Updated last year
- XXE - VULNSPY PHP AUDIT☆18Oct 15, 2018Updated 7 years ago
- This repository presents FSD dataset for song deepfake detection.☆25Aug 18, 2025Updated 8 months ago
- ☆55Apr 13, 2026Updated 3 weeks ago