TinyLoopX/RLLaVA

RLLaVA is a user-friendly framework for multi-modal RL research, optimized for resource-constrained teams.

☆58

Alternatives and similar repositories for RLLaVA

Users that are interested in RLLaVA are comparing it to the libraries listed below.

ZhangXJ199 / Bench-CoE
A Framework for Collaboration of Experts from Benchmark
☆13 · Apr 27, 2025 · Updated last year
winci-ai / resa
An official implementation of ReSA (ICML 2025)
☆27 · Nov 23, 2025 · Updated 8 months ago
shengcanxu / canoSpeech
text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
bhneo / decorrelated_bn
An implementation of DecorrelatedBN in TensorFlow
☆13 · Jun 30, 2022 · Updated 4 years ago
zgMin / SNSE-CoT
Official implementation for "Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling"
☆10 · May 21, 2024 · Updated 2 years ago
AdaCheng / VidEgoThink
The official code and data for paper "VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI"
☆18 · Mar 25, 2025 · Updated last year
MIPS-COLT / MER-MCE
This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.
☆25 · Aug 2, 2024 · Updated last year
ZhangXJ199 / TinyLLaVA-Video-R1
TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning
☆116 · Dec 24, 2025 · Updated 7 months ago
winci-ai / CW-RGP
An official implementation of CW-RGP (NeurIPS 2022, spotlight).
☆21 · Dec 9, 2022 · Updated 3 years ago
irfan112 / yowov3-multistreaming-inferencing
Real-time multi-stream inference with YOWOv3 (spatio-temporal action detection task) using the UCF101-24 dataset. The repo is extension …
☆26 · May 15, 2026 · Updated 2 months ago
suntea233 / DualLoRA
Implementation of ACL 2024 paper "Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation".
☆15 · Nov 9, 2024 · Updated last year
zgMin / IT-RER-ABSA
Official implementation for "Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment Analysis"
☆13 · Mar 23, 2026 · Updated 4 months ago
THU-MIG / PrefixKV
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation [NeurIPS 2025]
☆19 · Oct 11, 2025 · Updated 9 months ago
pedr0sorio / lefusion-slicer
3DSlicer plugin for inpainting lung nodules in 3D chest CT data.
☆11 · Dec 2, 2024 · Updated last year
starriver030515 / ChartVerse
☆19 · Feb 9, 2026 · Updated 5 months ago
ant-research / M2-Miner
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55 · Apr 22, 2026 · Updated 3 months ago
lostwolves / BEDI
Benchmark for Embodied Drone Intelligence
☆20 · Jan 20, 2026 · Updated 6 months ago
ZhangXJ199 / TinyLLaVA-Video
A Simple Framework of Small-scale LMMs for Video Understanding
☆114 · Jun 11, 2025 · Updated last year
ZhangYiqun018 / StickerConv
[ACL 2024]
☆60 · Jun 20, 2024 · Updated 2 years ago
Nik-V9 / scannetpp
Undistorted Depth Support for ScanNet++
☆17 · Dec 8, 2023 · Updated 2 years ago
SimformSolutionsPvtLtd / SS-ARTreasureHunt
✨SS-ARTreasureHunt - A Christmas-themed, augmented-reality treasure hunt app built using Unity and AR Foundation 🚀🔥💥
☆27 · Dec 22, 2023 · Updated 2 years ago
GigaAI-research / VLA-R1
☆74 · Jun 18, 2026 · Updated last month
Dinghow / UIM
The official PyTorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [TOMM 2025]
☆25 · Nov 24, 2025 · Updated 8 months ago
ZhangXJ199 / EDGE-GRPO
Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
☆22 · Aug 28, 2025 · Updated 10 months ago
KiMomota / Pano360
[CVPR 2026] Pano360: Perspective to Panoramic Vision with Geometric Consistency
☆19 · Updated this week
GML-MMGroup / SAGE
Official implementation of SAGE: a status-aware, execution-grounded planning framework that unifies temporal visual grounding, structured…
☆36 · Updated this week
forXuyx / Cinego
🚀 Lightweight video 🎥 large model 🤖
☆23 · Apr 27, 2025 · Updated last year
OpenGVLab / MMT-Bench
[ICML 2024] | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
☆119 · Apr 6, 2026 · Updated 3 months ago
xrenaf / MEMLENS
☆23 · Updated this week
xjtupanda / Sparrow
Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"
☆48 · Sep 3, 2025 · Updated 10 months ago
Xuekai-Zhu / key-configuration-of-llms
☆22 · Mar 18, 2024 · Updated 2 years ago
language-agent-tutorial / language-agent-tutorial.github.io
[EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks
☆10 · Nov 27, 2024 · Updated last year
Euphoria16 / UI-Genie
[NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
☆60 · Nov 27, 2025 · Updated 7 months ago
DeepLink-org / LightRFT
LightRFT (Light Reinforcement Fine-Tuning) is an advanced reinforcement learning fine-tuning framework designed for Large Language Models…
☆19 · Jan 12, 2026 · Updated 6 months ago
AgentR1 / Claw-R1
Claw-R1: Empowering OpenClaw with Advanced Agentic RL.
☆193 · Jun 9, 2026 · Updated last month
myhub / tf
归积, a new Transformer architecture
☆15 · Feb 1, 2026 · Updated 5 months ago
jefferyZhan / GThinker
[CVPR 2026] GThinker, Reasoning MLLM, Visual Cues, Visual Rethinking
☆18 · Mar 9, 2026 · Updated 4 months ago
ltttpku / CMMP
☆23 · Oct 21, 2024 · Updated last year
ynulihao / SP2000
Catalogue of Life toolkit for Python
☆11 · Aug 4, 2020 · Updated 5 years ago