WangJingyao07 / Embodied-AI-Papers-with-CodeLinks
🎉🎨 This repository contains a reading list of papers on Embodied AI, including LLM/MLLM/VLA.
☆13Updated 5 months ago
Alternatives and similar repositories for Embodied-AI-Papers-with-Code
Users that are interested in Embodied-AI-Papers-with-Code are comparing it to the libraries listed below
Sorting:
- A Survey of Direct Preference Optimization (DPO)☆91Updated 7 months ago
- [NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in LLM-based Agents?"☆150Updated 9 months ago
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆67Updated 11 months ago
- ☆136Updated last year
- Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is…☆46Updated 2 years ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆86Updated this week
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆187Updated 2 years ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆266Updated last year
- ArxivFlow - Periodic Track on arXiv Paper☆50Updated 5 months ago
- 🎉🎨 Papers, CODE, Datasets for Meta-Learning and Meta-Reinforcement-Learning☆54Updated last year
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆271Updated 4 months ago
- ☆219Updated 6 months ago
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆15Updated last year
- Open Platform for Embodied Agents☆339Updated last year
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆441Updated 3 weeks ago
- ☆114Updated 3 weeks ago
- Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...☆79Updated 9 months ago
- ☆43Updated last year
- [ICLR 2025] A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆90Updated last week
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆840Updated 8 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆41Updated 9 months ago
- llm & rl☆271Updated 3 months ago
- ICML2025: Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning☆51Updated 9 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆85Updated 3 weeks ago
- ☆493Updated 4 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆79Updated last year
- ☆118Updated 10 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆147Updated 10 months ago
- ☆193Updated 3 months ago
- ☆130Updated 4 months ago