PKU-RL / LLaMA-RiderLinks
☆29Updated last year
Alternatives and similar repositories for LLaMA-Rider
Users that are interested in LLaMA-Rider are comparing it to the libraries listed below
Sorting:
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆62Updated 10 months ago
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆117Updated 11 months ago
- ☆44Updated last year
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆33Updated last year
- ☆66Updated 2 years ago
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆22Updated last year
- Official implementation for "OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities" (keep updating)☆59Updated last year
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆125Updated 8 months ago
- GPT-4V in Wonderland: LMMs as Smartphone Agents☆134Updated last year
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆45Updated 7 months ago
- A Production Tool for Embodied AI☆29Updated last year
- ☆37Updated 8 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- ☆92Updated last year
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆214Updated last month
- [ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.☆290Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR 2024 Spotlight)☆66Updated last year
- ☆36Updated 11 months ago
- ☆122Updated last year
- ☆19Updated last year
- Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆90Updated 2 years ago
- ☆35Updated 2 years ago
- A simulation of world using GPTs. (depreciated)☆157Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆28Updated 7 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 6 months ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆43Updated 8 months ago
- ☆91Updated last year
- The next generation of Multi-Modal Multi-Agent platform.☆104Updated 2 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆40Updated last year