Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"
☆60Feb 18, 2026Updated last month
Alternatives and similar repositories for Simia-Agent-Training
Users that are interested in Simia-Agent-Training are comparing it to the libraries listed below
Sorting:
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated last year
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆22Jan 24, 2026Updated last month
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆40Updated this week
- macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor☆15Nov 30, 2023Updated 2 years ago
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s…☆16Nov 6, 2024Updated last year
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- ☆15May 17, 2022Updated 3 years ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆31Mar 5, 2026Updated 2 weeks ago
- This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi…☆39Nov 9, 2025Updated 4 months ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16May 11, 2022Updated 3 years ago
- Official Code Repository for the paper "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆15Jul 21, 2024Updated last year
- [IROS 2025] ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis☆20May 17, 2025Updated 10 months ago
- ☆84Sep 11, 2024Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- Pytorch implementation of “MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures” (NeurIPS 2020 spotlight)☆13Jul 22, 2021Updated 4 years ago
- 河海大学每日健康打卡☆12Dec 4, 2021Updated 4 years ago
- Human-in-the-loop Online Rejection Sampling for Robotic Manipulation☆26Nov 3, 2025Updated 4 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆32Jun 16, 2024Updated last year
- ☆21Dec 22, 2024Updated last year
- Efficiently apply modification functions to RLDS/TFDS datasets.☆29Jun 19, 2024Updated last year
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- 用Kinova Gen3实机实现Rekep☆11Mar 18, 2025Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆64Dec 10, 2025Updated 3 months ago
- ☆36Feb 11, 2025Updated last year
- Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers.☆87Updated this week
- The code for the paper 'DetIE: Multilingual Open Information Extraction Inspired by Object Detection' by Vasilkovsky et al.☆20Sep 1, 2022Updated 3 years ago
- ☆13Apr 22, 2025Updated 10 months ago
- ReKep Experiment on UR5 based on kinova arm☆13Apr 25, 2025Updated 10 months ago
- ☆35Feb 15, 2026Updated last month
- support BM25+vecetor☆28May 26, 2025Updated 9 months ago
- ☆16Feb 26, 2024Updated 2 years ago
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆61Updated this week
- Official implementation of "AIR: Analytic Imbalance Rectifier for Continual Learning"☆20Jan 7, 2025Updated last year
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated last year
- ☆18Oct 12, 2022Updated 3 years ago
- The official implementation of ManiAgent☆25Jan 4, 2026Updated 2 months ago
- A database with automatic dynamic imputation of missing values.☆11Nov 2, 2017Updated 8 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- ☆21Jul 22, 2025Updated 7 months ago