Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"
☆65Feb 18, 2026Updated 3 months ago
Alternatives and similar repositories for Simia-Agent-Training
Users that are interested in Simia-Agent-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- ☆47May 9, 2024Updated 2 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- Analysis of evidential models☆15Jun 22, 2023Updated 2 years ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆34Feb 19, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆23Jan 24, 2026Updated 4 months ago
- ☆20Nov 5, 2024Updated last year
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s…☆15Nov 6, 2024Updated last year
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆49Apr 23, 2026Updated last month
- Synthesizes efficient Z3 strategies tailored to your problem set! Repo for the IJCAI'24 paper: Layered and Staged Monte Carlo Tree Search…☆25May 27, 2026Updated 2 weeks ago
- ☆15May 17, 2022Updated 4 years ago
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…☆54Jan 5, 2026Updated 5 months ago
- This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi…☆42Nov 9, 2025Updated 7 months ago
- A curated list of awesome Neuro-Symbolic AI frameworks, libraries, software, papers, and videos.☆15Nov 1, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆60Dec 23, 2025Updated 5 months ago
- ☆17Mar 20, 2025Updated last year
- ☆90Sep 11, 2024Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 7 months ago
- [KDD 2025] AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation☆34Nov 18, 2025Updated 6 months ago
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…☆11Apr 4, 2026Updated 2 months ago
- A Beginner's Optimization Guide to RDMA, Based on Verbs and RDMA-CM, for High-Performance Computing and Disaggregated Memory Systems☆15Sep 30, 2024Updated last year
- [IROS 2025] ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis☆25May 17, 2025Updated last year
- Human-in-the-loop Online Rejection Sampling for Robotic Manipulation☆25Nov 3, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆12Jun 19, 2025Updated 11 months ago
- Published version of composing programs textbook☆15Mar 8, 2014Updated 12 years ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆32Jun 16, 2024Updated last year
- ☆122Mar 31, 2026Updated 2 months ago
- VLS: Steering Pretrained Robot Policies via Vision–Language Models☆62Mar 29, 2026Updated 2 months ago
- Efficiently apply modification functions to RLDS/TFDS datasets.☆32Jun 19, 2024Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆65Dec 10, 2025Updated 6 months ago
- ☆12Apr 22, 2025Updated last year
- ☆37Feb 11, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Feb 1, 2023Updated 3 years ago
- support BM25+vecetor☆27May 26, 2025Updated last year
- ReKep Experiment on UR5 based on kinova arm☆14Apr 25, 2025Updated last year
- ☆16Feb 26, 2024Updated 2 years ago
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- Official implementation of "AIR: Analytic Imbalance Rectifier for Continual Learning"☆20Jan 7, 2025Updated last year
- ☆18Oct 12, 2022Updated 3 years ago