Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"
☆62Feb 18, 2026Updated last month
Alternatives and similar repositories for Simia-Agent-Training
Users that are interested in Simia-Agent-Training are comparing it to the libraries listed below.
- a survey on deep research☆49Sep 9, 2025Updated 7 months ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆35Feb 19, 2025Updated last year
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆23Jan 24, 2026Updated 2 months ago
- ☆19Nov 5, 2024Updated last year
- ☆50Mar 10, 2026Updated last month
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s…☆15Nov 6, 2024Updated last year
- VLS: Steering Pretrained Robot Policies via Vision–Language Models☆46Mar 29, 2026Updated last week
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi…☆39Nov 9, 2025Updated 5 months ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16May 11, 2022Updated 3 years ago
- Official Code Repository for the paper "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆16Jul 21, 2024Updated last year
- ☆16Mar 20, 2025Updated last year
- ☆88Sep 11, 2024Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 5 months ago
- Pytorch implementation of “MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures” (NeurIPS 2020 spotlight)☆13Jul 22, 2021Updated 4 years ago
- [KDD 2025] AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation☆33Nov 18, 2025Updated 4 months ago
- Hohai University daily health check-in☆12Dec 4, 2021Updated 4 years ago
- [IROS 2025] ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis☆23May 17, 2025Updated 10 months ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆12Jun 19, 2025Updated 9 months ago
- ☆95Mar 31, 2026Updated last week
- Published version of composing programs textbook☆15Mar 8, 2014Updated 12 years ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆32Jun 16, 2024Updated last year
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆65Dec 10, 2025Updated 4 months ago
- The code for the paper 'DetIE: Multilingual Open Information Extraction Inspired by Object Detection' by Vasilkovsky et al.☆20Sep 1, 2022Updated 3 years ago
- ☆13Apr 22, 2025Updated 11 months ago
- ReKep Experiment on UR5 based on kinova arm☆14Apr 25, 2025Updated 11 months ago
- ☆36Feb 11, 2025Updated last year
- ☆16Feb 26, 2024Updated 2 years ago
- Official implementation of "AIR: Analytic Imbalance Rectifier for Continual Learning"☆20Jan 7, 2025Updated last year
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated last year
- A database with automatic dynamic imputation of missing values.☆11Nov 2, 2017Updated 8 years ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆759Sep 11, 2025Updated 6 months ago
- ☆11Oct 11, 2023Updated 2 years ago
- ☆18Apr 11, 2025Updated last year
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆14Nov 14, 2024Updated last year
- ☆21Jul 22, 2025Updated 8 months ago
- ☆12Aug 6, 2024Updated last year