GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators
☆47Dec 23, 2025Updated 2 months ago
Alternatives and similar repositories for GenEnv
Users that are interested in GenEnv are comparing it to the libraries listed below
Sorting:
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆19Oct 22, 2025Updated 4 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- Official Code Release for "Training a Generally Curious Agent"☆45May 18, 2025Updated 9 months ago
- ☆33Oct 31, 2024Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆34Aug 15, 2024Updated last year
- 🧌 Live2d models for cnblog themes.☆11Apr 3, 2022Updated 3 years ago
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- Source code for the Information Sciences paper "Rumor Detection on Social Media through Mining the Social Circles with High Homogeneity"☆20Jun 10, 2023Updated 2 years ago
- ☆11Mar 11, 2024Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆48Feb 2, 2026Updated last month
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"☆41Aug 9, 2022Updated 3 years ago
- grpo to train long form QA and instructions with long-form reward model☆17Jul 17, 2025Updated 7 months ago
- Source code for SWIFT, an efficient reward model.☆18Jan 13, 2026Updated last month
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Software that runs reinout.vanrees.org☆20Feb 23, 2026Updated last week
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- [Advanced Photonics Research, 2021] Control tightly focused fields via manipulating pupil functions☆10Dec 25, 2024Updated last year
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated last month
- ☆16Jun 25, 2025Updated 8 months ago
- ☆11Nov 8, 2023Updated 2 years ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆73May 25, 2025Updated 9 months ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- [CVPR 2022] Code for the paper "Quantization-aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging".☆16Oct 6, 2022Updated 3 years ago
- TC3DGS: Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes☆14Dec 15, 2024Updated last year
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 7 months ago
- ☆10Jun 19, 2024Updated last year
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- Greedy Adaptive Dictionary (GAD) is a learning algorithm that sets out to find sparse atoms for speech signals.☆11Oct 1, 2018Updated 7 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- Convert datasets from Hugging Face to FiftyOne for Visualization☆11Mar 15, 2024Updated last year
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated 3 weeks ago
- What Would Portland Do? Generative agent experience☆13Mar 13, 2024Updated last year
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- ☆11Jan 3, 2023Updated 3 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- ☆15Jan 25, 2025Updated last year
- The A2C Reinforcement Learning Algorithm in Pytorch☆16May 13, 2024Updated last year