GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators
☆50Dec 23, 2025Updated 3 months ago
Alternatives and similar repositories for GenEnv
Users that are interested in GenEnv are comparing it to the libraries listed below
Sorting:
- Aligning Agentic World Models via Knowledgeable Experience Learning☆32Jan 25, 2026Updated last month
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆24Aug 25, 2025Updated 6 months ago
- Source code for SWIFT, an efficient reward model.☆19Jan 13, 2026Updated 2 months ago
- ☆34Oct 31, 2024Updated last year
- ☆11Mar 31, 2023Updated 2 years ago
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆25Aug 24, 2025Updated 6 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆22Jun 17, 2025Updated 9 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆49Feb 2, 2026Updated last month
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆52Oct 14, 2025Updated 5 months ago
- ☆19May 3, 2025Updated 10 months ago
- The source code of Mem-Gallery: Benchmarking Multimodal Long-Term Conversational Memory for MLLM Agents.☆37Jan 31, 2026Updated last month
- [ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow☆36Oct 3, 2025Updated 5 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆76May 25, 2025Updated 9 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆26Mar 9, 2026Updated 2 weeks ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆24Jul 31, 2025Updated 7 months ago
- Some Pwn Challenges from winesap.☆14Aug 15, 2019Updated 6 years ago
- Official implementation of AAAI'22 paper "ProtGNN: Towards Self-Explaining Graph Neural Networks"☆50Oct 25, 2022Updated 3 years ago
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 8 months ago
- An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)☆11Jan 22, 2024Updated 2 years ago
- Source code for the Information Sciences paper "Rumor Detection on Social Media through Mining the Social Circles with High Homogeneity"☆20Jun 10, 2023Updated 2 years ago
- Short RL☆18May 26, 2025Updated 9 months ago
- A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)☆10May 8, 2025Updated 10 months ago
- 清华大学人工智能导论(龙明盛老师)课程课件,作业以及试题☆15Jun 26, 2023Updated 2 years ago
- Forecasting sea surface temperatures of Pacific Ocean using ARIMA model in Python.☆14Jul 21, 2021Updated 4 years ago
- Used for thinking process intervention of reasoning models such as DeepSeek-R1, effectively controlling the reasoning thinking process. 用…☆24Apr 14, 2025Updated 11 months ago
- Competitive Programming Code Template☆11Nov 6, 2022Updated 3 years ago
- This is the code of paper: Robust Mid-Pass Filtering Graph Convolutional Networks.(paper accepted by WWW2023)☆13Feb 17, 2023Updated 3 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆20May 15, 2025Updated 10 months ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆34Feb 9, 2026Updated last month
- Temporal-Dynamics Aware Adversarial Attacks on Discrete Time Dynamic Graph Models☆17Oct 19, 2024Updated last year
- HeartBench is an evaluation benchmark for the psychological and social sciences field, designed to transcend traditional knowledge and re…☆41Jan 7, 2026Updated 2 months ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- ☆16Jun 25, 2025Updated 8 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆55Updated this week
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago