Scalable and extensible reinforcement learning for LM agents.
☆111Mar 12, 2026Updated last week
Alternatives and similar repositories for AgentFly
Users that are interested in AgentFly are comparing it to the libraries listed below
Sorting:
- ☆18Apr 30, 2024Updated last year
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 5 months ago
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆41Jan 28, 2026Updated last month
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Feb 26, 2026Updated 3 weeks ago
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 7 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 7 months ago
- source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 2 years ago
- Code for “SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation(ICLR 2025)”☆25Oct 23, 2025Updated 4 months ago
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 6 months ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆12Dec 5, 2023Updated 2 years ago
- Code for "On Measuring Faithfulness of Natural Language Explanations"☆21Jul 23, 2024Updated last year
- A simple visual test-time scaling method for GUI agent grounding☆21Dec 7, 2025Updated 3 months ago
- 2025年深圳大学办公区校园网新版登录脚本。2025 Shenzhen University Office Area Campus Network New Version Login Script☆10Jan 17, 2025Updated last year
- ☆19Jun 21, 2025Updated 9 months ago
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆31Nov 19, 2025Updated 4 months ago
- ☆16Jul 19, 2024Updated last year
- Code for Research Project TLDR☆25Jul 28, 2025Updated 7 months ago
- Self-Supervised Dataset Distillation for Transfer Learning☆17Apr 10, 2024Updated last year
- Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…☆27Mar 9, 2026Updated last week
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated 2 years ago
- Enterprise AI Security Platform - Real-time firewall protection for LLM applications against prompt injection, data leakage, and function…☆23Sep 14, 2025Updated 6 months ago
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 2 months ago
- FLOPS counter for all your GPU benchmarking needs☆13Aug 8, 2024Updated last year
- An CUDA-based library for computed tomography (CT) reconstruction with differentiable operators.☆17Mar 10, 2026Updated last week
- ☆22May 5, 2025Updated 10 months ago
- ☆14Feb 12, 2023Updated 3 years ago
- Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC☆11Mar 1, 2021Updated 5 years ago
- A mobile application using Chinese OCR☆17Oct 7, 2015Updated 10 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018☆15Nov 17, 2019Updated 6 years ago
- ☆17Aug 8, 2023Updated 2 years ago
- ☆49Feb 25, 2026Updated 3 weeks ago
- [ACL 2023] S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering☆20Jun 8, 2025Updated 9 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆28Jul 7, 2025Updated 8 months ago
- ☆18Nov 20, 2024Updated last year
- An awesome repository for knowledge-enhanced natural language understanding resources, including related papers, codes and datasets.☆18Sep 21, 2022Updated 3 years ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆18Feb 28, 2025Updated last year
- Official Implementation of wd1☆24Sep 25, 2025Updated 5 months ago
- An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma…☆24Feb 25, 2026Updated 3 weeks ago