Scalable and extensible reinforcement learning for LM agents.
☆119May 6, 2026Updated last month
Alternatives and similar repositories for AgentFly
Users who are interested in AgentFly are comparing it to the libraries listed below.
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆44Jan 28, 2026Updated 4 months ago
- [WWW 2025 Oral] Large Language Models Empowered Personalized Web Agents.☆22Nov 11, 2025Updated 7 months ago
- [ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback☆69Jun 3, 2026Updated 2 weeks ago
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework☆297Jan 17, 2026Updated 5 months ago
- Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation.☆15Jul 21, 2025Updated 10 months ago
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 10 months ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆23Feb 26, 2026Updated 3 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 10 months ago
- source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- ☆13May 23, 2024Updated 2 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 11 months ago
- Code for “SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation” (ICLR 2025)☆28Oct 23, 2025Updated 7 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆14Jan 16, 2025Updated last year
- ☆15Oct 21, 2024Updated last year
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 8 months ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆13Dec 5, 2023Updated 2 years ago
- A simple visual test-time scaling method for GUI agent grounding☆26Dec 7, 2025Updated 6 months ago
- ☆21Nov 18, 2024Updated last year
- Code for "On Measuring Faithfulness of Natural Language Explanations"☆23Jul 23, 2024Updated last year
- ☆19Jun 21, 2025Updated 11 months ago
- ☆10Apr 5, 2023Updated 3 years ago
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆37Nov 19, 2025Updated 7 months ago
- ☆16Jul 19, 2024Updated last year
- Code for Research Project TLDR☆25Jul 28, 2025Updated 10 months ago
- Implementation of [CodingGenie: A Proactive LLM-Powered Programming Assistant]☆13Jan 14, 2025Updated last year
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance☆14May 15, 2024Updated 2 years ago
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 5 months ago
- FLOPS counter for all your GPU benchmarking needs☆13Aug 8, 2024Updated last year
- A CUDA-based library for computed tomography (CT) reconstruction with differentiable operators.☆23May 21, 2026Updated 3 weeks ago
- ☆22May 5, 2025Updated last year
- ☆14Feb 12, 2023Updated 3 years ago
- Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC☆11Mar 1, 2021Updated 5 years ago
- ☆54Feb 25, 2026Updated 3 months ago
- Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…☆28Mar 9, 2026Updated 3 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆27Jun 7, 2026Updated last week
- Converts the Dashscope API for Alibaba's open-source models to the OpenAI format☆13Dec 7, 2023Updated 2 years ago
- ☆17Nov 20, 2024Updated last year
- Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)☆16Apr 18, 2024Updated 2 years ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆48Jul 17, 2025Updated 11 months ago