Agent-One-Lab/AgentFly

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Agent-One-Lab/AgentFly)

Agent-One-Lab / AgentFly

Scalable and extensible reinforcement learning for LM agents.

☆121

Alternatives and similar repositories for AgentFly

Users that are interested in AgentFly are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenquanlu / huginn-latent-cot
View on GitHub
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆19Oct 4, 2025Updated 9 months ago
autonomousvision / mdpo
View on GitHub
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
☆45Jan 28, 2026Updated 5 months ago
wangyu-ustc / LVChat
View on GitHub
The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**
☆14Apr 15, 2024Updated 2 years ago
OpenMLRL / CoMLRL
View on GitHub
Open-Source Library for Fully Cooperative Multi-LLM Reinforcement Learning
☆91Updated this week
zhangxy-2019 / critique-GRPO
View on GitHub
[ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
☆70Jun 3, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
THUDM / AgentRL
View on GitHub
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
☆309Jan 17, 2026Updated 5 months ago
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated 11 months ago
VPeterV / RankSpace-Models
View on GitHub
source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"
☆10Sep 26, 2022Updated 3 years ago
LEL-A / GerAlpacaDataCleaned
View on GitHub
German Alpaca Dataset (Cleaned + Translated)
☆26Apr 6, 2023Updated 3 years ago
agno-agi / ai-app
View on GitHub
☆12May 23, 2024Updated 2 years ago
kreasof-ai / OpenFormer
View on GitHub
A hackable library for running and fine-tuning modern transformer models on commodity and alternative GPUs, powered by tinygrad.
☆30Feb 10, 2026Updated 4 months ago
moured / RefChartQA
View on GitHub
Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning
☆14Jul 9, 2025Updated last year
iLearn-Lab / CVPR26-HiconAgent
View on GitHub
[CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents
☆30Mar 9, 2026Updated 4 months ago
Reason-Wang / AutoLearn-GPT
View on GitHub
ChatGPT learns automatically.
☆25May 5, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
homles11 / SaLoRA
View on GitHub
Code for “SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation(ICLR 2025)”
☆28Oct 23, 2025Updated 8 months ago
UKPLab / arxiv2025-inherent-limits-plms
View on GitHub
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…
☆14Jan 16, 2025Updated last year
Video-Outpainting / VideoOutpainting
View on GitHub
☆30Apr 19, 2022Updated 4 years ago
ZuyiZhou / Awesome-Cross-modal-Reasoning-with-LLMs
View on GitHub
☆16Oct 21, 2024Updated last year
vfleaking / PTST
View on GitHub
Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"
☆22Sep 21, 2025Updated 9 months ago
didiforgithub / SwarmAgent
View on GitHub
🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…
☆13Dec 5, 2023Updated 2 years ago
tiangeluo / RegionFocus
View on GitHub
A simple visual test-time scaling method for GUI agent grounding
☆26Dec 7, 2025Updated 7 months ago
Heidelberg-NLP / CC-SHAP
View on GitHub
Code for "On Measuring Faithfulness of Natural Language Explanations"
☆23Jul 23, 2024Updated last year
Lslland / T-Vaccine
View on GitHub
☆19Jun 21, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
nex-agi / NexGAP
View on GitHub
Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.
☆36Nov 19, 2025Updated 7 months ago
amazon-science / tree-of-traversals
View on GitHub
☆16Jul 19, 2024Updated last year
zzli2022 / TLDR
View on GitHub
Code for Research Project TLDR
☆25Jul 28, 2025Updated 11 months ago
sebzhao / CodingGenie
View on GitHub
Implementation of [CodingGenie: A Proactive LLM-Powered Programming Assistant]
☆13Jan 14, 2025Updated last year
dengyang17 / PACIFIC
View on GitHub
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
☆14May 15, 2024Updated 2 years ago
sfasfaffa / DLPO
View on GitHub
Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…
☆10Dec 25, 2025Updated 6 months ago
Reason-Wang / NAT
View on GitHub
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆28Mar 14, 2024Updated 2 years ago
mitkotak / fast_flops
View on GitHub
FLOPS counter for all your GPU benchmarking needs
☆13Aug 8, 2024Updated last year
priyankjaini / discFlowMH
View on GitHub
Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC
☆11Mar 1, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
lfy79001 / S3HQA
View on GitHub
[ACL 2023] S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering
☆20Jun 8, 2025Updated last year
yjyddq / EOSER-ASS-RL
View on GitHub
Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…
☆28Mar 9, 2026Updated 4 months ago
xcltql666 / DenseDiT
View on GitHub
Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"
☆27Jun 7, 2026Updated last month
starrYYxuan / UniTE
View on GitHub
☆17Nov 20, 2024Updated last year
he-y / Multisize-Dataset-Condensation
View on GitHub
Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)
☆16Apr 18, 2024Updated 2 years ago
Ceaglex / LoVA
View on GitHub
The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc…
☆16Feb 27, 2025Updated last year
inclusionAI / M2-Reasoning
View on GitHub
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
☆47Jul 17, 2025Updated 11 months ago