Scalable and extensible reinforcement learning for LM agents.
☆114 · Apr 18, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for AgentFly
Users that are interested in AgentFly are comparing it to the libraries listed below.
- ☆19 · Apr 30, 2024 · Updated 2 years ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-… ☆18 · Oct 4, 2025 · Updated 7 months ago
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models ☆43 · Jan 28, 2026 · Updated 3 months ago
- The official implementation of the paper **LVChat: Facilitating Long Video Comprehension** ☆14 · Apr 15, 2024 · Updated 2 years ago
- [WWW 2025 Oral] Large Language Models Empowered Personalized Web Agents. ☆21 · Nov 11, 2025 · Updated 5 months ago
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework ☆279 · Jan 17, 2026 · Updated 3 months ago
- Complete Reinforcement Learning Toolkit for Large Language Models! ☆21 · Aug 2, 2025 · Updated 9 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin… ☆14 · Jul 27, 2025 · Updated 9 months ago
- German Alpaca Dataset (Cleaned + Translated) ☆26 · Apr 6, 2023 · Updated 3 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation". ☆10 · Apr 30, 2023 · Updated 3 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning ☆14 · Jul 9, 2025 · Updated 9 months ago
- [CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents ☆28 · Mar 9, 2026 · Updated last month
- ChatGPT learns automatically. ☆25 · May 5, 2023 · Updated 2 years ago
- Code for “SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation (ICLR 2025)” ☆27 · Oct 23, 2025 · Updated 6 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le… ☆14 · Jan 16, 2025 · Updated last year
- ☆14 · Oct 21, 2024 · Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆13 · Dec 5, 2023 · Updated 2 years ago
- A simple visual test-time scaling method for GUI agent grounding ☆24 · Dec 7, 2025 · Updated 4 months ago
- Code for "On Measuring Faithfulness of Natural Language Explanations" ☆22 · Jul 23, 2024 · Updated last year
- 2025年深圳大学办公区校园网新版登录脚本。2025 Shenzhen University Office Area Campus Network New Version Login Script ☆11 · Jan 17, 2025 · Updated last year
- ☆19 · Jun 21, 2025 · Updated 10 months ago
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data. ☆33 · Nov 19, 2025 · Updated 5 months ago
- ☆16 · Jul 19, 2024 · Updated last year
- Code for Research Project TLDR ☆25 · Jul 28, 2025 · Updated 9 months ago
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance ☆14 · May 15, 2024 · Updated last year
- use python to control tello drone ☆11 · May 31, 2021 · Updated 4 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...) ☆17 · Jul 7, 2020 · Updated 5 years ago
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le… ☆10 · Dec 25, 2025 · Updated 4 months ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M… ☆28 · Mar 14, 2024 · Updated 2 years ago
- Enterprise AI Security Platform - Real-time firewall protection for LLM applications against prompt injection, data leakage, and function… ☆23 · Sep 14, 2025 · Updated 7 months ago
- Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC ☆11 · Mar 1, 2021 · Updated 5 years ago
- ☆52 · Feb 25, 2026 · Updated 2 months ago
- [ACL 2023] S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering ☆20 · Jun 8, 2025 · Updated 10 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios" ☆27 · Jul 7, 2025 · Updated 9 months ago
- ☆17 · Nov 20, 2024 · Updated last year
- PyTorch implementation of Supercombo, an end-to-end model for Level 2 autonomous driving on a single device (OpenPilot) ☆13 · Jun 27, 2022 · Updated 3 years ago
- The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc… ☆15 · Feb 27, 2025 · Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆47 · Jul 17, 2025 · Updated 9 months ago
- Official Implementation of wd1 ☆29 · Sep 25, 2025 · Updated 7 months ago