Agentic Learning Powered by AWorld
☆113Jun 18, 2026Updated 2 weeks ago
Alternatives and similar repositories for AWorld-RL
Users that are interested in AWorld-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository of the paper "BalanceSFT: Improving LLM Function Calling with Balanced Training Signals and Data Hardness…☆56Nov 24, 2025Updated 7 months ago
- ☆12Sep 1, 2023Updated 2 years ago
- ☆14Apr 16, 2024Updated 2 years ago
- 演示 vllm 对中文大语言模型的神奇效果☆31Nov 4, 2023Updated 2 years ago
- [ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆114Mar 21, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 11 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆46Jun 24, 2025Updated last year
- The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".☆166Feb 12, 2026Updated 4 months ago
- ☆19Jan 3, 2025Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆10Apr 18, 2023Updated 3 years ago
- ☆41May 26, 2026Updated last month
- Application for detecting command and control (C2) communication through network traffic analysis.☆17May 12, 2023Updated 3 years ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…☆31Oct 10, 2025Updated 8 months ago
- Tencent Hunyuan 7B (short as Hunyuan-7B) is one of the large language dense models of Tencent Hunyuan☆72Aug 11, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Dive-into-LLMs Tutorial for Beginners☆26May 14, 2024Updated 2 years ago
- Source code for the paper "LongGenBench: Long-context Generation Benchmark"☆24Oct 8, 2024Updated last year
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework☆305Jan 17, 2026Updated 5 months ago
- 基于谷歌大规模网页去重simhash算法,对海量文章(长文本)进行去重。☆11Dec 8, 2022Updated 3 years ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- A heuristic, python-based detector for fast-flux botnets.☆13Feb 24, 2012Updated 14 years ago
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆16Sep 20, 2025Updated 9 months ago
- [ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"☆21Jun 19, 2025Updated last year
- Official repo of BesiegeField, an interactive and real-time environment for machine construction and simulation (arXiv:2510.14980).☆61Dec 9, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆30Nov 18, 2022Updated 3 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.☆138Feb 10, 2026Updated 4 months ago
- Interactive visualization of the Gremlin graph database with D3.js☆12Nov 7, 2019Updated 6 years ago
- 清华大学人工智能导论(龙明盛老师)课程课件,作业以及试题☆19Jun 26, 2023Updated 3 years ago
- ☆16Oct 6, 2024Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆84Dec 8, 2025Updated 6 months ago
- ☆23Jan 6, 2025Updated last year
- ☆132Sep 9, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆12May 30, 2022Updated 4 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- This repo consists all my RL work and learnings☆12Dec 5, 2021Updated 4 years ago
- Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"☆17Jul 1, 2024Updated 2 years ago
- CORE-ReID: Comprehensive Optimization and Refinement through Ensemble fusion in Domain Adaptation for person re-identification☆16May 7, 2025Updated last year
- 🔬 A collection for those AI (RL / DL / SL / Evoluation / Genetic Algorithm) used in financial market. otherwise, we add Technology Analy…☆13Mar 17, 2024Updated 2 years ago
- This repo is for LinkedIn Learning course: Generative AI and LLMOps: Deploying & Managing LLMs in Production☆13Aug 12, 2024Updated last year