Qwen3-14B Orchestrator Agent Reinforcement Learning. **Achieved 160% improvement** on Stanford's TerminalBench
☆99 · Nov 3, 2025 · Updated 6 months ago
Alternatives and similar repositories for Orca-Agent-RL
Users that are interested in Orca-Agent-RL are comparing it to the libraries listed below.
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo… ☆43 · Mar 11, 2026 · Updated last month
- The training codes of Jasper-Token-Compression-600M ☆19 · Nov 19, 2025 · Updated 5 months ago
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288 ☆20 · Oct 30, 2024 · Updated last year
- ☆13 · Mar 5, 2025 · Updated last year
- Tree-based speculative decoding for Apple Silicon (MLX). ~10-15% faster than DFlash on code, ~1.5x over autoregressive. First MLX port wi… ☆133 · Apr 15, 2026 · Updated 3 weeks ago
- Minimal Claude Code alternative powered by MLX ☆46 · Jan 11, 2026 · Updated 3 months ago
- ☆12 · Mar 3, 2023 · Updated 3 years ago
- ☆20 · May 14, 2025 · Updated 11 months ago
- tuimorphic choose-your-own-adventure story game ☆19 · Apr 30, 2026 · Updated last week
- Local lightning-fast semantic code search built for agents ☆41 · Mar 16, 2026 · Updated last month
- Data for the MTEB leaderboard ☆53 · Updated this week
- ☆137 · Mar 31, 2026 · Updated last month
- lol ☆10 · Mar 12, 2021 · Updated 5 years ago
- LockManager with deadlock detection for implementing 2PL ☆13 · Mar 13, 2019 · Updated 7 years ago
- A macOS .mobileconfig generator for installing fonts on iOS devices. ☆12 · Jun 24, 2020 · Updated 5 years ago
- ☆22 · Sep 29, 2025 · Updated 7 months ago
- ☆38 · Oct 10, 2025 · Updated 7 months ago
- An opinionated MCP module for NestJS ☆11 · Apr 10, 2026 · Updated 3 weeks ago
- ☆28 · Oct 30, 2025 · Updated 6 months ago
- Minimal example of MCP for parsing llms.txt ☆39 · Apr 8, 2025 · Updated last year
- Claude Code Agent Monitoring & Observability on VSCode ☆78 · Updated this week
- BERT score for text generation ☆12 · Jan 15, 2025 · Updated last year
- A podcast RSS feed that aggregates AI-related shows ☆21 · Updated this week
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily ☆10 · Updated this week
- Code release for "TempLM: Distilling Language Models into Template-Based Generators" ☆14 · Jul 21, 2022 · Updated 3 years ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Feb 6, 2023 · Updated 3 years ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆101 · Feb 26, 2026 · Updated 2 months ago
- Testing DeepSpeed integration in 🤗 Accelerate ☆11 · Jun 28, 2022 · Updated 3 years ago
- AI Scientist by Chicago Human+AI Lab ☆125 · Apr 27, 2026 · Updated last week
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis ☆11 · Feb 17, 2023 · Updated 3 years ago
- Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation". ☆14 · Apr 6, 2022 · Updated 4 years ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3" ☆13 · Mar 26, 2024 · Updated 2 years ago
- The official public dataset for Famelack. ☆42 · Updated this week
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting ☆18 · Nov 28, 2025 · Updated 5 months ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method. ☆11 · Mar 10, 2019 · Updated 7 years ago
- 💬 A curated list of incredible amount of publications related to Dialogue Systems especially Chatbots and Chit-chat Systems ☆10 · Dec 5, 2019 · Updated 6 years ago
- Exposure-slot: Exposure-centric representations learning with Slot-in-Slot Attention for Region-aware Exposure Correction, Computer Visi… ☆23 · Sep 2, 2025 · Updated 8 months ago
- MULTIOPED: A Corpus of Multi-Perspective News Editorials. ☆12 · Aug 25, 2021 · Updated 4 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation ☆11 · Mar 7, 2023 · Updated 3 years ago