NovaSky-AI/SkyRL-OpenHands

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NovaSky-AI/SkyRL-OpenHands)

NovaSky-AI / SkyRL-OpenHands

☆36

Alternatives and similar repositories for SkyRL-OpenHands

Users that are interested in SkyRL-OpenHands are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,093Updated this week
lmarena / PPE
View on GitHub
☆65May 13, 2025Updated last year
SIMONLQY / RethinkMCTS
View on GitHub
☆34Oct 2, 2024Updated last year
Job-Bench / job-bench-eval
View on GitHub
Official eval scripts for JobBench
☆29Jul 18, 2026Updated last week
vicgalle / refined-dpo
View on GitHub
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
☆13Feb 13, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
gardener-attic / vpa-exporter
View on GitHub
[DEPRECATED] Prometheus exporter for VPA recommendations
☆12Aug 22, 2023Updated 2 years ago
uwsampl / paper-agents
View on GitHub
☆13Dec 9, 2024Updated last year
nabla-containers / nabla-containers.github.io
View on GitHub
Nabla Containers blog
☆12May 26, 2021Updated 5 years ago
R2E-Gym / R2E-Gym
View on GitHub
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
☆310Jul 13, 2025Updated last year
SWE-bench / SWE-smith
View on GitHub
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
☆711Updated this week
bigcode-project / bigcodebench-annotation
View on GitHub
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
☆26Aug 8, 2024Updated last year
Victorwz / LaViA
View on GitHub
☆10Jul 13, 2024Updated 2 years ago
dxlong2000 / SG-CQG
View on GitHub
[ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation
☆14Jul 11, 2023Updated 3 years ago
zhisbug / ray-scalable-ml-design
View on GitHub
Some microbenchmarks and design docs before commencement
☆11Feb 1, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
OpenHands / openhands-aci
View on GitHub
Agent computer interface for AI software engineer.
☆133Apr 16, 2026Updated 3 months ago
jpedro1992 / scheduler-plugins
View on GitHub
Repository for out-of-tree scheduler plugins based on scheduler framework.
☆12Jul 4, 2026Updated 3 weeks ago
tile-ai / tvm
View on GitHub
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆20Updated this week
MiroMindAI / MiroRL
View on GitHub
MiroRL is an MCP-first reinforcement learning framework for deep research agent.
☆246Aug 27, 2025Updated 10 months ago
frankroeder / goal_conditioned_rl
View on GitHub
Goal-conditioned reinforcement learning like 🔥
☆15Feb 3, 2024Updated 2 years ago
yyht / openrlhf_async_pipline
View on GitHub
☆90Aug 16, 2025Updated 11 months ago
huweim / dataflow_architecture
View on GitHub
Research about dataflow architecture
☆15Nov 30, 2023Updated 2 years ago
JiwooKimAR / dmath
View on GitHub
☆12Feb 16, 2024Updated 2 years ago
openshift / kubernetes-autoscaler
View on GitHub
Autoscaling components for Kubernetes
☆22Updated this week
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
PKU-ML / Message-Passing-Contrastive-Learning
View on GitHub
Official Code for ICLR 2023 Paper: A Message Passing Perspective on Learning Dynamics of Contrastive Learning
☆11Mar 9, 2023Updated 3 years ago
frankroeder / lanro-gym
View on GitHub
OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning
☆14Jan 27, 2026Updated 5 months ago
liuruoze / Raw-vs-Human-in-AlphaStar
View on GitHub
(TG'2023) Official code for the paper "Revisiting of AlphaStar" (previously called "Rethinking of AlphaStar"). It compares the raw interf…
☆10Sep 6, 2021Updated 4 years ago
polixir / d3pe
View on GitHub
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.
☆10Jun 2, 2022Updated 4 years ago
roger-creus / Wave-Defense-Learning-Environment
View on GitHub
A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.
☆14Jan 3, 2023Updated 3 years ago
idlab-discover / sfc-controller
View on GitHub
SFC controller: extension to the default scheduler (Kube-Scheduler) in Kubernetes to enable scheduling in terms of latency and bandwidth
☆18Jul 3, 2020Updated 6 years ago
Victorwz / zs-nmt-dae
View on GitHub
Official implementation of EMNLP 2021 Paper "Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables"
☆12May 15, 2023Updated 3 years ago
facebookresearch / swe-rl
View on GitHub
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆712Mar 16, 2025Updated last year
shirley-wu / daco
View on GitHub
[NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
☆14Mar 5, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
agentica-project / verl-pipeline
View on GitHub
Async pipelined version of Verl
☆124Apr 8, 2025Updated last year
abertsch72 / oolong
View on GitHub
A challenging aggregation benchmark for long-context models
☆52Feb 22, 2026Updated 5 months ago
xlang-ai / computer-agent-arena
View on GitHub
[ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agents
☆67Feb 26, 2026Updated 5 months ago
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
LLM360 / Reasoning360
View on GitHub
A repo for open research on building large reasoning models
☆151Jul 3, 2026Updated 3 weeks ago
alycialee / beyond-scale-language-data-diversity
View on GitHub
☆13Updated this week
DwanZhang-AI / SePPO
View on GitHub
Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."
☆18Oct 7, 2024Updated last year