☆34Nov 26, 2025Updated 3 months ago
Alternatives and similar repositories for SkyRL-OpenHands
Users that are interested in SkyRL-OpenHands are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,699Updated this week
- A challenging aggregation benchmark for long-context models☆41Feb 22, 2026Updated last month
- ☆63May 13, 2025Updated 10 months ago
- Instruction Following Eval☆16Jan 16, 2025Updated last year
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- ☆87Aug 16, 2025Updated 7 months ago
- ☆31Oct 2, 2024Updated last year
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- ☆10Jul 13, 2024Updated last year
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆240Aug 27, 2025Updated 6 months ago
- Agent computer interface for AI software engineer.☆121Mar 12, 2026Updated last week
- Research about dataflow architecture☆12Nov 30, 2023Updated 2 years ago
- Customized Inference Engine for Multiverse Models☆25Jun 27, 2025Updated 8 months ago
- ☆94Sep 10, 2025Updated 6 months ago
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆189Dec 25, 2025Updated 2 months ago
- ☆10Jan 28, 2024Updated 2 years ago
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 5 months ago
- Basic world models☆31Oct 30, 2025Updated 4 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Mar 7, 2024Updated 2 years ago
- Official Code for ICLR 2023 Paper: A Message Passing Perspective on Learning Dynamics of Contrastive Learning☆11Mar 9, 2023Updated 3 years ago
- Resources for the Enigmata Project.☆80Aug 13, 2025Updated 7 months ago
- Official implementation of EMNLP 2021 Paper "Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables"☆12May 15, 2023Updated 2 years ago
- Mirror of OpenMesh-Python☆14Feb 21, 2019Updated 7 years ago
- ☆10Mar 30, 2024Updated last year
- ☆78Nov 6, 2025Updated 4 months ago
- ☆13Mar 14, 2026Updated last week
- ☆12Aug 31, 2021Updated 4 years ago
- ☆12Feb 16, 2024Updated 2 years ago
- ☆11Updated this week
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆12Mar 5, 2025Updated last year
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools☆18Oct 17, 2025Updated 5 months ago
- Video encoding and muxing through libobs in Rust☆34Mar 15, 2026Updated last week
- SEED: Self-supervised Distillation for Visual Representation☆16Jul 20, 2022Updated 3 years ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 9 months ago
- ☆26Mar 10, 2026Updated last week
- A script that parses PowerView's output for GPO analysis. Integrated into bloodhound to find misconfigurations of URA, SMB signing etc☆15Feb 9, 2020Updated 6 years ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year