benchflow-ai / skillsbenchView external linksLinks
SkillsBench evaluates how well skills work and how effective agents are at using them
☆303Updated this week
Alternatives and similar repositories for skillsbench
Users that are interested in skillsbench are comparing it to the libraries listed below
Sorting:
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 6 months ago
- ☆20Sep 29, 2023Updated 2 years ago
- PowerBiMIP is an open-source, efficient bilevel mixed-integer programming (BiMIP) solver, with a special focus on applications in power a…☆34Jan 31, 2026Updated 2 weeks ago
- ☆19Sep 22, 2025Updated 4 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Apr 17, 2025Updated 9 months ago
- ☆26Mar 11, 2023Updated 2 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- OpenVLA Lightweight Version(0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…☆15Jan 7, 2026Updated last month
- ☆119Jan 6, 2026Updated last month
- ☆82Mar 26, 2024Updated last year
- openASO is a project designed to identify regulatory regions of an RNA that can be targeted by antisense oligonucleotides.☆10Sep 30, 2021Updated 4 years ago
- XYFI Swap Contract Source Code☆10Nov 11, 2020Updated 5 years ago
- QuESt Planning is a long-term power system capacity expansion planning model that identifies cost-optimal energy storage, generation, and…☆14Feb 4, 2026Updated last week
- 🧠 A sample app to integrate react-native and open ai☆11Jan 1, 2023Updated 3 years ago
- ☆19Nov 20, 2025Updated 2 months ago
- Go SDK for the Bare Metal Cloud API☆14Dec 20, 2025Updated last month
- A Gym for Agentic LLMs☆444Jan 21, 2026Updated 3 weeks ago
- ☆20May 24, 2025Updated 8 months ago
- ☆21Jan 26, 2026Updated 2 weeks ago
- Example Systems using PowerDynamics.jl☆12Oct 10, 2022Updated 3 years ago
- Source code for the paper titled: "Unlocking the full potential of smart charging: Addressing paused and delayed charging problems in ele…☆11May 22, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- Final Project of ME5413 Autonomous Mobile Robotics @ NUS☆10Oct 13, 2023Updated 2 years ago
- ☆13May 11, 2022Updated 3 years ago
- ☆12Dec 26, 2023Updated 2 years ago
- sgbm立体匹配算法以及生成点云☆12Jan 29, 2021Updated 5 years ago
- Implementation of a Systolic Array based sorting engine on an FPGA using Verilog☆11May 11, 2017Updated 8 years ago
- A simple camera board using GMAX3412 1" 4K@30fps global shutter sensor☆18Dec 21, 2025Updated last month
- ☆12Mar 15, 2023Updated 2 years ago
- FeatureBench: Benchmarking Agentic Coding for Complex Feature Development [ICLR 2026]☆18Updated this week
- ☆14Oct 5, 2024Updated last year
- RLCar Gazebo v2☆12Jun 28, 2024Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆46Mar 29, 2024Updated last year
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆184May 20, 2025Updated 8 months ago
- A full python tool to sketch serial robots offline programming☆14May 30, 2024Updated last year
- 使用ROS2+RL 的循迹小车☆12Aug 30, 2024Updated last year
- 🌟 Stardex: Explore GitHub Stars Intelligently. Stardex is a powerful web app that lets you search, filter, and cluster any GitHub user's…☆13Jan 30, 2026Updated 2 weeks ago
- domain-level nucleic acid reaction enumeration☆10Aug 23, 2023Updated 2 years ago
- Go wrapper for Nvpipe (golang)☆11Aug 14, 2020Updated 5 years ago