☆22Jul 26, 2025Updated 10 months ago
Alternatives and similar repositories for eval-anything
Users that are interested in eval-anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆35Aug 20, 2024Updated last year
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- [NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.☆142Mar 31, 2026Updated 2 months ago
- I love algorithms.☆26Dec 25, 2024Updated last year
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆171Mar 14, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆40Apr 24, 2025Updated last year
- Proof-carrying code completions in Dafny☆11Apr 4, 2025Updated last year
- Focused on the safety and security of Embodied AI☆107May 27, 2026Updated 2 weeks ago
- An example RLDS dataset builder for X-embodiment dataset conversion.☆62Mar 1, 2025Updated last year
- download all oral & spotlight papers from neurips, iclr, icml or any openreview conference☆28Apr 26, 2026Updated last month
- [MICCAI2023] XSurv: Merging-Diverging Hybrid Transformer Networks for Survival Prediction☆11Oct 2, 2023Updated 2 years ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆39May 9, 2024Updated 2 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆24Mar 25, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆130Mar 22, 2024Updated 2 years ago
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆51Nov 27, 2025Updated 6 months ago
- The homework of robos learning base.☆11May 23, 2023Updated 3 years ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆13Nov 12, 2023Updated 2 years ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆13Apr 12, 2024Updated 2 years ago
- This project contains Balancing Robot in Gazebo.☆18Apr 22, 2025Updated last year
- pyCEPS provides an interface to import, visualize and translate clinical mapping data☆14Nov 25, 2025Updated 6 months ago
- ☆16Nov 2, 2025Updated 7 months ago
- Debian packaging for NNCP [archived], moved to https://salsa.debian.org/go-team/packages/nncp☆14Feb 18, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆193Jan 16, 2025Updated last year
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆89Mar 15, 2024Updated 2 years ago
- Multi-modal approach for tumor segmentation and survival prediction using PET/CT imaging with attention mechanisms (MICCAI2021 HECKTOR Ch…☆12Apr 22, 2022Updated 4 years ago
- DafnyBench: A Benchmark for Formal Software Verification☆65Dec 12, 2024Updated last year
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Feb 21, 2024Updated 2 years ago
- AugmentA: Patient-specific Augmented Atrial model Generation Tool☆15Nov 24, 2025Updated 6 months ago
- Lightweight control environment for Franka robot☆12Mar 16, 2022Updated 4 years ago
- ☆14Sep 23, 2022Updated 3 years ago
- MACCIA 2022 paper reading notes: tasks and datasets☆12Feb 6, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 七轴机械臂的仿真☆13Jun 7, 2022Updated 4 years ago
- This is the code implementation of the Neural ordinary differential equations-based Lyapunov-Barrier Actor-Critic (NLBAC)☆17Sep 4, 2024Updated last year
- Atrial Modelling Toolkit☆18Nov 14, 2025Updated 6 months ago
- Codes for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness" published at IC…☆27Apr 29, 2020Updated 6 years ago
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment☆22Mar 20, 2025Updated last year
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- 用Kinova Gen3实机实现Rekep☆11Mar 18, 2025Updated last year