☆21Jul 26, 2025Updated 8 months ago
Alternatives and similar repositories for eval-anything
Users that are interested in eval-anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆34Aug 20, 2024Updated last year
- ☆17Sep 25, 2024Updated last year
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- [NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.☆134Mar 31, 2026Updated last week
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆141Mar 14, 2026Updated 3 weeks ago
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆40Apr 24, 2025Updated 11 months ago
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆22Mar 25, 2026Updated 2 weeks ago
- Focused on the safety and security of Embodied AI☆100Dec 19, 2025Updated 3 months ago
- An example RLDS dataset builder for X-embodiment dataset conversion.☆61Mar 1, 2025Updated last year
- download all oral & spotlight papers from neurips, iclr, icml or any openreview conference☆24Dec 6, 2025Updated 4 months ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆129Mar 22, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The homework of robos learning base.☆11May 23, 2023Updated 2 years ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆11Apr 12, 2024Updated last year
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆155Jun 25, 2024Updated last year
- ☆17Dec 5, 2024Updated last year
- A comprehensive survey of the World Law Agent ecosystem — AI + Law☆29Mar 14, 2026Updated 3 weeks ago
- Debian packaging for NNCP [archived], moved to https://salsa.debian.org/go-team/packages/nncp☆14Feb 18, 2023Updated 3 years ago
- ☆14Nov 2, 2025Updated 5 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Jan 16, 2025Updated last year
- ☆59Dec 12, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DafnyBench: A Benchmark for Formal Software Verification☆59Dec 12, 2024Updated last year
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Feb 21, 2024Updated 2 years ago
- AugmentA: Patient-specific Augmented Atrial model Generation Tool☆15Nov 24, 2025Updated 4 months ago
- Lightweight control environment for Franka robot☆12Mar 16, 2022Updated 4 years ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆19May 2, 2025Updated 11 months ago
- ☆14Sep 23, 2022Updated 3 years ago
- MACCIA 2022 paper reading notes: tasks and datasets☆12Feb 6, 2023Updated 3 years ago
- Hand eye calibration for panda and fr3 using apriltag☆16Apr 7, 2025Updated last year
- ☆23Jun 22, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is the code implementation of the Neural ordinary differential equations-based Lyapunov-Barrier Actor-Critic (NLBAC)☆16Sep 4, 2024Updated last year
- Atrial Modelling Toolkit☆18Nov 14, 2025Updated 4 months ago
- GANs for generating digital distribution☆12Jul 18, 2018Updated 7 years ago
- Codes for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness" published at IC…☆27Apr 29, 2020Updated 5 years ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆29Mar 22, 2026Updated 2 weeks ago
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment☆21Mar 20, 2025Updated last year
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago