☆21Jul 26, 2025Updated 9 months ago
Alternatives and similar repositories for eval-anything
Users that are interested in eval-anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆34Aug 20, 2024Updated last year
- ☆17Sep 25, 2024Updated last year
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆113Jan 27, 2026Updated 3 months ago
- Automation scripts for setting up a basic development environment.☆104Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- I love algorithms.☆26Dec 25, 2024Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Aug 9, 2023Updated 2 years ago
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆151Mar 14, 2026Updated last month
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆40Apr 24, 2025Updated last year
- Proof-carrying code completions in Dafny☆11Apr 4, 2025Updated last year
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).☆180Oct 27, 2023Updated 2 years ago
- An example RLDS dataset builder for X-embodiment dataset conversion.☆62Mar 1, 2025Updated last year
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆129Mar 22, 2024Updated 2 years ago
- The homework of robos learning base.☆11May 23, 2023Updated 2 years ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆13Nov 12, 2023Updated 2 years ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆11Apr 12, 2024Updated 2 years ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆155Jun 25, 2024Updated last year
- pyCEPS provides an interface to import, visualize and translate clinical mapping data☆14Nov 25, 2025Updated 5 months ago
- ☆18Dec 5, 2024Updated last year
- ☆15Nov 2, 2025Updated 5 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆193Jan 16, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆89Mar 15, 2024Updated 2 years ago
- DafnyBench: A Benchmark for Formal Software Verification☆63Dec 12, 2024Updated last year
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Feb 21, 2024Updated 2 years ago
- Lightweight control environment for Franka robot☆12Mar 16, 2022Updated 4 years ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆20May 2, 2025Updated 11 months ago
- MACCIA 2022 paper reading notes: tasks and datasets☆12Feb 6, 2023Updated 3 years ago
- 七轴机械臂的仿真☆13Jun 7, 2022Updated 3 years ago
- Official GitHub repository for the paper "Adversarial Attacks on Robotic Vision Language Action Models"☆33May 28, 2025Updated 11 months ago
- Atrial Modelling Toolkit☆18Nov 14, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Codes for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness" published at IC…☆27Apr 29, 2020Updated 6 years ago
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment☆22Mar 20, 2025Updated last year
- 用Kinova Gen3实机实现Rekep☆11Mar 18, 2025Updated last year
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago
- PKU course materials on computer science and life science.☆189Apr 5, 2026Updated 3 weeks ago
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year