☆36May 30, 2025Updated last year
Alternatives and similar repositories for PrefEval
Users that are interested in PrefEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆20Jul 27, 2025Updated 10 months ago
- Implementation of Decision Stacks: Flexible RL via Modular Generative Models [NeurIPS 2023]☆12Jun 27, 2023Updated 2 years ago
- ☆14Jun 18, 2024Updated last year
- ☆14May 12, 2025Updated last year
- ☆11Sep 26, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Jul 6, 2022Updated 3 years ago
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆29Apr 23, 2026Updated last month
- ☆25Mar 4, 2024Updated 2 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- ☆24Dec 30, 2024Updated last year
- The agent benchmark that scores the full stack — harness, config, and model — not just the LLM. Trace-based scoring, reliability metrics,…☆111Updated this week
- Official implementation of OpenTab (ICLR2024)☆13Mar 27, 2024Updated 2 years ago
- Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"☆25Apr 9, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if …☆10Dec 7, 2024Updated last year
- A Comprehensive Library for Memory of LLM-based Agents.☆111May 13, 2025Updated last year
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated last year
- A Survey of Self-Evolving Agents | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Self-Evolving Ag…☆238Jun 7, 2026Updated last week
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆16May 30, 2024Updated 2 years ago
- Code for Paper "Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation"☆12Feb 6, 2023Updated 3 years ago
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆14Mar 23, 2025Updated last year
- A vision-based RL environment for the Franka Panda arm using NVIDIA Isaac Sim☆19Jan 3, 2025Updated last year
- OpenTeach fork for the bimanual Franka Research 3 setup☆26Nov 26, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆18Jul 8, 2025Updated 11 months ago
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30May 27, 2026Updated 2 weeks ago
- Code to reproduce results of our experiments using LoRe☆18Updated this week
- The example of correspondence between fine classes and superclasses (coarse classes) in ImageNet.☆13Dec 4, 2024Updated last year
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆34Sep 20, 2024Updated last year
- ☆18Jun 12, 2024Updated 2 years ago
- ☆15Apr 14, 2020Updated 6 years ago
- [NeurIPS 2025] Continual Multimodal Contrastive Learning☆28Dec 18, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 2024] Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method☆15Oct 1, 2024Updated last year
- ☆152Mar 31, 2026Updated 2 months ago
- The implementation of paper "Strategy-aware Bundle Recommender System", SIGIR'23.☆15Sep 4, 2023Updated 2 years ago
- DependEval: a hierarchical benchmark for evaluating LLMs on repository-level code understanding across 8 programming languages.☆16Jul 28, 2025Updated 10 months ago
- Taskmate - an open source grading desktop application with synchronisation capabilities for Windows and MacOS☆14May 29, 2023Updated 3 years ago
- Visualizing ImageNet Classes Hierarchical Structure.☆15Apr 8, 2018Updated 8 years ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆34Nov 2, 2025Updated 7 months ago