RewardAnything: Generalizable Principle-Following Reward Models
☆45Jun 11, 2025Updated 9 months ago
Alternatives and similar repositories for RewardAnything
Users who are interested in RewardAnything are comparing it to the repositories listed below
- Code for ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment☆12Feb 28, 2024Updated 2 years ago
- ☆35Jun 3, 2025Updated 9 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- ☆14Sep 19, 2022Updated 3 years ago
- [ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference☆24Dec 25, 2023Updated 2 years ago
- ☆23Nov 20, 2021Updated 4 years ago
- Enhanced version of WikiExtractor: a Wikipedia dumps extractor☆28Sep 17, 2025Updated 6 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆28Feb 28, 2026Updated 2 weeks ago
- ☆355Jul 29, 2025Updated 7 months ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated last month
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- Very concise example of integrated gradients (a method to reveal areas of attention in input images)☆10Jun 17, 2019Updated 6 years ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 8 months ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆19May 28, 2024Updated last year
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".☆64Feb 6, 2026Updated last month
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆30Oct 27, 2025Updated 4 months ago
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- 🔧 Custom utils. Small script tools for everyday use.☆10Jun 14, 2024Updated last year
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆39Dec 23, 2025Updated 2 months ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆16Feb 15, 2024Updated 2 years ago
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆35Nov 19, 2025Updated 4 months ago
- ☆16Jun 19, 2023Updated 2 years ago
- Paddle reproduction of the paper ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information (ACL 2021)☆10Nov 15, 2021Updated 4 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- Wuxia-style novel generation using the T5-PEGASUS model. Chinese wuxia novel continuation.☆12Nov 22, 2022Updated 3 years ago
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆110Sep 29, 2025Updated 5 months ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- ☆13Apr 21, 2024Updated last year
- [Paper][AAAI2023] Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph Embedding☆13Mar 3, 2023Updated 3 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Jan 12, 2024Updated 2 years ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆292Oct 2, 2025Updated 5 months ago
- ☆308Jul 6, 2025Updated 8 months ago
- ☆29May 24, 2024Updated last year
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 8 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago