RewardAnything: Generalizable Principle-Following Reward Models
☆44Jun 11, 2025Updated 11 months ago
Alternatives and similar repositories for RewardAnything
Users that are interested in RewardAnything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment☆12Feb 28, 2024Updated 2 years ago
- How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning☆26Aug 29, 2024Updated last year
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 7 months ago
- ☆47Oct 29, 2025Updated 7 months ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Sep 19, 2022Updated 3 years ago
- [ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference☆24Dec 25, 2023Updated 2 years ago
- Llama2 chinese finetuning☆38Aug 2, 2023Updated 2 years ago
- Data and Baselines for AStitchInLanguageModels dataset☆12Oct 31, 2022Updated 3 years ago
- ☆359Jul 29, 2025Updated 10 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 3 months ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated 3 months ago
- Text-driven human motion generation surveys, datasets and models.☆90Aug 17, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆35Jun 29, 2024Updated last year
- Very concise example of integrated gradients (a method to reveal areas of attention in input images)☆10Jun 17, 2019Updated 6 years ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 11 months ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆20May 28, 2024Updated 2 years ago
- Offical Repository of MetaAgent Program☆53Dec 2, 2025Updated 6 months ago
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆32Sep 25, 2025Updated 8 months ago
- clip retrieval benchmark☆17May 4, 2022Updated 4 years ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆18Mar 18, 2026Updated 2 months ago
- ☆16Jul 29, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆16Feb 15, 2024Updated 2 years ago
- ⚡️Lightweight framework for NLP research, based on PyTorch⚡️☆12Apr 5, 2023Updated 3 years ago
- ☆18Jun 19, 2023Updated 2 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated 2 years ago
- ☆16Apr 11, 2026Updated last month
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆32Aug 18, 2024Updated last year
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆118Sep 29, 2025Updated 8 months ago
- Wuxia style novel generation using T5-PEGASUS model. 中文武侠小说续写☆12Nov 22, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Dec 6, 2024Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆21Jul 10, 2025Updated 10 months ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- [Paper][AAAI2023] Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph Embedding☆13Mar 3, 2023Updated 3 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆37Jul 3, 2025Updated 11 months ago
- ☆311Jul 6, 2025Updated 11 months ago
- a benckmark for evaluating logical reasoning of LLMs☆23Jan 25, 2024Updated 2 years ago