shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆14Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for reward-augmented-preference
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆30Updated 2 weeks ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆34Updated 3 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆109Updated 2 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"☆29Updated 2 weeks ago
- The official implementation of Self-Exploring Language Models (SELM)☆56Updated 5 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆50Updated 6 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆102Updated 6 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning☆32Updated 3 weeks ago
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆27Updated 11 months ago
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆80Updated 7 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆34Updated 3 weeks ago
- HelloBench: evaluating long text generation capabilities of LLMs☆29Updated 3 weeks ago
- ☆50Updated 2 weeks ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆55Updated 5 months ago
- LexEval: A Comprehensive Benchmark for Evaluating Large Language Models in Legal Domain☆42Updated last week
- "Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?"☆58Updated last month
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆40Updated 9 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆58Updated 7 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated 11 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆36Updated last week
- ☆89Updated 4 months ago
- ☆28Updated 9 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆87Updated 3 months ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆31Updated this week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated 3 weeks ago
- ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆19Updated last week
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆93Updated last month
- The official implementation of Cross-Task Experience Sharing (COPS)☆14Updated 2 weeks ago
- ☆76Updated 10 months ago
- ☆18Updated last month