jaehunjung1 / cascaded-selective-evaluation
☆26Updated 6 months ago
Alternatives and similar repositories for cascaded-selective-evaluation
Users that are interested in cascaded-selective-evaluation are comparing it to the libraries listed below
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆75Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated last year
- ☆74Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 6 months ago
- List of papers on Self-Correction of LLMs.☆74Updated 8 months ago
- ☆22Updated 2 weeks ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 7 months ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- Embedding Recycling for Language models☆39Updated 2 years ago
- ☆45Updated 4 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆88Updated last year
- ☆29Updated 3 weeks ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Google Research☆45Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated last year
- ☆75Updated last year
- ☆44Updated 9 months ago
- ☆14Updated 10 months ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Updated 2 years ago
- ☆25Updated last year
- ☆13Updated 2 years ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆43Updated 6 months ago
- Finding semantically meaningful and accurate prompts.☆47Updated last year
- ☆48Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆13Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆17Updated 10 months ago