UpstageAI / evalverse-IFEval
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/master/instruction_following_eval)
☆14 Updated last year
Alternatives and similar repositories for evalverse-IFEval
Users who are interested in evalverse-IFEval compare it to the libraries listed below.
- ☆55 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval ☆58 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆58 Updated last month
- ☆58 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆90 Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- The first dense retrieval model that can be prompted like an LM ☆89 Updated 6 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆173 Updated 10 months ago
- 🚢 Data Toolkit for Sailor Language Models ☆94 Updated 9 months ago
- LLM boxing matches ☆58 Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 7 months ago
- ☆45 Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆141 Updated 2 years ago
- ☆78 Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 Updated last month
- Using modal.com to process FineWeb-edu data ☆20 Updated 7 months ago
- ☆62 Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆45 Updated last year
- ☆20 Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆79 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆72 Updated last year
- Code for training & evaluating Contextual Document Embedding models ☆200 Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- ☆32 Updated last year
- Nexusflow function call, tool use, and agent benchmarks. ☆30 Updated 11 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 Updated 8 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆116 Updated 5 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆219 Updated 3 weeks ago