[NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.
☆24Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for InstructEval
Users who are interested in InstructEval are comparing it to the libraries listed below.
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆55Sep 28, 2023Updated 2 years ago
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks☆60Aug 2, 2023Updated 2 years ago
- applications of https://github.com/PrefectHQ/marvin☆13Jan 15, 2024Updated 2 years ago
- ☆42Nov 21, 2023Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated this week
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance☆14May 15, 2024Updated last year
- Github Repo for ICML 2022 paper: Communication-Efficient Adaptive Federated Learning☆10Nov 18, 2022Updated 3 years ago
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆15May 10, 2023Updated 2 years ago
- Repo for our AKBC-2021 paper: Abg-CoQA: Clarifying Ambiguity in Conversational Question Answering☆10Oct 10, 2021Updated 4 years ago
- Code for AISTATS'25 paper - On the Power of Adaptive Weighted Aggregation in Heterogeneous Federated Learning and Beyond☆14Sep 23, 2025Updated 6 months ago
- The first real-world FL benchmark for legal NLP☆13Nov 29, 2023Updated 2 years ago
- ☆15Jul 8, 2023Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆15May 13, 2025Updated 11 months ago
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆16Apr 24, 2024Updated last year
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"☆16Mar 14, 2024Updated 2 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated 11 months ago
- Unofficial JAX implementation of the SOAP optimizer (https://arxiv.org/abs/2409.11321)☆25Jan 9, 2026Updated 3 months ago
- Various AI Related Concepts Directory☆33Updated this week
- ☆11Oct 2, 2023Updated 2 years ago
- ☆36Nov 14, 2025Updated 5 months ago
- ☆10Apr 1, 2023Updated 3 years ago
- This repo contains reproduction resources for the linear alignment paper; still in progress☆18May 19, 2024Updated last year
- A small USD sample scene using resources from MaterialX's repo.☆13Apr 6, 2023Updated 3 years ago
- A Python client for Deepgram's Voice Agent API☆10Oct 14, 2025Updated 6 months ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 4 years ago
- Solve Geometric & Graph Problems with Large Language Models☆32Mar 6, 2023Updated 3 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- reveal-md is a great project. Improve your presentations even more with custom user scripts. This is the place to find them.☆15Dec 7, 2023Updated 2 years ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆17May 19, 2025Updated 10 months ago
- A free, open source, AI powered alternative to Quizlet.☆17May 15, 2023Updated 2 years ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Mar 7, 2024Updated 2 years ago
- large language model training-3-stages+deployment☆47Aug 14, 2023Updated 2 years ago
- Dicionário Histórico Biográfico Brasileiro☆13Apr 1, 2025Updated last year
- ☆11Nov 7, 2023Updated 2 years ago
- ☆26Oct 26, 2020Updated 5 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago