[NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.
☆24Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for InstructEval
Users that are interested in InstructEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆57Sep 28, 2023Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- Repo to reproduce the First-Explore paper results☆39May 6, 2026Updated 3 weeks ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Apr 13, 2026Updated last month
- Code for Generalization Guarantees for (Multi-Modal) Imitation Learning☆11Jul 14, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆15May 10, 2023Updated 3 years ago
- Repo for our AKBC-2021 paper: Abg-CoQA: Clarifying Ambiguity in Conversational Question Answering☆11Oct 10, 2021Updated 4 years ago
- Code for Neural Networks journal paper - StoCFL: A stochastically clustered federated learning framework for Non-IID data with dynamic cl…☆13Apr 28, 2024Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆30Apr 8, 2026Updated last month
- The implementation for the work "Unconstrained Monotonic Calibration of Predictions in Deep Ranking Systems".☆23Jun 11, 2025Updated 11 months ago
- The first real-world FL benchmark for legal NLP☆13Nov 29, 2023Updated 2 years ago
- ☆15Feb 18, 2021Updated 5 years ago
- A tool for dissecting Textual widgets, including default CSS and more☆20Oct 7, 2025Updated 7 months ago
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆15May 13, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated 2 years ago
- [ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- L4: Practical loss-based stepsize adaptation for PyTorch☆18May 7, 2021Updated 5 years ago
- RND1: Scaling Diffusion Language Models☆181Feb 22, 2026Updated 3 months ago
- ☆10Jun 7, 2021Updated 4 years ago
- Python library to add support for embedding natural code in Python with shared program state.☆30Jan 20, 2026Updated 4 months ago
- ☆36Nov 14, 2025Updated 6 months ago
- ☆10Apr 1, 2023Updated 3 years ago
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Various AI Related Concepts Directory☆35May 18, 2026Updated last week
- A Python client for Deepgram's Voice Agent API☆10Oct 14, 2025Updated 7 months ago
- Ralph is a suite of autonomous agents that orchestrate Claude CLI for backlog-driven SDLC automation. Each agent acts as a specialized te…☆51Updated this week
- reveal-md is great project. Improve your presentation even more with custom user scripts. Here is the place to find them.☆15Dec 7, 2023Updated 2 years ago
- Code for the IROS 2021 paper "Learning of Parameters in Behavior Trees for Movement Skills". In short, we combine behavior trees (BT), a …☆13Jan 8, 2024Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆19Mar 29, 2021Updated 5 years ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆17May 19, 2025Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Mar 7, 2024Updated 2 years ago
- AI coding models, agents, CLIs, IDEs, AI app builders, open source tooling, benchmarks☆54Apr 20, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Dicionário Histórico Biográfico Brasileiro☆13Apr 1, 2025Updated last year
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- Planet: A unified sampling-based approach to integrated task and motion planning☆16Jul 9, 2020Updated 5 years ago
- [TMLR 2025] Unifi3D: A Study on 3D Representations for Generation and Reconstruction in a Common Framework☆43Dec 17, 2025Updated 5 months ago
- ☆21May 14, 2026Updated 2 weeks ago
- Libraries for efficient and scalable group-structured dataset pipelines.☆25Jun 18, 2025Updated 11 months ago
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆30Apr 13, 2023Updated 3 years ago