A framework for few-shot evaluation of autoregressive language models.
☆26Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for lm-evaluation-harness
Users that are interested in lm-evaluation-harness are comparing it to the libraries listed below
Sorting:
- ☆37Oct 29, 2024Updated last year
- Formalization of IMO shortlist problems in Lean 4☆25Updated this week
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24May 29, 2024Updated last year
- ☆26Aug 2, 2022Updated 3 years ago
- ProofNet dataset ported into Lean 4☆29Jun 9, 2025Updated 9 months ago
- Anh - LAION's multilingual assistant datasets and models☆27Apr 5, 2023Updated 2 years ago
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)☆30Dec 23, 2023Updated 2 years ago
- deep learning for math☆29May 4, 2019Updated 6 years ago
- MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models☆454Feb 1, 2024Updated 2 years ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆67Feb 29, 2024Updated 2 years ago
- An open bibliography of machine learning for formal proof papers☆32Sep 30, 2023Updated 2 years ago
- Try a tactic at each step in a Lean proof.☆36Mar 1, 2026Updated last week
- A modern audio editor with multitrack capabilities, enhanced waveform visualization, and an intuitive, sleek interface.☆17Aug 12, 2025Updated 6 months ago
- A repository for OpenHack for Lakehouse. The contents are written in Japanese.☆11Nov 20, 2023Updated 2 years ago
- A semidefinite programming solver for clustered low-rank SDPs☆14Updated this week
- ☆13Jul 8, 2024Updated last year
- ☆11Sep 15, 2025Updated 5 months ago
- Complexity analysis in Lean☆10Feb 5, 2024Updated 2 years ago
- ☆11May 18, 2022Updated 3 years ago
- A library to query heterogeneous data sources uniformly using SPARQL☆12Dec 5, 2023Updated 2 years ago
- 一个支持跨模态大语言模型的webui. A chatbot webui that supports various multi-modal large language models☆11May 8, 2023Updated 2 years ago
- ☆12Oct 1, 2024Updated last year
- ☆13Oct 4, 2024Updated last year
- rule matcher (context free grammar)☆10Dec 27, 2019Updated 6 years ago
- ☆45Sep 21, 2024Updated last year
- A development of homotopy theory in the Lean formal theorem prover.☆14Aug 27, 2020Updated 5 years ago
- Simple c++ wrapper for xz utils☆13Jul 1, 2021Updated 4 years ago
- DREN Tensorflow rotate mnist☆11Mar 24, 2019Updated 6 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models☆11Jan 23, 2024Updated 2 years ago
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 7 months ago
- Creative Instructions Project☆11Sep 4, 2023Updated 2 years ago
- Interesting ATP Proofs☆13Sep 3, 2021Updated 4 years ago
- API serving for your diffusers models☆11Jan 19, 2024Updated 2 years ago
- Tableaux for Propositional Dynamic Logic in Lean 4 (WORK IN PROGRESS)☆15Updated this week
- Unofficial copy of GSview 5.0☆11Jun 7, 2019Updated 6 years ago
- Streamlines the creation of dataset to train a Large Language Model with triplets : instruction-input-output . The default configuration …☆13Apr 17, 2023Updated 2 years ago
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data☆42May 29, 2024Updated last year