☆19Aug 3, 2024Updated last year
Alternatives and similar repositories for FreeEval
Users that are interested in FreeEval are comparing it to the libraries listed below
Sorting:
- ☆19May 25, 2024Updated last year
- ☆118Jun 13, 2023Updated 2 years ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- ☆12Apr 21, 2025Updated 10 months ago
- javascript animation capture examples 🎬☆13Mar 14, 2023Updated 2 years ago
- 2D physics engine☆11Jan 12, 2023Updated 3 years ago
- ☆12Jan 11, 2026Updated last month
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- A platform aimed at creating websites that perform self-optimization☆12May 4, 2024Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10May 6, 2024Updated last year
- Command line tool for collecting TODO markers from your code, known as Puzzle Driven Development (PDD)☆13Sep 16, 2023Updated 2 years ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆38Sep 27, 2025Updated 5 months ago
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing.☆10Aug 18, 2023Updated 2 years ago
- Pagination for TelegramBot CallbackQuery☆11Apr 14, 2021Updated 4 years ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- Opinionated, library-agnostic Python framework for rapid development of Telegram bots and userbots with focus on maintainability for larg…☆10Feb 16, 2023Updated 3 years ago
- ☆12Mar 5, 2025Updated 11 months ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Repository of papers released by Modulus Labs☆13Mar 13, 2024Updated last year
- The official Python library for Openlayer, the Continuous Model Improvement Platform for AI. 📈☆16Updated this week
- Automatic audio transcription to .srt using Google's Speech to Text API☆12Oct 26, 2020Updated 5 years ago
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- scraped www.allitebooks.com and index all the books available.☆12Oct 1, 2020Updated 5 years ago
- A mutation testing CLI tool built in Rust. Currently supports Noir as a target language☆12Dec 2, 2024Updated last year
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- berg 🦀 Transform the contents of Epub documents.☆10Apr 27, 2023Updated 2 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- ☆11Oct 15, 2022Updated 3 years ago
- Rust implementation of the Fift esoteric language☆12Aug 19, 2025Updated 6 months ago
- Jai bindings for the sokol headers (https://github.com/floooh/sokol)☆12Feb 20, 2026Updated last week
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- ☆11Nov 5, 2024Updated last year
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- ☆12Nov 5, 2024Updated last year