Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
☆18 · Mar 23, 2023 · Updated 2 years ago
Alternatives and similar repositories for evals
Users who are interested in evals compare it to the repositories listed below.
- Continual Resilient (CoRe) Optimizer for PyTorch ☆11 · Jun 10, 2024 · Updated last year
- Choose today's text ☆12 · Dec 9, 2022 · Updated 3 years ago
- Flappy Space Program ☆18 · May 5, 2014 · Updated 11 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition ☆11 · Dec 4, 2021 · Updated 4 years ago
- A simple agent powered by LLMs that performs tasks. ☆13 · Apr 25, 2025 · Updated 10 months ago
- ☆12 · Jan 19, 2024 · Updated 2 years ago
- Fisher xxh-plugin ☆10 · Mar 8, 2021 · Updated 4 years ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023. ☆10 · Nov 12, 2023 · Updated 2 years ago
- All the tools that allow me to never ever open up Final Cut ☆11 · Feb 16, 2025 · Updated last year
- Korean Abstract Meaning Representation (AMR) Corpus ☆10 · Feb 27, 2022 · Updated 4 years ago
- This Node.js script automates the process of downloading and extracting source maps from websites. It uses Puppeteer to navigate web page… ☆18 · Dec 17, 2025 · Updated 2 months ago
- Dataset and code to reproduce the results of the paper "Evolving Structures in Complex Systems" ☆11 · Dec 16, 2019 · Updated 6 years ago
- benchmarks for evaluating MT models ☆11 · Jun 26, 2024 · Updated last year
- Turn Trello into a CMS to power all your websites and apps. ☆10 · May 12, 2018 · Updated 7 years ago
- Local text-to-speech in your browser with Piper TTS ☆17 · Aug 13, 2025 · Updated 6 months ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem. ☆16 · Jan 9, 2026 · Updated last month
- Metalanguage analysis ☆10 · Dec 10, 2018 · Updated 7 years ago
- OpenAI's Code Interpreter running locally, as a service via WebSocket ☆10 · Sep 22, 2023 · Updated 2 years ago
- ☆11 · Sep 8, 2024 · Updated last year
- Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels ☆10 · Mar 29, 2022 · Updated 3 years ago
- mdoc versions of the documentation for the execline suite ☆13 · Apr 9, 2023 · Updated 2 years ago
- NodeBB Plugin enabling emoji as seen on http://www.emoji-cheat-sheet.com ☆14 · Updated this week
- Official GraphQLBlog repository. Add your blog posts as pull request! ☆13 · Jan 11, 2023 · Updated 3 years ago
- ☆17 · Jan 17, 2026 · Updated last month
- a sharable language ☆15 · Jan 20, 2025 · Updated last year
- ☆16 · Dec 21, 2023 · Updated 2 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022 ☆11 · Aug 20, 2022 · Updated 3 years ago
- ☆12 · May 8, 2025 · Updated 9 months ago
- Moral Machine Experiment on LLMs ☆11 · Updated this week
- Dynamical Systems with JAX ☆12 · Jan 11, 2026 · Updated last month
- ☆10 · Jan 31, 2021 · Updated 5 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection ☆25 · May 31, 2025 · Updated 9 months ago
- ☆13 · May 21, 2023 · Updated 2 years ago
- TiddlyWiki-based memory programme using advanced FSRS algorithm. ☆12 · Dec 24, 2023 · Updated 2 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2… ☆14 · Dec 7, 2024 · Updated last year
- Tidy autoregressive inference in JAX ☆15 · Sep 1, 2025 · Updated 6 months ago
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release ☆11 · Jan 5, 2023 · Updated 3 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022 ☆13 · Apr 13, 2022 · Updated 3 years ago
- This is a fork of docker-library/official-images. It is used by the Arch Linux team to create automated pull requests. ☆12 · Updated this week