Comprehensive LLM evaluation at scale: A production-ready framework for evaluating large language models across multiple benchmarks.
☆36Updated this week
Alternatives and similar repositories for eval-framework
Users that are interested in eval-framework are comparing it to the libraries listed below
Sorting:
- Serverless AI powered by WebAssembly☆21Feb 6, 2026Updated 3 weeks ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- A repo for implementing and understanding design patterns of agentic workflows☆22Feb 1, 2025Updated last year
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Script for using Bing chat like a meal delivery service.☆12Mar 15, 2023Updated 2 years ago
- ☆12Apr 24, 2024Updated last year
- ☆11Nov 26, 2020Updated 5 years ago
- ☆14Jun 24, 2024Updated last year
- ☆11Jul 7, 2023Updated 2 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- A list where most values will be None (or default)☆10Jul 19, 2023Updated 2 years ago
- Implementation of Reinforce for educational purposes.☆12Jun 12, 2023Updated 2 years ago
- Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods"☆13Nov 26, 2024Updated last year
- Deep Learning with Multiple Objectives: 2021 edition☆10May 27, 2021Updated 4 years ago
- Platform API as Configuration☆10Aug 18, 2020Updated 5 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Can Large Language Models Identify Authorship? (EMNLP 2024 Findings)☆12Feb 4, 2025Updated last year
- Full List of Bad Words and Top Swear Words Banned by Google. As they closed the api☆12Sep 26, 2018Updated 7 years ago
- Code for "Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding" (EMNLP 2020).☆11May 1, 2025Updated 9 months ago
- ☆13May 21, 2024Updated last year
- Kubernetes controller which watches for newly created EKS clusters and configures ArgoCD to provision software to it☆11Jun 9, 2020Updated 5 years ago
- This project scrapes the entire public history of a Reddit user given their username☆14Dec 8, 2022Updated 3 years ago
- Too much tools in context. Use a gateway☆17Jan 24, 2026Updated last month
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor…☆12Mar 18, 2022Updated 3 years ago
- Deep Learning Type Library☆39Updated this week
- 🤖📚 Telegram bot to convert and email PDFs, EPUBs or MOBIs to your Kindle☆11Sep 16, 2022Updated 3 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10May 10, 2023Updated 2 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago
- Dataset accompanying the paper "Investigating African-American Vernacular English in Transformer-Based Text Generation."☆10Apr 8, 2022Updated 3 years ago
- ROUGE L metric implementation using tensorflow ops☆12Sep 17, 2018Updated 7 years ago
- TensorFlow implementation of the "Prompt-to-Prompt Image Editing with Cross Attention Control" for Stable Diffusion☆16Mar 25, 2023Updated 2 years ago
- This repo consists of code for plotting top loss images☆13May 18, 2020Updated 5 years ago
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆16Dec 16, 2024Updated last year
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- ☆10Jul 27, 2018Updated 7 years ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Split bib files for anthology bibliography for overleaf☆11Aug 25, 2024Updated last year