Large-language Model Evaluation framework with Elo Leaderboard and A-B testing
☆52Oct 24, 2024Updated last year
Alternatives and similar repositories for h2o-LLM-eval
Users that are interested in h2o-LLM-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Nov 15, 2019Updated 6 years ago
- Generate 90s MS Wordart from Node or Docker☆11May 15, 2023Updated 2 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆21Nov 4, 2025Updated 5 months ago
- ☆50Apr 10, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Mar 19, 2024Updated 2 years ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆15Jan 31, 2023Updated 3 years ago
- A tool library for riichi mahjong written in Rust, made mostly to be used as a WASM component.☆13Aug 29, 2025Updated 7 months ago
- ☆13Jul 30, 2024Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Nov 19, 2023Updated 2 years ago
- ⏱ A Google Chrome extension for keeping track of who is talking during Google Meets☆12Aug 17, 2023Updated 2 years ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated last year
- ☆12May 23, 2024Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Mar 16, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"☆20Dec 13, 2024Updated last year
- Portable TCP/UDP/ICMP traceroute tool, written in Python☆17Apr 18, 2020Updated 5 years ago
- GPT API Cost Estimation for Enterprises☆14Oct 24, 2023Updated 2 years ago
- ☆28Nov 10, 2025Updated 5 months ago
- Tools and dumps related to the Smishing Triad and the USPS smishing campaign from late 2023 into 2024☆11Apr 28, 2024Updated last year
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Aug 12, 2024Updated last year
- Transformer language model (GPT-2) with sentencepiece tokenizer☆10Oct 15, 2019Updated 6 years ago
- YoloX for a Jetson Nano using ncnn.☆20Sep 30, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The best terminal chat client for your live streams☆19Jun 10, 2023Updated 2 years ago
- A simple tutorial script on Streamlit using the Iris Dataset☆13Sep 13, 2023Updated 2 years ago
- ALAS: Autonomous Learning Agent System☆15Aug 14, 2025Updated 8 months ago
- [COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?☆22Oct 13, 2024Updated last year
- ☆16May 21, 2024Updated last year
- Scan your AI/ML models for problems before you put them into production.☆11Mar 31, 2025Updated last year
- 3D chess game made in OpenGL☆10Mar 27, 2017Updated 9 years ago
- easypy makes python even easier!☆17Apr 1, 2026Updated 2 weeks ago
- FastAPI Microservices Architecture SDK - As Basis for multiple services in a platform/system☆12Oct 4, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Jan 7, 2024Updated 2 years ago
- ☆23Nov 8, 2023Updated 2 years ago
- ☆13Jun 2, 2023Updated 2 years ago
- Online Multiple Camera Multiple Target Tracking Algorithm implemented by Visual C++☆20May 25, 2017Updated 8 years ago
- one-click deepfake (face swap)☆10May 30, 2023Updated 2 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year