Scrape and export data from the Open LLM Leaderboard.
☆48Dec 17, 2024Updated last year
Alternatives and similar repositories for scrape-open-llm-leaderboard
Users that are interested in scrape-open-llm-leaderboard are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The backend behind the LLM-Perf Leaderboard☆11May 5, 2024Updated 2 years ago
- A simple generate script utils using fastchat conv template for generation of Large Language Models☆21Jun 21, 2023Updated 3 years ago
- Complete set of English dialect transformation rules and evaluation code☆17Jun 7, 2024Updated 2 years ago
- Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"☆16Mar 6, 2026Updated 3 months ago
- Fluid Language Model Benchmarking☆30Sep 16, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Literature overview: gender bias in natural language processing☆11Jan 26, 2021Updated 5 years ago
- ☆16Apr 2, 2025Updated last year
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- ☆19Jun 7, 2022Updated 4 years ago
- A supplementary code for Editable Neural Networks, an ICLR 2020 submission.☆46Jan 21, 2020Updated 6 years ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- Converting the Enron email collection to mbox format☆12Dec 9, 2016Updated 9 years ago
- Evaluating LLMs with CommonGen-Lite☆95Mar 21, 2024Updated 2 years ago
- A package dedicated for running benchmark agreement testing☆19Sep 18, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆65Aug 7, 2023Updated 2 years ago
- Project to convert PDF files to Text files using google OCR☆13May 6, 2024Updated 2 years ago
- ☆10May 11, 2021Updated 5 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 9 months ago
- This repo is all about creating sample apps with Streamlit.☆13Oct 25, 2020Updated 5 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- A template code for running modular and reproducible experiments in pytorch☆13Sep 3, 2025Updated 9 months ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆27Mar 13, 2024Updated 2 years ago
- ☆50Jun 7, 2025Updated last year
- Evaluating LLMs with fewer examples☆179Apr 12, 2024Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Tools for formatting large language model prompts.☆13Dec 19, 2023Updated 2 years ago
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆16Jul 25, 2017Updated 8 years ago
- ☆12Mar 7, 2022Updated 4 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Sep 23, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 2 months ago
- Automatically evaluate your LLMs in Google Colab☆687May 7, 2024Updated 2 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆218Feb 10, 2024Updated 2 years ago
- Fact checking baseline combining dense retrieval and textual entailment☆30Aug 10, 2025Updated 10 months ago
- Text readability metrics in Python.☆11Aug 29, 2013Updated 12 years ago