Weekly visualization report of Open LLM model performance based on 4 metrics.
☆86Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for open-llm-leaderboard-report
Users that are interested in open-llm-leaderboard-report are comparing it to the libraries listed below.
- The Python package ExceptNotifier enhances the try-except statement, allowing you to receive detailed error messages via email or messeng…☆30Apr 15, 2024Updated 2 years ago
- Awesome-GenAITech: a curated list of Generative AI Techniques☆11Jul 11, 2023Updated 2 years ago
- Multi-label VOC (Voice of Customers) tag classification model using pretrained KoBERT☆15Apr 25, 2022Updated 4 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)☆14Nov 20, 2024Updated last year
- Here we provide and collect many functions to generate math problems and step-by-step solutions for LLM training☆19Jun 21, 2023Updated 3 years ago
- Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible.☆17Aug 22, 2024Updated last year
- Co-Coder is a Python package that streamlines error debugging with OpenAI ChatGPT and Google Bard by providing hints, example code, and…☆44May 22, 2023Updated 3 years ago
- Repository for Sparse Finetuning of LLMs via a modified version of the MosaicML llmfoundry☆43Jan 15, 2024Updated 2 years ago
- ☆17Apr 7, 2025Updated last year
- Code for the paper "Exploiting Pretrained Biochemical Language Models for Targeted Drug Design", to appear in Bioinformatics, Proceedings…☆17Feb 26, 2024Updated 2 years ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆20Aug 28, 2023Updated 2 years ago
- ☆10Nov 29, 2024Updated last year
- ☆27Jan 14, 2025Updated last year
- Python monorepo template with Pants☆21Aug 19, 2023Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge☆61Apr 9, 2024Updated 2 years ago
- This repository contains the experiments performed on the GNNExplainer Code☆16Oct 12, 2021Updated 4 years ago
- ☆16Jun 25, 2025Updated last year
- ☆131Oct 1, 2024Updated last year
- ☆10Feb 6, 2025Updated last year
- ☆16Jul 11, 2023Updated 2 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆31Mar 5, 2024Updated 2 years ago
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 9 months ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆50Oct 9, 2025Updated 8 months ago
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications.☆23Sep 3, 2025Updated 9 months ago
- Public Inflection Benchmarks☆67Mar 6, 2024Updated 2 years ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- Scrape and export data from the Open LLM Leaderboard.☆48Dec 17, 2024Updated last year
- Korean psychological counseling dataset☆81Jun 20, 2023Updated 3 years ago
- ☆28Aug 30, 2023Updated 2 years ago
- Multi-Agent LLM Evaluation Docs: https://maseval.readthedocs.io/☆36May 31, 2026Updated last month
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS), with implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- ☆27Mar 13, 2024Updated 2 years ago
- ☆31Mar 23, 2024Updated 2 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- [CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal…☆22Jun 6, 2025Updated last year