Weekly visualization report of Open LLM model performance based on 4 metrics.
☆86Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for open-llm-leaderboard-report
Users that are interested in open-llm-leaderboard-report are comparing it to the libraries listed below.
- The Python package ExceptNotifier enhances the try-except statement, allowing you to receive detailed error messages via email or messeng…☆31Apr 15, 2024Updated 2 years ago
- Converting PDF files to text, mainly with a focus on arXiv papers.☆24Feb 19, 2024Updated 2 years ago
- Multi-label VOC (Voice of Customers) tag classification model using pretrained KoBERT☆15Apr 25, 2022Updated 4 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆15Dec 2, 2022Updated 3 years ago
- An example of MLflow Tracking and Models Using Factorization Machine Recommender model library, rankfm.☆10Sep 9, 2021Updated 4 years ago
- Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)☆14Nov 20, 2024Updated last year
- A collection of functions for generating math problems and step-by-step solutions for LLM training☆18Jun 21, 2023Updated 2 years ago
- Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible.☆17Aug 22, 2024Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆43Jan 15, 2024Updated 2 years ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆38Nov 11, 2025Updated 6 months ago
- Repository for organizing datasets and papers used in Open LLM.☆101Jul 6, 2023Updated 2 years ago
- ☆22Jan 8, 2026Updated 4 months ago
- ☆17Apr 7, 2025Updated last year
- ☆27Jan 14, 2025Updated last year
- Python monorepo template with Pants☆21Aug 19, 2023Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated 2 years ago
- ☆16Jun 25, 2025Updated 11 months ago
- ☆131Oct 1, 2024Updated last year
- ☆16Jul 11, 2023Updated 2 years ago
- ☆12Sep 8, 2022Updated 3 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆31Mar 5, 2024Updated 2 years ago
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 8 months ago
- ☆24Mar 25, 2026Updated 2 months ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆48Oct 9, 2025Updated 7 months ago
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications.☆23Sep 3, 2025Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- ☆23Jul 23, 2024Updated last year
- Korean psychological counseling dataset☆80Jun 20, 2023Updated 2 years ago
- ☆10Apr 21, 2024Updated 2 years ago
- ☆28Aug 30, 2023Updated 2 years ago
- Multi-Agent LLM Evaluation Docs: https://maseval.readthedocs.io/☆34Updated this week
- ☆27Mar 13, 2024Updated 2 years ago
- ☆31Mar 23, 2024Updated 2 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- [CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal…☆22Jun 6, 2025Updated 11 months ago
- speech to text gui for different (e.g. Whisper, Voxtral) models and backends, including whisper.cpp, crispasar, mlx-whisper, faster-whisp…☆18May 18, 2026Updated last week
- Source code used in the blog☆12Feb 6, 2024Updated 2 years ago
- Data and code for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 3 years ago