dsdanielpark / open-llm-leaderboard-report
Weekly visualization report of Open LLM model performance based on 4 metrics.
☆86Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for open-llm-leaderboard-report
Users who are interested in open-llm-leaderboard-report are comparing it to the libraries listed below:
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Awesome-GenAITech: a curated list of Generative AI Techniques☆11Jul 11, 2023Updated 2 years ago
- Code for the paper "Exploiting Pretrained Biochemical Language Models for Targeted Drug Design", to appear in Bioinformatics, Proceedings…☆17Feb 26, 2024Updated last year
- Chatbot for quickly finding answers to questions.☆11Oct 25, 2020Updated 5 years ago
- An example of MLflow Tracking and Models Using Factorization Machine Recommender model library, rankfm.☆10Sep 9, 2021Updated 4 years ago
- ☆31Mar 23, 2024Updated last year
- The Python package ExceptNotifier enhances the try-except statement, allowing you to receive detailed error messages via email or messeng…☆32Apr 15, 2024Updated last year
- Role-playing based on the RWKV model; in practice a version of RWKV_Role_Playing modified beyond recognition☆17Aug 17, 2023Updated 2 years ago
- ☆17Apr 7, 2025Updated 10 months ago
- ☆15Dec 2, 2022Updated 3 years ago
- Script that uses GPT-4 to automatically evaluate language model responses☆16Jun 6, 2024Updated last year
- Multi-label VOC (Voice of Customers) tag classification model using pretrained KoBERT☆16Apr 25, 2022Updated 3 years ago
- ☆19Aug 3, 2024Updated last year
- Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.☆20Apr 30, 2024Updated last year
- ☆23Dec 5, 2025Updated 2 months ago
- ☆47Oct 29, 2024Updated last year
- ☆20Nov 3, 2024Updated last year
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- Converting PDF files to text, mainly with a focus on arXiv papers.☆24Feb 19, 2024Updated last year
- Python monorepo template with Pants☆21Aug 19, 2023Updated 2 years ago
- Scrape and export data from the Open LLM Leaderboard.☆48Dec 17, 2024Updated last year
- ☆20Jan 8, 2026Updated last month
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- This repository contains the experiments performed on the GNNExplainer Code☆16Oct 12, 2021Updated 4 years ago
- Reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction"☆22May 29, 2024Updated last year
- Korean psychological counseling dataset☆81Jun 20, 2023Updated 2 years ago
- ☆54Oct 24, 2024Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Mar 2, 2024Updated last year
- A joint community effort to create one central leaderboard for LLMs.☆308Aug 23, 2024Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year
- convert hwp to hwpx☆37Jan 20, 2026Updated 3 weeks ago
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Jul 2, 2021Updated 4 years ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Feb 29, 2024Updated last year
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆30Mar 25, 2023Updated 2 years ago
- Kimwoonggon - C++ LibTorch DLL with GPU version of YOLOv8 Seg and inference in C#☆25Feb 10, 2026Updated last week
- Localization of the game Star Citizen☆13Updated this week
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆111May 22, 2025Updated 8 months ago
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated last year