Scenario-based Evaluation dataset for LLM (beta)
☆135Feb 6, 2024Updated 2 years ago
Alternatives and similar repositories for LLMScenarioEval
Users who are interested in LLMScenarioEval are comparing it to the libraries listed below.
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- Running models faster and easier, support for x86 and ARM (M1, M1Pro).☆19May 19, 2022Updated 4 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Feb 15, 2024Updated 2 years ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆61Oct 1, 2024Updated last year
- Creates a git repo showing the changes to Minecraft's history over time, including jar contents and source code☆13Jul 6, 2025Updated 10 months ago
- Using RAG to learn RAG☆11Sep 6, 2024Updated last year
- BakaXL's Minecraft launcher core, written in Rust☆10Jun 21, 2023Updated 2 years ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- ☆10Mar 6, 2024Updated 2 years ago
- Stable-version repository of FADING, "Face Aging via Diffusion-based Editing" based on official repository.☆17Oct 5, 2025Updated 7 months ago
- ☆16Jul 7, 2024Updated last year
- Evaluating LLMs with CommonGen-Lite☆95Mar 21, 2024Updated 2 years ago
- NutritionMaster_ShiShenPro - "Pro Nutrition, Pro Life, Master Your Diet with ShiShenPro" (Nutrition Master ShiShenPro: professional nutrition, professional living; manage your diet and recipes with ShiShen)☆10Aug 20, 2024Updated last year
- A BetterNCM plugin that gives other plugins the ability to fetch song lyrics and parse song data☆14Jan 22, 2023Updated 3 years ago
- ChatDataExpert☆24Jul 13, 2023Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆12May 20, 2025Updated last year
- Synchronize git repositories like a mirror.☆13May 12, 2022Updated 4 years ago
- A package dedicated for running benchmark agreement testing☆18Sep 18, 2025Updated 8 months ago
- OpenMMLab learning notes and exercises☆11May 29, 2023Updated 2 years ago
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?".☆13Feb 10, 2025Updated last year
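- An "identified at a glance" (一眼丁真) Ding Zhen meme image library built for the author's own bot; it currently holds 1,400+ unique Ding Zhen meme images and is now public☆25Mar 6, 2025Updated last year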
- ☆50Feb 20, 2026Updated 3 months ago
- ☆41Mar 6, 2026Updated 2 months ago
- ☆25Apr 10, 2025Updated last year
- ☆17Jan 3, 2024Updated 2 years ago
- ☆12May 16, 2024Updated 2 years ago
- An adjustment of the existing Virtual Makeup repository https://github.com/srivatsan-ramesh/Virtual-Makeup and https://github.com/badarsh…☆11Mar 13, 2020Updated 6 years ago
- Automatic grouping of browser tabs☆11Jan 26, 2026Updated 4 months ago
- ☆458Aug 9, 2023Updated 2 years ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆103Jan 11, 2026Updated 4 months ago
- ☆21Apr 17, 2025Updated last year
- Fork of http://dualmonitortool.sourceforge.net/☆19Aug 29, 2025Updated 8 months ago
- Scoreboarding algorithm lab for an advanced computer architecture course☆13Dec 22, 2018Updated 7 years ago
- Stable-diffusion-WebUI extensions, which enable tensorrt accelerated Unet for SDXL base model☆12Oct 18, 2023Updated 2 years ago
- A library to manipulate Inkscape SVG content using Python 3☆12Apr 28, 2021Updated 5 years ago
- Build a bridge that connects beginners to deep reinforcement learning.☆11Sep 23, 2024Updated last year
- ☆21Jul 3, 2025Updated 10 months ago
- [ACMMM 2022] ReCoRo: Region-Controllable Robust Light Enhancement by User-Specified Imprecise Masks☆15Feb 6, 2023Updated 3 years ago