A list of LLM benchmark frameworks.
☆73Feb 17, 2024Updated 2 years ago
Alternatives and similar repositories for llm-benchmark
Users that are interested in llm-benchmark are comparing it to the libraries listed below
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆20Jun 16, 2024Updated last year
- Reinforcement Learning with Pong in the Browser via TensorFlow.js☆17Jan 4, 2023Updated 3 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- An alternative way to calculate self-attention☆18May 25, 2024Updated last year
- ☆11Updated this week
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- ☆25Dec 13, 2024Updated last year
- ExplainitAll is a library for interpretable AI, designed for interpreting generative models (GPT-like) and vectorizers…☆19Oct 11, 2024Updated last year
- Local emulator for Hugging Face Inference Endpoints custom handlers☆27Jul 25, 2023Updated 2 years ago
- An end-to-end benchmark suite of multi-modal DNN applications for system-architecture co-design☆22Dec 13, 2024Updated last year
- ☆32Jul 2, 2025Updated 8 months ago
- ☆47Aug 5, 2025Updated 6 months ago
- Copy the MLP of llama3 8 times as 8 experts, create a router with random initialization, and add a load-balancing loss to construct an 8x8b Mo…☆27Jul 1, 2024Updated last year
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- A Complete Introduction to Building Generative AI Apps with Dify☆17May 25, 2025Updated 9 months ago
- ☆13Oct 5, 2025Updated 4 months ago
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)☆421Oct 25, 2025Updated 4 months ago
- My portfolio website made with React and Sass☆16Sep 5, 2024Updated last year
- ☆28Dec 4, 2025Updated 2 months ago
- ☆11Aug 29, 2025Updated 6 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆27Feb 13, 2026Updated 2 weeks ago
- Workflow automation, but you just describe what you want and it happens.☆27Nov 22, 2025Updated 3 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Jul 13, 2024Updated last year
- Tree-Invent: A novel molecular generative model constrained with topological tree☆13Jul 26, 2023Updated 2 years ago
- Node-RED Flow (and web page example) for the LLaMA AI model☆11Jul 27, 2023Updated 2 years ago
- A Deepfake detector based on a hybrid EfficientNet CNN and Vision Transformer architecture. The model is explainable by rendering a heatma…☆15Mar 16, 2022Updated 3 years ago
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated 2 months ago
- RAG Chatbot powered by Groq LPU, Ollama and Langchain☆13Mar 5, 2024Updated last year
- A multi-agent framework to help with your homework.☆10Mar 1, 2025Updated last year
- Getting an exact distance between gaps and objects is very difficult; this project explores some possibilities using OpenCV☆10Nov 24, 2018Updated 7 years ago
- This is a fork from Ryan Carson's AI Dev Tasks repository, with some code cleanup and refactoring to enable support for PostgreSQL databa…☆15Sep 8, 2025Updated 5 months ago
- Python Telegraph API.☆15Mar 22, 2025Updated 11 months ago
- ☆14Mar 21, 2024Updated last year
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆15Feb 15, 2023Updated 3 years ago
- Qingdao ship detection☆13Apr 16, 2025Updated 10 months ago
- ☆13Sep 14, 2021Updated 4 years ago
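- LangReact is a configuration-driven development tool for Planning Agent applications; through configuration and plugins, it can quickly add Planning capabilities to your GPT application.☆12Apr 23, 2024Updated last year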
- CRUD with Authentication and Authorization using the GetX CLI pattern and Supabase☆12Nov 5, 2023Updated 2 years ago