Includes examples on how to evaluate LLMs
☆23Nov 4, 2024Updated last year
Alternatives and similar repositories for evaluate-llms
Users that are interested in evaluate-llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Sep 23, 2024Updated last year
- ☆12Jun 13, 2025Updated 11 months ago
- Example for agent orchestration☆19Mar 31, 2025Updated last year
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆30Sep 25, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of AnimateDiff.☆10Sep 27, 2023Updated 2 years ago
- ☆18Apr 26, 2025Updated last year
- ☆11Apr 14, 2022Updated 4 years ago
- ☆12Oct 8, 2020Updated 5 years ago
- A curated list of reranking models, libraries, and resources for building high-quality Retrieval-Augmented Generation (RAG) applications.☆55Jan 20, 2026Updated 4 months ago
- Multi-Agent Collaboration Design Patterns Built on LangGraph with 10+ battle-tested patterns, each with complete code, architectu…☆48Apr 9, 2026Updated last month
- ☆13May 30, 2025Updated 11 months ago
- Automatically summarize lectures and ask questions about the course material☆13Apr 16, 2024Updated 2 years ago
- ☆13Jul 12, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Metrics, Benchmarks, and Practical Tools for Assessing Large Language Models☆26Feb 16, 2025Updated last year
- Probabilistic Solution of Differential Equations☆13Jun 19, 2022Updated 3 years ago
- MCP Server for kicking off and getting status of your crew deployments☆51Mar 19, 2025Updated last year
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆40Sep 1, 2025Updated 8 months ago
- Functional differential geometry in Julia☆12Oct 20, 2022Updated 3 years ago
- ☆20Aug 5, 2024Updated last year
- Evaluation kit for testing stateful agents☆71May 13, 2026Updated last week
- ☆11Jul 13, 2018Updated 7 years ago
- ☆24Dec 12, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The notebooks and scripts in this repository are created for learning purposes. Use this repo as a supplement to the video lectures provi…☆16Sep 30, 2023Updated 2 years ago
- This repository will take you through creating a FastAPI StableDiffusion app (including Dockerfile) all the way to adding a new feature u…☆38Nov 9, 2022Updated 3 years ago
- gpu-ray-tracing-in-unity implements by three-eyed-games☆20Jan 9, 2020Updated 6 years ago
- The Vulkan Tutorial adapted to SDL2, VMA, Slang, Volk, Imgui and pure functions.☆12Apr 21, 2025Updated last year
- ☆15Aug 16, 2023Updated 2 years ago
- whisk is a data science project framework that makes collaboration, reproducibility, and deployment "just work".☆11Dec 26, 2022Updated 3 years ago
- ☆12Jun 2, 2023Updated 2 years ago
- vietnamese-ready-to-production RAG☆19Jul 17, 2024Updated last year
- 🦥 A tutorial to illustrate how to lazyload images in a react app using react lazy load image component library☆11Jun 6, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Jax Decompiler☆16Apr 22, 2025Updated last year
- An implementation of a Vulkan RayQuery ray tracing integration project in Unity./移动端 Vulkan 光追实现☆17Feb 23, 2025Updated last year
- Unity sample project using instancing in HDRP path tracing.☆17May 1, 2025Updated last year
- ☆29Aug 21, 2025Updated 9 months ago
- An experimental and alternative approach to Finetuning and RAG.☆34Dec 9, 2023Updated 2 years ago
- Mixtral finetuning☆19Feb 2, 2024Updated 2 years ago
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.☆37Dec 29, 2024Updated last year