cpldcpu / llmbenchmark
Various LLM Benchmarks
☆21Updated 3 weeks ago
Alternatives and similar repositories for llmbenchmark
Users who are interested in llmbenchmark are comparing it to the repositories listed below
- Auto Thinking Mode switch for Qwen3 in Open WebUI☆65Updated last month
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆35Updated 2 months ago
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de…☆45Updated last year
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆36Updated 2 months ago
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- 👷‍♂️ Minion is Agent's Brain. Minion is designed to execute any type of queries, offering a variety of features that demonstrate its flex…☆22Updated 2 weeks ago
- Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)☆86Updated 2 weeks ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆51Updated last month
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆80Updated last month
- A set of tools to create synthetically-generated data from documents☆18Updated last week
- LLM inference in C/C++☆77Updated this week
- The DPAB-α Benchmark☆25Updated 5 months ago
- ☆90Updated 3 months ago
- Very minimal (and stateless) agent framework☆44Updated 5 months ago
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆50Updated 3 weeks ago
- ☆48Updated 4 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆20Updated 6 months ago
- The heart of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 6 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated last month
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 11 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 4 months ago
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆49Updated 11 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆86Updated last week
- Prepare for DeepSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆71Updated 4 months ago
- Self-hosted LLM chatbot arena, with yourself as the only judge☆41Updated last year
- Set up the env for vLLM users☆16Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆101Updated 2 weeks ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated 2 months ago