johnbean393 / SVGBench
SVGBench: A challenging benchmark that tests the knowledge, coding, and physical reasoning capabilities of LLMs.
☆60 Updated 2 weeks ago
Alternatives and similar repositories for SVGBench
Users who are interested in SVGBench are comparing it to the repositories listed below
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a… ☆39 Updated 9 months ago
- Verify the precision of all Kimi K2 API vendors ☆491 Updated this week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆223 Updated 2 months ago
- CursorCore: Assist Programming through Aligning Anything ☆133 Updated 10 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆559 Updated last month
- ☆15 Updated 3 weeks ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI … ☆54 Updated 11 months ago
- ☆303 Updated 2 months ago
- ☆94 Updated 6 months ago
- Pivotal Token Search ☆142 Updated 3 weeks ago
- The State Of The Art, intelligence ☆157 Updated 4 months ago
- A simple tool that lets you explore different possible paths that an LLM might sample. ☆199 Updated 8 months ago
- Sparse inferencing for transformer-based LLMs ☆216 Updated 5 months ago
- ☆135 Updated 8 months ago
- Train Large Language Models on MLX. ☆239 Updated last month
- An MCP server that uses the Osmosis-Apply-1.7B model to apply code merges ☆53 Updated 6 months ago
- ☆62 Updated 6 months ago
- ☆107 Updated 2 months ago
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models" ☆312 Updated this week
- ☆158 Updated 8 months ago
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX. ☆99 Updated 6 months ago
- LLM inference in C/C++ ☆104 Updated 3 weeks ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth ☆223 Updated this week
- Very minimal (and stateless) agent framework ☆44 Updated 11 months ago
- Distributed inference for MLX LLMs ☆100 Updated last year
- ☆34 Updated 9 months ago
- Automated LLM Coding Tournaments. There can be only one (winning code solution from the competing AIs) ☆44 Updated 9 months ago
- Easy-to-use, high-performance knowledge distillation for LLMs ☆96 Updated 8 months ago
- The heart of The Pulsar App: fast, secure, and shared inference with a modern UI ☆59 Updated last year
- ☆55 Updated 5 months ago