☆136May 2, 2025Updated last year
Alternatives and similar repositories for SOLOBench
Users that are interested in SOLOBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆72Apr 16, 2026Updated last month
- ☆27Jun 11, 2025Updated 11 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆36Nov 20, 2025Updated 6 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆304Jan 7, 2026Updated 4 months ago
- ☆23Sep 27, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A benchmark for emotional intelligence in large language models☆427Jul 26, 2024Updated last year
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆52Feb 10, 2026Updated 3 months ago
- Real-time webcam demo with SmolVLM(mlx-community/SmolVLM-Instruct-4bit) and MLX-VLM☆26Jun 12, 2025Updated 11 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆98Apr 2, 2026Updated last month
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.☆57Jan 11, 2026Updated 4 months ago
- ☆16Feb 21, 2026Updated 3 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆35Nov 20, 2025Updated 6 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆12Jun 25, 2024Updated last year
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆54Mar 11, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆87Dec 9, 2025Updated 5 months ago
- Chrome extension that provides comprehensive browser fingerprint protection by defending against various tracking techniques used across …☆29Oct 26, 2025Updated 7 months ago
- ☆241Mar 9, 2025Updated last year
- ☆57Feb 18, 2025Updated last year
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆247Aug 7, 2025Updated 9 months ago
- ☆332Nov 1, 2025Updated 6 months ago
- SVGBench: A challenging LLM benchmark that tests knowledge, coding, physical reasoning capabilities of LLMs.☆68Feb 12, 2026Updated 3 months ago
- Automated speech dataset creator☆221Jun 12, 2025Updated 11 months ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Apr 2, 2025Updated last year
- Tiny Llama model trained to play chess☆30Jul 22, 2025Updated 10 months ago
- A fairly lightweight daemon that keeps your computer awake. Designed for rootless environments.☆26May 3, 2019Updated 7 years ago
- ☆78Jun 20, 2025Updated 11 months ago
- ☆17Aug 5, 2025Updated 9 months ago
- Fast inference engine for Transformer models☆57Nov 9, 2024Updated last year
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 11 months ago
- FamilyBench evaluation tool for testing the relational reasoning capabilities of Large Language Models (LLMs).☆46May 4, 2026Updated 3 weeks ago
- This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, moti…☆379Apr 29, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Gradient Descent optimizers for Julia☆12May 26, 2020Updated 6 years ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆689Mar 22, 2025Updated last year
- ☆345Mar 5, 2026Updated 2 months ago
- Limopola is an AI platform that allows you to communicate with a wide range of AI models. It features autonomous agents, model-agnostic r…☆111Dec 13, 2025Updated 5 months ago
- A ComfyUI extension for OmniGen2☆48Jul 1, 2025Updated 10 months ago
- Interactive levels adjustment node for ComfyUI that provides a real-time levels adjustment tool directly within the user interface. It al…☆42Aug 23, 2025Updated 9 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆41Nov 11, 2025Updated 6 months ago