jd-3d / SOLOBench
☆135Updated 9 months ago
Alternatives and similar repositories for SOLOBench
Users who are interested in SOLOBench are comparing it to the libraries listed below.
- Easily view and modify JSON datasets for large language models☆86Updated 8 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Updated last year
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆85Updated last month
- ☆336Updated 6 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆241Updated 5 months ago
- AI management tool☆119Updated last year
- ☆109Updated 5 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆49Updated 3 months ago
- ☆209Updated 3 weeks ago
- Open source LLM UI, compatible with all local LLM providers.☆177Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Updated last year
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆67Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- ☆90Updated last month
- ☆159Updated 9 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆56Updated 11 months ago
- ☆135Updated last month
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆81Updated this week
- Distributed inference for MLX LLMs☆100Updated last year
- A fast batching API to serve LLM models☆189Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.☆63Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- Sparse Inferencing for transformer based LLMs☆218Updated 5 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆100Updated 7 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- ☆304Updated 3 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆180Updated last year
- automatically quant GGUF models☆219Updated last month
- ☆178Updated 5 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆457Updated 6 months ago