sam-paech / slop-forensicsLinks
☆289Updated last month
Alternatives and similar repositories for slop-forensics
Users that are interested in slop-forensics are comparing it to the libraries listed below
Sorting:
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆43Updated last year
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆237Updated 3 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆145Updated last month
- A simple tool that let's you explore different possible paths that an LLM might sample.☆193Updated 7 months ago
- AI management tool☆121Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 9 months ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆163Updated 2 weeks ago
- ☆107Updated last month
- ☆158Updated 7 months ago
- Train Large Language Models on MLX.☆223Updated this week
- explore token trajectory trees on instruct and base models☆149Updated 6 months ago
- Official repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching"☆170Updated 4 months ago
- ☆135Updated 7 months ago
- Enhancing LLMs with LoRA☆177Updated last month
- Verify Precision of all Kimi K2 API Vendor☆442Updated 2 weeks ago
- A flexible, adaptive classification system for dynamic text classification☆506Updated last month
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 3 months ago
- ☆209Updated 2 months ago
- ☆164Updated 3 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆273Updated last month
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆112Updated 5 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆99Updated 5 months ago
- ☆234Updated last week
- A user interface for DSPy☆198Updated 2 months ago
- Claude Deep Research config for Claude Code.☆223Updated 8 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆210Updated last month
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆206Updated 3 months ago
- Simple UI for debugging correlations of text embeddings☆302Updated 6 months ago
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆77Updated 3 months ago