sam-paech / slop-forensicsLinks

☆308

Alternatives and similar repositories for slop-forensics

Users that are interested in slop-forensics are comparing it to the libraries listed below

Sorting:

d0rc / deepdive
Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…
☆44Updated last year
lechmazur / confabulations
Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.
☆243Updated 6 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆150Updated last month
TC-Zheng / ActuosusAI
AI management tool
☆119Updated last year
QuixiAI / dolphin-logger
☆107Updated 3 months ago
vgel / logitloom
explore token trajectory trees on instruct and base models
☆150Updated 8 months ago
janhq / ReZero
☆159Updated 9 months ago
jd-3d / SOLOBench
☆135Updated 9 months ago
obalcells / hallucination_probes
Real-Time Detection of Hallucinated Entities in Long-Form Generation
☆278Updated 2 months ago
codelion / ellora
Enhancing LLMs with LoRA
☆206Updated 3 months ago
willkurt / token-explorer
A simple tool that let's you explore different possible paths that an LLM might sample.
☆201Updated 9 months ago
lechmazur / step_game
Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…
☆85Updated 2 months ago
sam-paech / antislop-sampler
☆337Updated 6 months ago
matteoserva / GraphLLM
☆209Updated last month
nath1295 / MLX-Textgen
A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.
☆100Updated 7 months ago
lechmazur / nyt-connections
Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words
☆193Updated this week
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆85Updated 5 months ago
willccbb / claude-deep-research
Claude Deep Research config for Claude Code.
☆226Updated 10 months ago
av / klmbr
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆86Updated last year
hammer-mt / DSPyUI
A user interface for DSPy
☆211Updated 4 months ago
QuixiAI / OpenChatML
☆166Updated 6 months ago
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆97Updated 9 months ago
shobrook / weightgain
Train an adapter for any embedding model in under a minute
☆130Updated 10 months ago
shobrook / promptimal
A very fast, very minimal prompt optimizer
☆298Updated last year
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆57Updated last year
lechmazur / generalization
Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…
☆63Updated 4 months ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆87Updated 8 months ago
codelion / pts
Pivotal Token Search
☆145Updated last month
teknium1 / ShareGPT-Builder
☆119Updated last year
MoonshotAI / K2-Vendor-Verifier
Verify Precision of all Kimi K2 API Vendor
☆513Updated 2 weeks ago