R0bk / killedbyllm
☆94 Updated 10 months ago
Alternatives and similar repositories for killedbyllm
Users who are interested in killedbyllm are comparing it to the libraries listed below
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co… ☆291 Updated 2 months ago
- Live-bending a foundation model’s output at neural network level. ☆269 Updated 7 months ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words ☆156 Updated 3 weeks ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig… ☆225 Updated 10 months ago
- Applying the ideas of Deepseek R1 to computer use ☆216 Updated 9 months ago
- See Through Your Models ☆401 Updated 4 months ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆237 Updated 2 years ago
- Pivotal Token Search ☆131 Updated 3 months ago
- LLM benchmark: Generate an SVG of a pelican riding a bicycle ☆137 Updated 3 months ago
- Detect whether or not an audio file was generated by NotebookLM ☆140 Updated 11 months ago
- Visual inference exploration & experimentation playground ☆96 Updated 11 months ago
- ☆283 Updated last week
- Editor with LLM generation tree exploration ☆77 Updated 8 months ago
- Build Secure and Compliant AI agents and MCP Servers. YC W23 ☆152 Updated 5 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient … ☆222 Updated 10 months ago
- Dead Simple LLM Abliteration ☆235 Updated 8 months ago
- Transformer GPU VRAM estimator ☆67 Updated last year
- Mistral7B playing DOOM ☆138 Updated last year
- Turn any input document into a sophisticated, context-dependent mindmap that distills the meaning and structure of the document. ☆123 Updated 8 months ago
- Physical AI Assistant that illuminates your life ☆188 Updated last month
- Replace OpenAI with Llama.cpp Automagically. ☆325 Updated last year
- ☆116 Updated 9 months ago
- ☆198 Updated 6 months ago
- Your AI research assistant ☆79 Updated 7 months ago
- AI management tool ☆121 Updated last year
- High-Performance Implementation of OpenAI's TikToken. ☆457 Updated 4 months ago
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM… ☆76 Updated 2 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers. ☆230 Updated 3 months ago
- ☆150 Updated 4 months ago
- explore token trajectory trees on instruct and base models ☆148 Updated 5 months ago