R0bk / killedbyllm
☆95Updated 6 months ago
Alternatives and similar repositories for killedbyllm
Users who are interested in killedbyllm are comparing it to the repositories listed below
- Benchmark that evaluates LLMs using 651 NYT Connections puzzles extended with extra trick words☆130Updated this week
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆282Updated this week
- Applying the ideas of Deepseek R1 to computer use☆214Updated 5 months ago
- Live-bending a foundation model’s output at neural network level.☆263Updated 3 months ago
- Pivotal Token Search☆109Updated this week
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆222Updated 6 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆78Updated 9 months ago
- AI management tool☆118Updated 8 months ago
- Mistral7B playing DOOM☆132Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆183Updated last week
- A GTK graphical interface for chatting with large language models (LLMs)☆80Updated last month
- Build Secure and Compliant AI agents and MCP Servers. YC W23☆145Updated last month
- An LLM trained only on data from certain time periods to reduce modern bias☆197Updated this week
- ☆116Updated 5 months ago
- Visual inference exploration & experimentation playground☆94Updated 7 months ago
- An AI agent library using Python as the common language to define executable actions and tool interfaces.☆83Updated this week
- Conversation logs with Claude 3.5 Sonnet to try and iteratively optimize code☆99Updated 6 months ago
- Sort input lines semantically with llm☆119Updated last month
- DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.☆178Updated 2 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆221Updated 7 months ago
- Detect whether or not an audio file was generated by NotebookLM☆138Updated 7 months ago
- llm-consortium orchestrates multiple LLMs, iteratively refines & achieves consensus.☆259Updated last week
- Dead Simple LLM Abliteration☆222Updated 4 months ago
- Replace OpenAI with Llama.cpp Automagically.☆320Updated last year
- ☆143Updated last week
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆140Updated last month
- ☆163Updated 3 months ago
- ☆215Updated 4 months ago
- explore token trajectory trees on instruct and base models☆134Updated last month