mc-bench / mc-bench-backendLinks
☆34Updated last month
Alternatives and similar repositories for mc-bench-backend
Users that are interested in mc-bench-backend are comparing it to the libraries listed below
Sorting:
- ☆18Updated 5 months ago
- Worker to orchestrate and manage running an arbitrary number of LLM-generated builds concurrently using containerized Minecraft Servers.☆167Updated 9 months ago
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…☆523Updated 8 months ago
- Live-bending a foundation model’s output at neural network level.☆264Updated 5 months ago
- i will automate factorio☆111Updated last year
- ☆151Updated 2 months ago
- command loom interface☆109Updated 7 months ago
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆116Updated 3 months ago
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆209Updated 7 months ago
- A graph visualization of attention☆57Updated 4 months ago
- It is windows, but all of the apps are ai generated.☆179Updated last year
- ☆143Updated 7 months ago
- Sidecar is the AI brains for the Aide editor and works alongside it, locally on your machine☆588Updated 4 months ago
- Claude Deep Research config for Claude Code.☆217Updated 6 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆290Updated last month
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆313Updated 3 months ago
- Interactive timeline of AI history☆61Updated 3 weeks ago
- ☆507Updated last month
- ☆133Updated 4 months ago
- explore token trajectory trees on instruct and base models☆133Updated 3 months ago
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 6 months ago
- ☆316Updated last month
- Basic semantic search for a tweet archive☆59Updated 7 months ago
- A LLM trained only on data from certain time periods to reduce modern bias☆536Updated last week
- pipeline to auto (scrape => clean => analyze => chat with) tons of data☆40Updated last year
- deepseek running locally in your browser☆319Updated 3 weeks ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 10 months ago
- smol models are fun too☆93Updated 10 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated last year
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆697Updated this week