mc-bench / mc-bench-backendLinks
☆35Updated last month
Alternatives and similar repositories for mc-bench-backend
Users that are interested in mc-bench-backend are comparing it to the libraries listed below
Sorting:
- ☆18Updated 6 months ago
- Worker to orchestrate and manage running an arbitrary number of LLM-generated builds concurrently using containerized Minecraft Servers.☆166Updated 11 months ago
- A LLM trained only on data from certain time periods to reduce modern bias☆607Updated last month
- It is windows, but all of the apps are ai generated.☆180Updated last year
- Twitter (sometimes known as X) can look prettier with this simple AddOn!☆117Updated 6 months ago
- explore token trajectory trees on instruct and base models☆148Updated 5 months ago
- i will automate factorio☆111Updated last year
- Interactive timeline of AI history☆62Updated 2 months ago
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆118Updated 4 months ago
- ☆89Updated 7 months ago
- No code AI agents☆445Updated 9 months ago
- Live-bending a foundation model’s output at neural network level.☆269Updated 7 months ago
- A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o,…☆110Updated 6 months ago
- Claude Deep Research config for Claude Code.☆223Updated 7 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆106Updated 8 months ago
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…☆529Updated 9 months ago
- ☆321Updated 3 months ago
- Dive endlessly deeper into a single concept using AI☆98Updated 6 months ago
- command loom interface☆110Updated 9 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆291Updated 2 months ago
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆212Updated 8 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated last year
- Turn a Github Repo's contents into a big prompt for long-context models like Claude 3 Opus.☆217Updated 8 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆313Updated 4 months ago
- Basic semantic search for a tweet archive☆60Updated 8 months ago
- ☆152Updated 3 months ago
- A graph visualization of attention☆57Updated 5 months ago
- ☆92Updated last year
- the simplest self-building general autonomous agent☆325Updated last year
- ☆142Updated 8 months ago