lechmazur / nyt-connections
Benchmark that evaluates LLMs using 436 NYT Connections puzzles
☆12Updated this week
Alternatives and similar repositories for nyt-connections:
Users that are interested in nyt-connections are comparing it to the libraries listed below
- Web Interface for Vision Language Models Including InternVLM2☆17Updated 6 months ago
- Yet Another (LLM) Web UI, made with Gemini☆11Updated last month
- Training hybrid models for dummies.☆18Updated 2 weeks ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆16Updated last week
- Simple LLM inference server☆20Updated 7 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, moti…☆35Updated last week
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated 11 months ago
- A QT GUI for large language models☆28Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆70Updated last month
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago
- ☆25Updated last week
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated last year
- ANE accelerated embedding models!☆15Updated last month
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Github repo for Peifeng's internship project☆13Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆12Updated 5 months ago
- Mistral-7B finetuned for function calling☆15Updated last year
- ☆17Updated 2 weeks ago
- ☆11Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated last month
- ☆21Updated 7 months ago
- IA-powered Ollama Modelfile Generator☆23Updated 8 months ago
- A simple GUI utility for gathering LIMA-like chat data.☆22Updated 2 months ago
- ☆27Updated 5 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆13Updated 2 weeks ago
- Web page with political compass quiz results for open LLMs☆37Updated last year
- ☆12Updated 4 months ago
- Build HTML artefacts with Ollama☆11Updated last month
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 5 months ago