The State Of The Art, intelligence
☆158Aug 12, 2025Updated 8 months ago
Alternatives and similar repositories for Crux
Users that are interested in Crux are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- ☆39Aug 4, 2025Updated 8 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆71May 5, 2025Updated 11 months ago
- ☆15Apr 26, 2025Updated last year
- ALAS: Autonomous Learning Agent System☆15Aug 14, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Nov 19, 2024Updated last year
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 8 months ago
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆95Mar 12, 2026Updated last month
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Pokedex for LLMs☆14Apr 14, 2025Updated last year
- DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification (CVPR 2026)☆62Mar 31, 2026Updated last month
- [ICLR2026] Test-Time Scaling with Reflective Generative Model☆302Jan 28, 2026Updated 3 months ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆19Jun 29, 2025Updated 10 months ago
- ☆11May 18, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆12May 20, 2025Updated 11 months ago
- ☆17Apr 20, 2025Updated last year
- LLMProc: Unix-inspired runtime that treats LLMs as processes.☆34Jul 17, 2025Updated 9 months ago
- utilities for batched llm calls with retries☆50Updated this week
- General benchmarking apparatus for running multi-agent systems against benchmarks☆46Apr 13, 2026Updated 2 weeks ago
- State-of-the-art prompting techniques implementation with DSpy - Manager-style prompts, role personas, meta-prompting, and more☆54Apr 11, 2026Updated 2 weeks ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- ☆25May 23, 2025Updated 11 months ago
- ☆61Apr 8, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆482Sep 27, 2024Updated last year
- Browser-based Voice Assistant☆43Mar 31, 2023Updated 3 years ago
- ☆37Feb 5, 2025Updated last year
- All information and news with respect to Falcon-H1 series☆116Oct 9, 2025Updated 6 months ago
- simple grpo☆12May 28, 2025Updated 11 months ago
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆47Sep 2, 2025Updated 7 months ago
- A benchmark for conversational bargaining by language models. In each 20‑round match one LLM plays buyer, one plays seller, and both hold…☆35Aug 21, 2025Updated 8 months ago
- ☆72Oct 23, 2025Updated 6 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆329Mar 31, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆35Nov 30, 2024Updated last year
- Portfolio REgret for Confidence SEquences☆21Jan 6, 2026Updated 3 months ago
- ☆13Nov 11, 2023Updated 2 years ago
- Lego for GRPO☆30May 27, 2025Updated 11 months ago
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆123Jan 22, 2026Updated 3 months ago
- A multimodal live AI assistant designed to enhance the browsing experience using Gemini.☆11Feb 15, 2025Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆104Jul 19, 2025Updated 9 months ago