antonpk1 / stackfishLinks
Stackfish is an open-source LLM-powered pipeline designed to automatically solve competitive programming problems.
☆55Updated last year
Alternatives and similar repositories for stackfish
Users that are interested in stackfish are comparing it to the libraries listed below
Sorting:
- Pivotal Token Search☆144Updated last month
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆31Updated 9 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆251Updated last month
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 7 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Updated 3 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated 2 weeks ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Updated 6 months ago
- ☆62Updated 6 months ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Updated 7 months ago
- LLM reads a paper and produce a working prototype☆60Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- ☆67Updated 8 months ago
- ☆55Updated last year
- Training Proactive and Personalized LLM Agents☆98Updated 2 weeks ago
- Very minimal (and stateless) agent framework☆44Updated last year
- ☆57Updated 2 weeks ago
- ☆57Updated 11 months ago
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆42Updated last week
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 8 months ago
- Simple repository for training small reasoning models☆48Updated last year
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Updated 10 months ago
- The DPAB-α Benchmark☆32Updated last year
- ☆24Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆29Updated 10 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- ☆39Updated last year
- ☆107Updated 3 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆148Updated this week