Tiny evaluation of leading LLMs on competitive programming problems
☆14 · Nov 28, 2024 · Updated last year
Alternatives and similar repositories for cp_eval
Users who are interested in cp_eval are comparing it to the repositories listed below:
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO ☆12 · Feb 27, 2025 · Updated last year
- ☆15 · Feb 23, 2026 · Updated last week
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu… ☆27 · Jul 27, 2025 · Updated 7 months ago
- Project code for training LLMs to write better unit tests + code ☆21 · May 19, 2025 · Updated 9 months ago
- ☆23 · Jan 17, 2025 · Updated last year
- An advanced AI-powered conversational agent leveraging the Llama 3.2 model and Phidata framework. Features include reasoning, natural lan… ☆16 · Oct 29, 2024 · Updated last year
- Reinforcing General Reasoning without Verifiers ☆96 · Jun 24, 2025 · Updated 8 months ago
- MB-X.01 · Logical Origin Node (L.O.N.): TruthΩ → Co⁺ → Score⁺. Verifiable demo and spec. https://massimiliano.neocities.org/ ☆59 · Feb 3, 2026 · Updated last month
- Code for paper "Analog Foundation Models" ☆31 · Sep 18, 2025 · Updated 5 months ago
- prompt engineering experiments with DSPy GEPA and TextGrad ☆67 · Sep 2, 2025 · Updated 6 months ago
- ☆68 · Feb 15, 2026 · Updated 2 weeks ago
- ☆32 · Feb 11, 2025 · Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 · Feb 5, 2025 · Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆42 · Dec 29, 2025 · Updated 2 months ago
- Reinforcement learning examples for Torobo based on IsaacLab ☆36 · Dec 3, 2024 · Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆86 · Aug 20, 2025 · Updated 6 months ago
- Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025 ☆23 · Nov 13, 2025 · Updated 3 months ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action ☆37 · Apr 3, 2023 · Updated 2 years ago
- General purpose ant system ☆10 · Feb 15, 2023 · Updated 3 years ago
- ☆16 · Feb 22, 2025 · Updated last year
- Amp up your command line: assign variables to the output of common commands. A great complement to tab completion ☆21 · Jul 27, 2022 · Updated 3 years ago
- ☆10 · Nov 17, 2022 · Updated 3 years ago
- Development of low-cost IoT-based vibration monitoring and spectrum analysis systems for technical objects ☆11 · Oct 26, 2020 · Updated 5 years ago
- Enhance your Google account security with this comprehensive guide. It covers strong passwords, two-factor authentication, phishing preve… ☆11 · Nov 21, 2024 · Updated last year
- Solutions to Sphere Online Judge problems ☆11 · Apr 22, 2023 · Updated 2 years ago
- Stripe Payment Gateway integration in Django ☆10 · May 24, 2021 · Updated 4 years ago
- Enemies for your LLM ☆35 · Jan 20, 2026 · Updated last month
- MLX-based QA pair generator and LLM finetuning tool in Streamlit ☆42 · Oct 18, 2025 · Updated 4 months ago
- Debiasing Through Data Attribution ☆12 · May 23, 2024 · Updated last year
- A corporate Slack-like messenger built with the YugabyteDB database, Vaadin, Spring Boot, and Kong. ☆11 · Nov 2, 2023 · Updated 2 years ago
- ☆14 · Mar 20, 2025 · Updated 11 months ago
- ☆10 · Mar 5, 2020 · Updated 5 years ago
- ☆37 · May 11, 2024 · Updated last year
- Chennaipy's website at chennaipy.org ☆13 · Updated this week
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze ☆11 · Nov 16, 2021 · Updated 4 years ago
- ACL24 ☆11 · Jun 7, 2024 · Updated last year
- Exploration of automated dataset selection approaches at large scales. ☆52 · Mar 4, 2025 · Updated last year
- Two separate tutorial series for the OpenAI Swarm library, each with 10 files ranging from basic to advanced ☆14 · Jan 14, 2025 · Updated last year
- ☆16 · Jun 30, 2025 · Updated 8 months ago