anpaure/cp_eval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/anpaure/cp_eval)

anpaure / cp_eval

Tiny evaluation of leading LLMs on competitive programming problems

☆14

Alternatives and similar repositories for cp_eval

Users that are interested in cp_eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WolframRavenwolf / MMLU-Pro
View on GitHub
MMLU-Pro eval results
☆15Aug 21, 2025Updated 11 months ago
QuixiAI / dolphin-utils
View on GitHub
☆17Feb 23, 2026Updated 5 months ago
the-laughing-monkey / agent-rl
View on GitHub
Scripts for training Qwen 2.5 VL with ms-swift and GRPO
☆12Feb 27, 2025Updated last year
vincentamato / mlx-esm-2
View on GitHub
An MLX implementation of Meta AI's ESM-2 protein language model
☆16Aug 16, 2025Updated 11 months ago
Goekdeniz-Guelmez / MLX-Benchmark
View on GitHub
The best benchmark for LLMs on Apple's MLX framework knowledge and coding tasks.
☆36Jun 12, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
uncenter / uncenter.dev
View on GitHub
🌐 My home on the internet.
☆15Updated this week
rosmineb / unit_test_rl
View on GitHub
Project code for training LLMs to write better unit tests + code
☆22May 19, 2025Updated last year
IBM / ColPret
View on GitHub
Efficient Scaling laws and collaborative pretraining.
☆23Updated this week
TheSethRose / Agent-Chat
View on GitHub
An advanced AI-powered conversational agent leveraging the Llama 3.2 model and Phidata framework. Features include reasoning, natural lan…
☆15Oct 29, 2024Updated last year
hydroo / macos-core-to-core-latency
View on GitHub
Core-to-core latency benchmark that works on Apple MacOS without hard affinity
☆20May 9, 2026Updated 2 months ago
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 6 months ago
Hmbown / ZMLX
View on GitHub
Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon
☆47Mar 31, 2026Updated 3 months ago
Bent-Solutions / hermes-bench
View on GitHub
Local benchmarking UI for LLMs and AI agents
☆20Apr 13, 2026Updated 3 months ago
halfprice06 / huberman-rlm
View on GitHub
☆29Jan 19, 2026Updated 6 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
BAI-LAB / MoE-CL
View on GitHub
[WWW 2026 Oral] MoE-CL:Self-Evolving LLMs via Continual Instruction Tuning
☆21Dec 1, 2025Updated 7 months ago
Quitch / Queller-AI
View on GitHub
Improved AI for Planetary Annihilation
☆17Jul 7, 2026Updated 2 weeks ago
keizerworks / keizer-auth
View on GitHub
API for keizer-auth
☆15Jan 1, 2025Updated last year
diicellman / dynamite-dogs
View on GitHub
BH hackathon
☆14Apr 4, 2024Updated 2 years ago
dnakov / llm-asi-arch
View on GitHub
🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu…
☆30Jul 27, 2025Updated 11 months ago
N8python / mlx-pretrain
View on GitHub
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆85Aug 20, 2025Updated 11 months ago
PAIR-code / pretraining-tda
View on GitHub
☆33Feb 11, 2025Updated last year
marib00 / llamaindex-embedding-lora
View on GitHub
☆31Mar 18, 2024Updated 2 years ago
nodox / bizbuysell-scraper
View on GitHub
A demo on how to scrape data from BizBuySell.
☆24Apr 11, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
weizeming / momentum-attack-llm
View on GitHub
☆25Jan 17, 2025Updated last year
0xSero / minimax-m2-proxy
View on GitHub
A proxy for minimax-m2, enabling interleaved thinking, and tool calls.
☆39Nov 21, 2025Updated 8 months ago
para-lost / ReBase
View on GitHub
ReBase: Training Task Experts through Retrieval Based Distillation
☆28Feb 5, 2025Updated last year
LongHorizonReasoning / h1
View on GitHub
☆26Oct 29, 2025Updated 8 months ago
4ad / dotfiles
View on GitHub
My collection of dotfiles
☆14Apr 22, 2026Updated 3 months ago
ivanfioravanti / easy-azure-opensource
View on GitHub
OpenSource deployment made easy
☆10Jun 13, 2015Updated 11 years ago
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
belindal / LaMPP
View on GitHub
Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action
☆37Apr 3, 2023Updated 3 years ago
przchojecki / agentic-erdos
View on GitHub
Agentic workflow for tackling all open Erdos problems at once.
☆33May 10, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
armbues / SiLLM-examples
View on GitHub
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
☆16May 8, 2025Updated last year
fsndzomga / open_source_lrm
View on GitHub
☆10Oct 24, 2024Updated last year
misbahsy / XQuotes
View on GitHub
☆13Jun 29, 2024Updated 2 years ago
GitPistachio / Competitive-programming
View on GitHub
Sphere online judge problems solutions
☆11Apr 22, 2023Updated 3 years ago
Archelunch / dspy-repl
View on GitHub
☆46Feb 20, 2026Updated 5 months ago
hrtan / MoSo
View on GitHub
[NeurIPS-2023] The PyTorch Implementation of MoSo. The algorithms are based on our paper: "Data Pruning via Moving-one-Sample-out". MoSo …
☆10May 21, 2026Updated 2 months ago
realsuayip / pcontract
View on GitHub
A data structure to track data over time. It works by tracking time/schedule information rather than tracking data changes over time.
☆13Jan 19, 2024Updated 2 years ago