Lossfunk / KernelBench-v2Links

KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems

☆21

Alternatives and similar repositories for KernelBench-v2

Users that are interested in KernelBench-v2 are comparing it to the libraries listed below

Sorting:

facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆93Updated this week
tyler-romero / microR1
Simple repository for training small reasoning models
☆32Updated 5 months ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 8 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68Updated 3 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆66Updated 2 weeks ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆49Updated 3 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
CLAIRE-Labo / quantile-reward-policy-optimization
Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …
☆23Updated 3 weeks ago
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated 2 months ago
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated 11 months ago
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆78Updated last week
kmohan321 / Research_Papers
☆46Updated 4 months ago
PiotrNawrot / sparse-frontier
The evaluation framework for training-free sparse attention in LLMs
☆86Updated last month
okarthikb / state-space-models
☆27Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 4 months ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆18Updated last week
google-deepmind / asyncdiloco
☆45Updated last year
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆33Updated 9 months ago
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆143Updated 2 months ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆99Updated 3 months ago
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆77Updated 9 months ago
lilakk / BLEUBERI
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆25Updated 2 months ago
RobertCsordas / moeut
☆83Updated 11 months ago
epfml / DenseFormer
☆81Updated last year
SinatrasC / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆17Updated 9 months ago
HarleyCoops / smolThinker-.5B
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14Updated 5 months ago
google-deepmind / regress-lm
Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…
☆94Updated last week
cloneofsimo / min-fsdp
☆83Updated last year
arcee-ai / DAM
☆53Updated 8 months ago