alphaXiv/paper-implementations

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alphaXiv/paper-implementations)

alphaXiv / paper-implementations

Clean, reusable paper implementations for trending papers on alphaXiv

☆199

Alternatives and similar repositories for paper-implementations

Users that are interested in paper-implementations are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MaximeRivest / moereport
View on GitHub
☆19Aug 23, 2025Updated 11 months ago
alphaXiv / feedback
View on GitHub
Issue tracker for https://alphaxiv.org
☆24Oct 13, 2025Updated 9 months ago
alphaXiv / TinyRecursiveModels
View on GitHub
☆30Dec 15, 2025Updated 7 months ago
machinestein / Deep-Improvement-Supervision
View on GitHub
Official PyTorch implementation of "Latent Reasoning in TRMs is Secretly a Policy Improvement Operator" (ICML 2026)
☆23May 29, 2026Updated last month
YuvrajSingh-mist / smolcluster
View on GitHub
An educational distributed training and inference library for neural nets using local computing
☆72Jun 10, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BhavyaGoyal777 / IMPLEMENTING-RESEARCH-PAPERS
View on GitHub
Basically a repo containing architectures/algorithms/papers from scratch in pytorch
☆30Feb 11, 2026Updated 5 months ago
Chengsong-Huang / G-Zero
View on GitHub
☆25May 14, 2026Updated 2 months ago
mvakde / mdlARC
View on GitHub
Goal is to solve sample efficiency by using ARC-AGI as a benchmark
☆165Apr 21, 2026Updated 3 months ago
sjelassi / ebft_openrlhf
View on GitHub
Code for "Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models".
☆23Mar 16, 2026Updated 4 months ago
rehaanahmad2013 / self-improving-robots
View on GitHub
☆19Mar 28, 2023Updated 3 years ago
mehdie79 / RTM_latent_refinement
View on GitHub
☆22Jul 10, 2026Updated 2 weeks ago
YuvrajSingh-mist / Paper-Replications
View on GitHub
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆425Nov 11, 2025Updated 8 months ago
olivkoch / TinyRecursiveModels
View on GitHub
☆35Nov 11, 2025Updated 8 months ago
lambda-calculus-LLM / lambda-RLM
View on GitHub
Method for Long Context RLMs using verifiable Lambda Calculus
☆304Apr 24, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
BJSwaroop / PortfolioCodeWithSwaroop
View on GitHub
☆29Sep 16, 2023Updated 2 years ago
nathanrs / tiny-infini-gram
View on GitHub
An unbounded n-gram language model on Tiny Shakespeare
☆22Jan 21, 2026Updated 6 months ago
jammastergirish / BuildAnLLM
View on GitHub
☆174May 29, 2026Updated last month
BAI-LAB / MoE-CL
View on GitHub
[WWW 2026 Oral] MoE-CL:Self-Evolving LLMs via Continual Instruction Tuning
☆21Dec 1, 2025Updated 7 months ago
avbiswas / finetuning_recipes
View on GitHub
☆109Jun 18, 2026Updated last month
NadavSc / Diff-Mamba
View on GitHub
☆22Jan 23, 2026Updated 6 months ago
shreyansh26 / pytorch-distributed-training-from-scratch
View on GitHub
A simple but instructive implementation of DP, TP, FSDP, FSDP+TP using pytorch distributed primitives
☆19Apr 12, 2026Updated 3 months ago
distil-labs / distil-cli-skill
View on GitHub
Claude skill for distil cli
☆177Updated this week
avbiswas / text-albumentations
View on GitHub
A simple library for generating instruction tuning datasets locally
☆90Jun 10, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AsyncFuncAI / alphaxiv-open
View on GitHub
alphaxiv open source alternative
☆141May 24, 2025Updated last year
hyc2026 / M3-Agent-Training
View on GitHub
☆30Mar 30, 2026Updated 3 months ago
SakanaAI / sparser-faster-llms
View on GitHub
Cuda kernels for leveraging LLM sparsity to improve throughput and decrease the memory requirements during inference and training.
☆254Jun 29, 2026Updated 3 weeks ago
SamsungSAILMontreal / TinyRecursiveModels
View on GitHub
☆6,574Apr 1, 2026Updated 3 months ago
open-tinker / OpenTinker
View on GitHub
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676Mar 21, 2026Updated 4 months ago
stockeh / mlx-drifting-model
View on GitHub
Generative Modeling via Drifting in MLX
☆43Feb 6, 2026Updated 5 months ago
cmpnd-ai / dspy-tutorial-deep-research
View on GitHub
Learn DSPy's core abstractions while building a deep research agent.
☆44Mar 8, 2026Updated 4 months ago
autoLearnMem / AutoMem
View on GitHub
AutoMem: Automated Learning of Memory as a Cognitive Skill
☆130Jul 3, 2026Updated 3 weeks ago
naklecha / simple-llm
View on GitHub
~950 line, minimal, extensible LLM inference engine built from scratch.
☆478Jan 9, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LaunchPlatform / marketplace
View on GitHub
Marketplace ML experiment - training without backprop
☆28Sep 9, 2025Updated 10 months ago
SrijanSriv211 / Strawberry
View on GitHub
Is strawberry a fruit or a vegetable?
☆54Jun 10, 2026Updated last month
sapientinc / data_io
View on GitHub
Data pipeline for HRM-Text pretraining
☆68May 21, 2026Updated 2 months ago
ESHyperscale / nano-egg
View on GitHub
Evolution Pretraining Fully in Int Formats
☆177Feb 25, 2026Updated 4 months ago
EvanZhuang / knowledge_flow
View on GitHub
Official Implementation of Knowledge Flow Prompting
☆35Oct 20, 2025Updated 9 months ago
actava-ai / Cura
View on GitHub
actAVA Cura: Specialized Model for Agentic Healthcare
☆22Updated this week
doyc-1 / Crux
View on GitHub
The State Of The Art, intelligence
☆162Aug 12, 2025Updated 11 months ago