Repository of implementations of classic and sota rl algorithms from scratch in PyTorch
☆220Jan 3, 2026Updated 2 months ago
Alternatives and similar repositories for NeatRL
Users that are interested in NeatRL are comparing it to the libraries listed below
Sorting:
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆408Nov 11, 2025Updated 3 months ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 5 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆17Feb 9, 2026Updated last month
- Multimodal AI workloads: batch inference, model training and online serving.☆107Aug 22, 2025Updated 6 months ago
- learningggggggg 🐳☆576Apr 2, 2025Updated 11 months ago
- everything i know about cuda and triton☆13Jan 28, 2025Updated last year
- Notes of PRNN course taught at IISC as part of MTech AI curriculum☆16Nov 30, 2024Updated last year
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models☆25Oct 17, 2025Updated 4 months ago
- ☆31Feb 28, 2026Updated last week
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆81Feb 10, 2026Updated last month
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆45Jan 6, 2026Updated 2 months ago
- ☆27Apr 17, 2025Updated 10 months ago
- My submission for the GPUMODE/AMD fp8 mm challenge☆29Jun 4, 2025Updated 9 months ago
- Manages vllm-nccl dependency☆17Jun 3, 2024Updated last year
- Building the EVM from Scratch☆24Feb 28, 2024Updated 2 years ago
- qwen3 experiments☆34Jul 1, 2025Updated 8 months ago
- Some tools for learning purrr☆20May 23, 2018Updated 7 years ago
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…☆46Sep 7, 2025Updated 6 months ago
- Research on training an LLM with DeepSeek & Kimi architecture☆41Sep 30, 2025Updated 5 months ago
- Long CoT Fine-Tuning and Reinforcement Learning for LLMs in the Context of the 24-Point Game: A Toy Project☆25Feb 22, 2025Updated last year
- Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/☆56Jan 18, 2026Updated last month
- ☆28Apr 2, 2025Updated 11 months ago
- coding CUDA everyday!☆74Feb 5, 2026Updated last month
- Free and open-source curriculum to master artificial intelligence☆35Feb 28, 2025Updated last year
- Paper implementation☆48Apr 8, 2025Updated 11 months ago
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.☆1,157Jan 23, 2025Updated last year
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆229Jan 2, 2025Updated last year
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆78Apr 4, 2025Updated 11 months ago
- Basically a repo containing architectures/algorithms/papers from scratch in pytorch☆30Feb 11, 2026Updated 3 weeks ago
- a Change Data Capture (CDC) system using Outbox Pattern with Postgres WAL, Redis Streams and gRPC☆80Dec 1, 2025Updated 3 months ago
- ☆30Jan 25, 2025Updated last year
- WTF Vyper极简入门,供小白们使用,每周更新1-3讲。官网: https://wtf.academy☆31Apr 7, 2024Updated last year
- rl from zero pretrain, can it be done? yes.☆288Sep 28, 2025Updated 5 months ago
- Powerful Auto Research powered by LangChain, and Anthropic.☆29Jul 16, 2024Updated last year
- AI-driven storytelling system☆10Apr 24, 2025Updated 10 months ago
- documentation used in my projects☆16Mar 2, 2026Updated last week
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆77Aug 30, 2023Updated 2 years ago
- Learnings and programs related to CUDA☆434Jun 29, 2025Updated 8 months ago