kmohan321 / Research_PapersLinks
☆46Updated 8 months ago
Alternatives and similar repositories for Research_Papers
Users that are interested in Research_Papers are comparing it to the libraries listed below
Sorting:
- working implimention of deepseek MLA☆45Updated 10 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 7 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆70Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 8 months ago
- Collection of autoregressive model implementation☆86Updated 7 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆68Updated last week
- ☆45Updated 6 months ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆21Updated 5 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆196Updated 6 months ago
- Low memory full parameter finetuning of LLMs☆53Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated last month
- Exploring Applications of GRPO☆249Updated 3 months ago
- ☆45Updated 7 months ago
- Lego for GRPO☆30Updated 6 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 3 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆60Updated last year
- An introduction to LLM Sampling☆79Updated 11 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆112Updated last month
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 8 months ago
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆81Updated this week
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆36Updated 6 months ago
- Fine tune Gemma 3 on an object detection task☆89Updated 4 months ago
- rl from zero pretrain, can it be done? yes.☆281Updated 2 months ago
- minimal GRPO implementation from scratch☆100Updated 8 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆302Updated last month
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Updated 8 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆228Updated 11 months ago
- aesthetic tensor visualiser☆27Updated 7 months ago
- Simple repository for training small reasoning models☆46Updated 9 months ago
- ☆136Updated last year