HackerCupAI / starter-kitsLinks

☆67

Alternatives and similar repositories for starter-kits

Users that are interested in starter-kits are comparing it to the libraries listed below

Sorting:

divyamakkar0 / JAXformer
A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
☆95Updated 2 weeks ago
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆194Updated 4 months ago
muellerzr / nbdistributed
Seemless interface of using PyTOrch distributed with Jupyter notebooks
☆50Updated 3 weeks ago
google-deepmind / onetwo
☆243Updated 7 months ago
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆110Updated last year
wandb / aihackercup
A competition to get you started on the NeurIPS AI Hackercup
☆29Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated last year
stas00 / ml-ways
ML/DL Math and Method notes
☆64Updated last year
AnswerDotAI / minai
A miniture AI training framework for PyTorch
☆42Updated 8 months ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 10 months ago
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112Updated 4 months ago
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆126Updated 2 years ago
muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆195Updated last year
huggingface / competitions
☆124Updated 11 months ago
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆56Updated last week
srush / Tensor-Puzzles-Penzai
☆21Updated last year
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆117Updated 5 months ago
kevinwu23 / StanfordFineTuneBench
☆31Updated 10 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆40Updated 8 months ago
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆70Updated 7 months ago
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87Updated last year
google-deepmind / mishax
☆142Updated last month
Cohere-Labs-Community / AI-Alignment-Cohort
☆28Updated last year
rasbt / RAGs
RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems
☆133Updated 8 months ago
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆68Updated 4 months ago
changjonathanc / flex-nano-vllm
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆290Updated 2 months ago
gpu-mode / profiling-cuda-in-torch
☆173Updated last year
gpu-mode / discord-cluster-manager
Write a fast kernel and run it on Discord. See how you compare against the best!
☆58Updated 2 weeks ago
apple / ml-hypercloning
☆52Updated 11 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 5 months ago