HackerCupAI / starter-kitsLinks
☆68Updated last year
Alternatives and similar repositories for starter-kits
Users that are interested in starter-kits are comparing it to the libraries listed below
Sorting:
- A competition to get you started on the NeurIPS AI Hackercup☆29Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated last year
- ☆251Updated last week
- ☆29Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆196Updated 5 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 7 months ago
- Building GPT ...☆18Updated 11 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆113Updated 5 months ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆104Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆195Updated last year
- ML/DL Math and Method notes☆64Updated last year
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆55Updated 2 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
- Fine tune Gemma 3 on an object detection task☆88Updated 4 months ago
- Highly commented implementations of Transformers in PyTorch☆137Updated 2 years ago
- Training-Ready RL Environments + Evals☆177Updated this week
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆110Updated this week
- ☆124Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated 2 years ago
- A puzzle to learn about prompting☆135Updated 2 years ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Updated 8 months ago
- LLM training in simple, raw C/CUDA☆15Updated 11 months ago
- ☆43Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆268Updated 6 months ago
- Notebooks for fine tuning pali gemma☆117Updated 7 months ago
- ☆143Updated 2 months ago
- ☆31Updated last year
- This repository contain the simple llama3 implementation in pure jax.☆70Updated 9 months ago
- Material for the series of seminars on Large Language Models☆34Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆72Updated 7 months ago