HackerCupAI / starter-kits
☆64Updated 9 months ago
Alternatives and similar repositories for starter-kits
Users that are interested in starter-kits are comparing it to the libraries listed below
- ☆27Updated 9 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆110Updated 9 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- A competition to get you started on the NeurIPS AI Hackercup☆28Updated 9 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- ☆228Updated 4 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated last month
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆85Updated last year
- ☆124Updated 8 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆188Updated last month
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆125Updated last year
- ML/DL Math and Method notes☆61Updated last year
- A puzzle to learn about prompting☆131Updated 2 years ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 2 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- ☆30Updated 8 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- ☆43Updated last month
- ☆19Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated 2 months ago
- ☆134Updated 3 months ago
- Let's build better datasets, together!☆260Updated 6 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 8 months ago
- ☆48Updated 8 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆290Updated 10 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆190Updated last year
- A miniature AI training framework for PyTorch☆42Updated 5 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆265Updated last year
- ☆145Updated 11 months ago