HackerCupAI / starter-kitsLinks
☆64Updated 8 months ago
Alternatives and similar repositories for starter-kits
Users that are interested in starter-kits are comparing it to the libraries listed below
Sorting:
- A competition to get you started on the NeurIPS AI Hackercup☆28Updated 9 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated 8 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆45Updated 4 months ago
- ☆47Updated 7 months ago
- ☆25Updated 8 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆257Updated last year
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆46Updated this week
- Building GPT ...☆18Updated 6 months ago
- ☆46Updated 3 weeks ago
- ☆227Updated 3 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 weeks ago
- Iterate fast on your RAG pipelines☆23Updated this week
- A miniture AI training framework for PyTorch☆42Updated 4 months ago
- ☆134Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- This repository contain the simple llama3 implementation in pure jax.☆66Updated 4 months ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆185Updated 3 weeks ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆124Updated last year
- LLM training in simple, raw C/CUDA☆14Updated 6 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆34Updated 8 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆260Updated 11 months ago
- ☆30Updated 7 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated last year
- List of online discord servers for ML collaborations.☆29Updated 7 months ago