☆94Oct 5, 2023Updated 2 years ago
Alternatives and similar repositories for train-with-fsdp
Users who are interested in train-with-fsdp are comparing it to the libraries listed below
- Utilities for Training Very Large Models☆58Sep 25, 2024Updated last year
- batched loras☆350Sep 6, 2023Updated 2 years ago
- Re-implementation of local descriptor HardNet training in fastai2+kornia☆21Apr 6, 2020Updated 5 years ago
- ☆23Jul 10, 2023Updated 2 years ago
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 5 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- A chat implementation for FastHTML☆11Sep 14, 2025Updated 5 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated last year
- ☆10Apr 21, 2024Updated last year
- Run multiple programs to check if a VCF is usable☆11May 15, 2020Updated 5 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- ☆198Feb 9, 2024Updated 2 years ago
- ☆415Nov 2, 2023Updated 2 years ago
- This repo consists of code for plotting top loss images☆13May 18, 2020Updated 5 years ago
- ☆14Oct 18, 2023Updated 2 years ago
- PharML is a framework for predicting compound affinity for protein structures. It utilizes a novel Molecular-Highway Graph Neural Network…☆13May 8, 2020Updated 5 years ago
- Typescript parser combinator library☆15Jan 9, 2026Updated last month
- CargoCoin is designed to be a smart contract, crypto currency platform, decentralising global trade and transport. The platform target is…☆13Aug 8, 2018Updated 7 years ago
- Label images with LabelImg; Object detection with detectron2☆13Aug 20, 2021Updated 4 years ago
- A simple uv workspace☆19Apr 5, 2025Updated 11 months ago
- This repo lets you run mistral-7b in Google Colab.☆16Oct 1, 2023Updated 2 years ago
- ☆19Aug 10, 2024Updated last year
- A pedagogical implementation of panel apps served up on a remote machine.☆14Oct 27, 2021Updated 4 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Jan 13, 2023Updated 3 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆206Aug 10, 2024Updated last year
- A curated list of resources on what's happening in multimodal learning. Features recent papers, books, related lectures, and other relevant resou…☆16Apr 28, 2023Updated 2 years ago
- An introduction to DSPy☆34Aug 30, 2025Updated 6 months ago
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆19May 26, 2024Updated last year
- ☆24Sep 2, 2022Updated 3 years ago
- LLM training in simple, raw C/CUDA☆15Dec 5, 2024Updated last year
- ☆17Jul 28, 2023Updated 2 years ago
- Chatbot in Spanish using different models: a Seq2Seq model with Luong attention and a transformer☆17Jan 9, 2020Updated 6 years ago
- Various handy scripts to quickly set up new Linux and Windows sandboxes, containers and WSL.☆40Feb 21, 2026Updated last week
- Customizable implementation of the self-instruct paper.☆1,049Mar 7, 2024Updated last year
- CNN ensemble for prostate cancer Gleason grading☆19Jan 28, 2026Updated last month
- ☆16Jun 4, 2016Updated 9 years ago
- ☆19Dec 4, 2025Updated 3 months ago