Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
☆105Mar 16, 2023Updated 3 years ago
Alternatives and similar repositories for cs324_p2
Users that are interested in cs324_p2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Feb 22, 2023Updated 3 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- VANS: A validated NVRAM simulator☆27Nov 22, 2023Updated 2 years ago
- ☆43Nov 1, 2022Updated 3 years ago
- ☆18Apr 23, 2025Updated 11 months ago
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆21Jan 2, 2026Updated 2 months ago
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 6 months ago
- Debug print operator for cudagraph debugging☆14Aug 2, 2024Updated last year
- Cookiecutter template for a RR Python library.☆12May 28, 2022Updated 3 years ago
- ccNVMe: crash consistent non-volatile memory express☆14Aug 17, 2021Updated 4 years ago
- Scripts to parse arxiv documents for NLP tasks☆19Jun 12, 2023Updated 2 years ago
- This tool dumps images in tensorboard☆17Sep 12, 2020Updated 5 years ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Oct 31, 2024Updated last year
- ☆12Jan 27, 2025Updated last year
- ☆13Jan 14, 2020Updated 6 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- ☆13Jan 20, 2021Updated 5 years ago
- a distributed computation platform for running Python and Bash computation tasks on multiple nodes☆12Mar 19, 2025Updated last year
- A copy of the datasets for PROBEN1 from the paper "Proben1: A Set of Neural Network Benchmark Problems and Benchmarking Rules", Lutz Prec…☆12Jul 21, 2015Updated 10 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- Atari-style POMDPs☆25Feb 13, 2026Updated last month
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆21Sep 20, 2024Updated last year
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models☆19Sep 11, 2024Updated last year
- Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementaions from scratch with python☆18Jan 30, 2023Updated 3 years ago
- ☆58Aug 19, 2025Updated 7 months ago
- ☆15Jul 1, 2021Updated 4 years ago
- An EDM-enabled PHY + a rack-level network simulator☆14Dec 11, 2024Updated last year
- Fibertree emulator☆17Nov 4, 2024Updated last year
- Python implements of the code in "14 lectures on visual SLAM"☆11Aug 29, 2019Updated 6 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆138Apr 30, 2024Updated last year
- Helper scripts I use to run many experiments in the morning to check at night☆20Jun 14, 2021Updated 4 years ago
- ☆40Nov 28, 2022Updated 3 years ago
- ☆11Dec 7, 2024Updated last year
- Implementation of a fast Chung-Lu random graph generator☆11Oct 21, 2019Updated 6 years ago
- ☆18Apr 15, 2024Updated last year
- ☆52Jan 19, 2023Updated 3 years ago
- HFCommunity offers an offline up-to-date relational database built from the data available at the Hugging Face Hub, providing queriable d…☆15Oct 14, 2024Updated last year
- ☆15Oct 26, 2021Updated 4 years ago