Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
☆104Mar 16, 2023Updated 3 years ago
Alternatives and similar repositories for cs324_p2
Users who are interested in cs324_p2 are comparing it to the libraries listed below.
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- VANS: A validated NVRAM simulator☆27Nov 22, 2023Updated 2 years ago
- ☆45Nov 1, 2022Updated 3 years ago
- ☆29Nov 5, 2021Updated 4 years ago
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Oct 11, 2022Updated 3 years ago
- Demonstration of the use of TensorRT and TRITON☆16Feb 9, 2021Updated 5 years ago
- Shandong University Cycling Association: past issues of the association journal☆14Apr 8, 2023Updated 3 years ago
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 9 months ago
- ☆17Dec 9, 2022Updated 3 years ago
- Debug print operator for cudagraph debugging☆18Aug 2, 2024Updated last year
- Interactive notebooks inside Roam☆11Mar 30, 2022Updated 4 years ago
- "CCNLab: A Benchmarking Framework for Computational Cognitive Neuroscience" (NeurIPS 2021)☆10Jul 12, 2021Updated 4 years ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆52Oct 31, 2024Updated last year
- ☆12Jan 27, 2025Updated last year
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- A Continual Learning Library in PyTorch and JAX☆13Apr 18, 2023Updated 3 years ago
- Showcases training on various NLP downstream tasks with pre-trained language models using PyTorch Lightning☆13Aug 7, 2022Updated 3 years ago
- Official codebase for our paper "Joslim: Joint Widths and Weights Optimization for Slimmable Neural Networks"☆12Jun 30, 2021Updated 5 years ago
- A simple SQL parser based on Apache Calcite.☆14May 8, 2026Updated last month
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆21Sep 20, 2024Updated last year
- A set of tools to Filter/transform/render RoamResearch JSON export. Used in Roam Garden☆18Apr 3, 2021Updated 5 years ago
- ☆13Jun 12, 2024Updated 2 years ago
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models☆19Sep 11, 2024Updated last year
- ☆10Oct 17, 2022Updated 3 years ago
- Similarity Encoder (SimEc) Neural Network Framework for learning low dimensional similarity preserving representations☆17Jun 28, 2020Updated 6 years ago
- Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementations from scratch with Python☆19Jan 30, 2023Updated 3 years ago
- An EDM-enabled PHY + a rack-level network simulator☆14Dec 11, 2024Updated last year
- Code for the Paper 'On the Connection Between Adversarial Robustness and Saliency Map Interpretability' by C. Etmann, S. Lunz, P. Maass, …☆16May 9, 2019Updated 7 years ago
- Improving Transformation Invariance in Contrastive Representation Learning☆12Mar 13, 2021Updated 5 years ago
- Survey of Learning To Rank☆16Nov 13, 2025Updated 7 months ago
- ☆14Mar 9, 2023Updated 3 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆139Apr 30, 2024Updated 2 years ago
- Implementation of a fast Chung-Lu random graph generator☆11Oct 21, 2019Updated 6 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- Collection of Microsoft Navision SQL Queries for Customers, Suppliers, Products, Inventory, Accounts Receivable Ledger, Sales Invoices, P…☆11Nov 5, 2019Updated 6 years ago
- SQL Fundamentals for Marketing, Digital and Web Analytics [Video], Published by Packt☆14Dec 15, 2025Updated 6 months ago
- bootstrap my zsh shell☆16Mar 28, 2026Updated 3 months ago
- ☆54Jan 19, 2023Updated 3 years ago