Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
☆104Mar 16, 2023Updated 3 years ago
Alternatives and similar repositories for cs324_p2
Users that are interested in cs324_p2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- BVH accelerated CPU and OpenGL ray-tracing☆19Mar 13, 2023Updated 3 years ago
- 山东大学自行车协会 历届会刊☆12Apr 8, 2023Updated 3 years ago
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 8 months ago
- Repository for the Tweet2Story framework for the extraction of narratives from tweets.☆13Feb 13, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A better prompt.☆89Nov 9, 2016Updated 9 years ago
- ☆23Jun 21, 2023Updated 2 years ago
- "CCNLab: A Benchmarking Framework for Computational Cognitive Neuroscience" (NeurIPS 2021)☆10Jul 12, 2021Updated 4 years ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆52Oct 31, 2024Updated last year
- ☆13Jan 14, 2020Updated 6 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- a distributed computation platform for running Python and Bash computation tasks on multiple nodes☆11Mar 19, 2025Updated last year
- Showcasing various NLP Downstream tasks Training with pre-trained Language models using Pytorch Lightning☆13Aug 7, 2022Updated 3 years ago
- Official codebase for our paper "Joslim: Joint Widths and Weights Optimization for Slimmable Neural Networks"☆12Jun 30, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Automatic gradient descent☆217Jun 26, 2023Updated 2 years ago
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆21Sep 20, 2024Updated last year
- ☆10Oct 17, 2022Updated 3 years ago
- ☆59Aug 19, 2025Updated 9 months ago
- ☆16Jul 1, 2021Updated 4 years ago
- An EDM-enabled PHY + a rack-level network simulator☆14Dec 11, 2024Updated last year
- Code for the Paper 'On the Connection Between Adversarial Robustness and Saliency Map Interpretability' by C. Etmann, S. Lunz, P. Maass, …☆16May 9, 2019Updated 7 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆139Apr 30, 2024Updated 2 years ago
- Implementation of a fast Chung-Lu random graph generator☆11Oct 21, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- UCB 285 Deep Reinforcement Learning (Fall 2023) Homeworks☆13Nov 11, 2023Updated 2 years ago
- bootstrap my zsh shell☆17Mar 28, 2026Updated last month
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- ☆14Apr 19, 2022Updated 4 years ago
- Evaluation Suite for NVMe devices☆14Nov 14, 2024Updated last year
- Load & manage evolving datasets efficiently☆23Aug 22, 2025Updated 9 months ago
- ☆12Feb 17, 2025Updated last year
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,019Aug 21, 2024Updated last year
- Cache Simulator specialized for flash caching for bulk storage systems)☆13Jan 16, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Collection of resources on plasticity loss in deep reinforcement learning☆23Nov 12, 2024Updated last year
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆15Jan 25, 2026Updated 4 months ago
- Distributed Optimization Infra for learning CLIP models☆31Oct 3, 2024Updated last year
- Some utility functions to help myself (and perhaps others) go faster with ML/AI work☆49May 16, 2026Updated last week
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 5 months ago
- ☆21Sep 27, 2023Updated 2 years ago
- Action Value Gradient Algorithm☆28May 18, 2025Updated last year