sangmichaelxie / cs324_p2
Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
☆104 Updated 2 years ago
Alternatives and similar repositories for cs324_p2
Users who are interested in cs324_p2 are comparing it to the libraries listed below
- A puzzle to learn about prompting ☆127 Updated 2 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆254 Updated last year
- ☆166 Updated last year
- Functional local implementations of main model parallelism approaches ☆95 Updated 2 years ago
- [NeurIPS 2023] Learning Transformer Programs ☆161 Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet. ☆267 Updated 3 weeks ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp… ☆220 Updated last year
- ☆51 Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆70 Updated 2 years ago
- RuLES: a benchmark for evaluating rule-following in language models ☆223 Updated 3 months ago
- ☆149 Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆75 Updated 9 months ago
- Experiments for efforts to train a new and improved t5 ☆76 Updated last year
- ☆78 Updated 10 months ago
- An interactive exploration of Transformer programming. ☆264 Updated last year
- Scaling Data-Constrained Language Models ☆334 Updated 8 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆82 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆103 Updated 5 months ago
- ML/DL Math and Method notes ☆61 Updated last year
- Evaluating LLMs with CommonGen-Lite ☆90 Updated last year
- Code accompanying the paper Pretraining Language Models with Human Preferences ☆182 Updated last year
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆117 Updated this week
- ☆266 Updated 4 months ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference… ☆207 Updated 4 months ago
- ☆68 Updated 9 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆58 Updated last year
- ☆92 Updated last year
- A (somewhat) minimal library for finetuning language models with PPO on human feedback. ☆86 Updated 2 years ago
- Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full finetunes. ☆81 Updated last year
- Collection of autoregressive model implementations ☆85 Updated last month