stanford-cs336 / spring2024-assignment1-basicsLinks
☆64Updated last year
Alternatives and similar repositories for spring2024-assignment1-basics
Users that are interested in spring2024-assignment1-basics are comparing it to the libraries listed below
Sorting:
- ☆99Updated last year
- ☆71Updated 3 months ago
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 11 months ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆66Updated 7 months ago
- ☆393Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆172Updated 4 months ago
- Open-source framework for the research and development of foundation models.☆611Updated this week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated last year
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆301Updated this week
- Understand and test language model architectures on synthetic tasks.☆238Updated last month
- ☆52Updated last year
- ☆91Updated last year
- A puzzle to learn about prompting☆134Updated 2 years ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆225Updated last week
- Benchmarking Optimizers for LLM Pretraining☆40Updated last week
- Open source replication of Anthropic's Crosscoders for Model Diffing☆60Updated last year
- ☆106Updated 3 weeks ago
- ☆38Updated last year
- LLM-Merging: Building LLMs Efficiently through Merging☆205Updated last year
- ☆197Updated 6 months ago
- Open source interpretability artefacts for R1.☆163Updated 6 months ago
- A 7B parameter model for mathematical reasoning☆40Updated 9 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆64Updated 6 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆130Updated 3 years ago
- PyTorch building blocks for the OLMo ecosystem☆317Updated last week
- ☆23Updated 9 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆302Updated 2 weeks ago
- RuLES: a benchmark for evaluating rule-following in language models☆238Updated 8 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆72Updated 6 months ago