stanford-cs336 / spring2024-assignment1-basics
☆38 Updated 9 months ago
Alternatives and similar repositories for spring2024-assignment1-basics:
Users who are interested in spring2024-assignment1-basics are comparing it to the repositories listed below:
- Sparse and discrete interpretability tool for neural networks ☆62 Updated last year
- ☆49 Updated last year
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆43 Updated 9 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆72 Updated 8 months ago
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆55 Updated last week
- ☆37 Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 Updated 11 months ago
- ☆85 Updated 7 months ago
- ☆61 Updated this week
- ☆71 Updated this week
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆71 Updated 5 months ago
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu… ☆39 Updated last year
- Easily run PyTorch on multiple GPUs & machines ☆45 Updated last month
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆139 Updated this week
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models ☆47 Updated 2 months ago
- ☆73 Updated this week
- A reading list of relevant papers and projects on foundation model annotation ☆26 Updated 2 months ago
- Understand and test language model architectures on synthetic tasks. ☆193 Updated last month
- Code associated with papers on superposition (in ML interpretability) ☆27 Updated 2 years ago
- Building the cognitive-core to solve ARC-AGI-2 ☆20 Updated 2 months ago
- ☆71 Updated 5 months ago
- ☆254 Updated 4 months ago
- Datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆74 Updated last year
- Language models scale reliably with over-training and on downstream tasks ☆96 Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 2 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆169 Updated this week
- A library to create and manage configuration files, especially for machine learning projects. ☆77 Updated 3 years ago
- An extension of the nanoGPT repository for training small MoE models. ☆131 Updated last month
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆40 Updated last month
- ☆38 Updated last year