stanford-cs324 / winter2023Links
☆38Updated 2 years ago
Alternatives and similar repositories for winter2023
Users that are interested in winter2023 are comparing it to the libraries listed below
Sorting:
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- ☆52Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆68Updated 11 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 4 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆79Updated last year
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Updated last year
- Open Implementations of LLM Analyses☆107Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- ML/DL Math and Method notes☆64Updated 2 years ago
- Minimum Description Length probing for neural network representations☆20Updated 10 months ago
- Make triton easier☆49Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Updated 2 years ago
- ☆75Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆58Updated this week
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 11 months ago
- Language models scale reliably with over-training and on downstream tasks☆100Updated last year
- Website☆57Updated 2 years ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 7 months ago
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- ☆109Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆226Updated 2 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆150Updated last year
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆76Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- train with kittens!☆63Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆61Updated last year