stanford-cs324 / winter2023
☆35Updated last year
Related projects ⓘ
Alternatives and complementary repositories for winter2023
- ☆47Updated 9 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆61Updated 7 months ago
- Website☆47Updated last year
- ☆55Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Make triton easier☆41Updated 5 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆61Updated last month
- PyTorch building blocks for OLMo☆19Updated this week
- ☆25Updated last year
- Understanding the correlation between different LLM benchmarks☆29Updated 10 months ago
- Minimum Description Length probing for neural network representations☆16Updated this week
- ☆49Updated 6 months ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆34Updated 2 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆29Updated 6 months ago
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu…☆38Updated 11 months ago
- ☆51Updated last month
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆37Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆41Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆61Updated 4 months ago
- ☆38Updated 7 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆17Updated 7 months ago
- Download, parse, and filter data PubMed, data-ready for The-Pile☆20Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆30Updated 9 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆68Updated last year
- The Efficiency Spectrum of LLM☆52Updated 11 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Experiments for efforts to train a new and improved t5☆76Updated 7 months ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆55Updated last year