stanford-cs324 / winter2023
☆36Updated last year
Alternatives and similar repositories for winter2023:
Users that are interested in winter2023 are comparing it to the libraries listed below
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Minimum Description Length probing for neural network representations☆18Updated this week
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- PyTorch building blocks for OLMo☆49Updated this week
- Aioli: A unified optimization framework for language model data mixing☆19Updated last week
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Website☆51Updated 2 years ago
- ☆48Updated last year
- ☆25Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- The Efficiency Spectrum of LLM☆52Updated last year
- Utilities for Training Very Large Models☆57Updated 4 months ago
- Embedding Recycling for Language models☆38Updated last year
- 💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.☆45Updated last year
- ☆59Updated 9 months ago
- ML/DL Math and Method notes☆58Updated last year
- Code repo for MathAgent☆13Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- distill chatGPT coding ability into small model (1b)☆26Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆43Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆63Updated last month
- Training hybrid models for dummies.☆18Updated 2 weeks ago
- ☆37Updated 9 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆31Updated 11 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆69Updated last month
- ☆23Updated last year
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆58Updated 3 months ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆47Updated 6 months ago