AutoLLM / ArxivDigest
ArXiv Digest and Personalized Recommendations using Large Language Models
☆318Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for ArxivDigest
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆251Updated last year
- ☆411Updated last year
- batched loras☆336Updated last year
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆101Updated last year
- ☆258Updated last month
- Scaling Data-Constrained Language Models☆321Updated last month
- ☆246Updated 4 months ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆210Updated 10 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆292Updated 10 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆160Updated last month
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆216Updated 7 months ago
- RuLES: a benchmark for evaluating rule-following in language models☆210Updated last month
- Official PyTorch implementation of QA-LoRA☆116Updated 7 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆183Updated 10 months ago
- A comprehensive deep dive into the world of tokens☆214Updated 4 months ago
- Website for hosting the Open Foundation Models Cheat Sheet.☆255Updated 4 months ago
- Generate textbook-quality synthetic LLM pretraining data☆488Updated last year
- ☆126Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆194Updated 6 months ago
- Extract full next-token probabilities via language model APIs☆228Updated 8 months ago
- Tools for understanding how transformer predictions are built layer-by-layer☆429Updated 5 months ago
- The official evaluation suite and dynamic data release for MixEval.☆224Updated this week
- A puzzle to learn about prompting☆120Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆177Updated 5 months ago
- Evaluating LLMs with fewer examples☆134Updated 7 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆618Updated last month
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆265Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆154Updated 6 months ago
- ☆266Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆236Updated 4 months ago