eric-prog / GPU-Grants
GPUGrants - a list of GPU grants that I can think of
☆45 Updated last month
Alternatives and similar repositories for GPU-Grants
Users who are interested in GPU-Grants are comparing it to the repositories listed below
- ⏰ AI conference deadline countdowns ☆285 Updated 2 weeks ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed". ☆179 Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆147 Updated last month
- Open source interpretability artefacts for R1. ☆163 Updated 6 months ago
- LLM-Merging: Building LLMs Efficiently through Merging ☆204 Updated last year
- A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE). ☆155 Updated 10 months ago
- This repository collects all relevant resources about interpretability in LLMs ☆377 Updated last year
- ☆81 Updated 8 months ago
- List of AI Internships ☆128 Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆171 Updated 4 months ago
- ☆208 Updated 11 months ago
- A brief and partial summary of RLHF algorithms. ☆136 Updated 8 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts. ☆137 Updated last year
- ☆29 Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆239 Updated 8 months ago
- ☆99 Updated last year
- minimal GRPO implementation from scratch ☆99 Updated 7 months ago
- Physics of Language Models, Part 4 ☆255 Updated 3 months ago
- An extension of the nanoGPT repository for training small MOE models. ☆207 Updated 8 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆232 Updated 3 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆193 Updated last year
- 🧠 Starter templates for doing interpretability research ☆75 Updated 2 years ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch ☆179 Updated 4 months ago
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind ☆177 Updated last year
- ☆197 Updated 6 months ago
- ☆142 Updated 2 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆59 Updated last year
- ☆108 Updated last year
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆301 Updated last week
- Prune transformer layers ☆69 Updated last year