google-deepmind / transformer_ngramsLinks
☆31Updated 11 months ago
Alternatives and similar repositories for transformer_ngrams
Users that are interested in transformer_ngrams are comparing it to the libraries listed below
Sorting:
- ☆142Updated last month
- Open source interpretability artefacts for R1.☆161Updated 5 months ago
- Training-Ready RL Environments + Evals☆121Updated this week
- Our solution for the arc challenge 2024☆179Updated 3 months ago
- Attribution-based Parameter Decomposition☆31Updated 4 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- ☆188Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆99Updated this week
- 📄Small Batch Size Training for Language Models☆63Updated last week
- Implementation for robust ViT and scaled attention☆20Updated 6 months ago
- ☆124Updated 9 months ago
- ☆97Updated 2 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆268Updated this week
- ☆103Updated this week
- ☆16Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆164Updated 3 months ago
- Open-source framework for the research and development of foundation models.☆481Updated this week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last week
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆129Updated 3 years ago
- SIMD quantization kernels☆87Updated last month
- ☆114Updated last month
- Create an AI capable of solving reasoning tasks it has never seen before☆95Updated 10 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 5 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆296Updated last month
- code for training & evaluating Contextual Document Embedding models☆197Updated 4 months ago
- Evaluation of LLMs on latest math competitions☆171Updated 3 weeks ago
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆62Updated last week
- RLP: Reinforcement as a Pretraining Objective☆155Updated last week
- An introduction to LLM Sampling☆79Updated 9 months ago
- ☆475Updated 2 months ago