google-deepmind / transformer_ngramsLinks
☆30Updated 9 months ago
Alternatives and similar repositories for transformer_ngrams
Users that are interested in transformer_ngrams are comparing it to the libraries listed below
Sorting:
- Open source interpretability artefacts for R1.☆157Updated 3 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- ☆98Updated last week
- ☆136Updated 4 months ago
- Machine Learning with Symbolic Tensors☆325Updated 2 months ago
- ☆97Updated last week
- Our solution for the arc challenge 2024☆168Updated last month
- Dion optimizer algorithm☆259Updated last week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆95Updated 2 weeks ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆466Updated this week
- Minimal GPT (~350 lines with a simple task to test it)☆62Updated 8 months ago
- ☆174Updated 4 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆189Updated 2 months ago
- ☆363Updated this week
- SIMD quantization kernels☆79Updated this week
- A JAX-native LLM Post-Training Library☆92Updated this week
- ☆470Updated 3 weeks ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆63Updated 2 months ago
- ☆275Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆93Updated 5 months ago
- ☆42Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆149Updated last month
- The history files when recording human interaction while solving ARC tasks☆114Updated 2 weeks ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆25Updated 6 months ago
- ☆186Updated last week
- ☆449Updated last week
- ☆415Updated 2 months ago
- Resources from the EleutherAI Math Reading Group☆53Updated 5 months ago
- Alex Krizhevsky's original code from Google Code☆196Updated 9 years ago
- ☆62Updated 9 months ago