google-deepmind / transformer_ngramsLinks
☆29Updated 8 months ago
Alternatives and similar repositories for transformer_ngrams
Users that are interested in transformer_ngrams are comparing it to the libraries listed below
Sorting:
- ☆41Updated 6 months ago
- Minimal GPT (~350 lines with a simple task to test it)☆62Updated 7 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated 11 months ago
- Simple repository for training small reasoning models☆33Updated 5 months ago
- rl from zero pretrain, can it be done? we'll see.☆63Updated 3 weeks ago
- Stochastic Parameter Decomposition☆27Updated this week
- ☆54Updated 4 months ago
- ☆27Updated last year
- Jax like function transformation engine but micro, microjax☆33Updated 8 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆68Updated 2 months ago
- A Python Library for Learning Non-Euclidean Representations☆54Updated this week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆140Updated last month
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆48Updated 3 months ago
- An introduction to LLM Sampling☆79Updated 7 months ago
- Building the cognitive-core to solve ARC-AGI-2☆21Updated 5 months ago
- Evaluation of neuro-symbolic engines☆38Updated 11 months ago
- LeanAgent is a novel lifelong learning framework for formal theorem proving that continuously generalizes to and improves on ever-expandi…☆27Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆91Updated 4 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆89Updated 2 weeks ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆91Updated this week
- This repository contain the simple llama3 implementation in pure jax.☆67Updated 5 months ago
- ☆56Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆73Updated this week
- ☆27Updated last year
- ☆167Updated 3 months ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆20Updated 2 weeks ago
- Code for☆27Updated 7 months ago
- Open source interpretability artefacts for R1.☆154Updated 2 months ago
- In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)☆24Updated 8 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆25Updated 5 months ago