ag8 / sha-transformer
☆12 Updated last year
Alternatives and similar repositories for sha-transformer
Users who are interested in sha-transformer are comparing it to the libraries listed below.
- Grokking on modular arithmetic in less than 150 epochs in MLX ☆14 Updated last year
- ☆17 Updated 3 weeks ago
- ☆22 Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆141 Updated 4 months ago
- Solidity contracts for the decentralized Prime Network protocol ☆27 Updated 6 months ago
- ☆27 Updated 3 months ago
- Efficient Zero-Knowledge Proofs for LoRA Verification ☆152 Updated last month
- ☆27 Updated last year
- Modded vLLM to run pipeline parallelism over public networks ☆41 Updated 7 months ago
- ☆47 Updated 7 months ago
- Sparse autoencoders for Contra text embedding models ☆25 Updated last year
- TOPLOC is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat… ☆51 Updated 8 months ago
- RL gym for vision language models written in JAX ☆140 Updated 2 months ago
- A visual interface for understanding and interpreting Transformers ☆77 Updated 2 years ago
- Minimal open-source implementation of AlphaProof [WIP] ☆59 Updated this week
- ☆13 Updated 2 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆125 Updated 3 months ago
- Implementation of Direct Preference Optimization ☆17 Updated 2 years ago
- microjax: a micro, JAX-like function transformation engine ☆34 Updated last year
- ☆51 Updated 2 months ago
- ☆31 Updated 2 years ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆58 Updated last year
- Simple Transformer in Jax ☆140 Updated last year
- H-Net Dynamic Hierarchical Architecture ☆80 Updated 4 months ago
- ☆42 Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆18 Updated 5 months ago
- This is the official repository for all the code of TheoremLlama ☆47 Updated 5 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆73 Updated 8 months ago
- minimal Energy-based transformer ☆42 Updated last month
- ☆14 Updated last year