advait / c4a0
Alpha-Zero Connect Four NN trained via self play
☆13Updated last month
Related projects ⓘ
Alternatives and complementary repositories for c4a0
- Genetics for Language Models☆11Updated 4 months ago
- Latent Large Language Models☆16Updated 2 months ago
- ☆36Updated 3 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Training hybrid models for dummies.☆15Updated 2 weeks ago
- ☆31Updated 2 weeks ago
- ☆27Updated 4 months ago
- ☆20Updated last week
- alternative way to calculating self attention☆18Updated 5 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- Jax like function transformation engine but micro, microjax☆26Updated 2 weeks ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆20Updated last week
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluating☆31Updated this week
- look how they massacred my boy☆54Updated 3 weeks ago
- ☆40Updated last week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- GoldFinch and other hybrid transformer components☆39Updated 3 months ago
- ☆12Updated last week
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated this week
- Implementation of Spectral State Space Models☆17Updated 8 months ago
- ☆11Updated 3 weeks ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆22Updated last month
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆13Updated last month
- ☆55Updated 11 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆27Updated this week
- ☆12Updated 3 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆20Updated last week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 5 months ago
- BH hackathon☆14Updated 7 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago