Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster
☆71 · May 18, 2025 · Updated 10 months ago
Alternatives and similar repositories for transformer-directed-evolution
Users that are interested in transformer-directed-evolution are comparing it to the libraries listed below.
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch ☆25 · Jan 21, 2025 · Updated last year
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch ☆153 · May 2, 2025 · Updated 10 months ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch ☆73 · Nov 18, 2025 · Updated 4 months ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆41 · Dec 21, 2025 · Updated 3 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind ☆59 · May 31, 2025 · Updated 9 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al. at Google Research, in Pytorch ☆96 · Feb 24, 2025 · Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch ☆63 · Dec 29, 2025 · Updated 2 months ago
- Explorations into improving ViTArc with Slot Attention ☆43 · Oct 19, 2024 · Updated last year
- open source alpha evolve ☆68 · May 19, 2025 · Updated 10 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆37 · Sep 23, 2024 · Updated last year
- Unofficial implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch ☆67 · Apr 7, 2025 · Updated 11 months ago
- Implementation of a multimodal diffusion transformer in Pytorch ☆107 · Jun 24, 2024 · Updated last year
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction ☆32 · Jun 19, 2022 · Updated 3 years ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h… ☆54 · Jul 2, 2023 · Updated 2 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch ☆135 · Oct 15, 2025 · Updated 5 months ago
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch ☆1,935 · Feb 9, 2026 · Updated last month
- Implementation of the dynamic chunking mechanism in H-net by Hwang et al. of Carnegie Mellon ☆68 · Feb 8, 2026 · Updated last month
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆46 · May 23, 2023 · Updated 2 years ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆17 · Apr 22, 2025 · Updated 11 months ago
- Axial Positional Embedding for Pytorch ☆84 · Feb 25, 2025 · Updated last year
- Implementation of a holodeck, written in Pytorch ☆18 · Nov 1, 2023 · Updated 2 years ago
- [NeurIPS 2025 spotlight] Efficient factorized variant of the IPA module. ☆46 · Nov 14, 2025 · Updated 4 months ago
- Repository for code and models for the paper "Extrapolative Controlled Sequence Generation via Iterative Refinement" ☆16 · Mar 5, 2024 · Updated 2 years ago
- Implementation of AlphaGenome, Deepmind's updated genomic attention model ☆97 · Updated this week
- Implementation of Kronecker Attention in Pytorch ☆19 · Sep 12, 2020 · Updated 5 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 · Oct 22, 2023 · Updated 2 years ago
- Official implementation of the paper "Light Transport-aware Diffusion Posterior Sampling for Single View Reconstruction of Volumes" ☆18 · Aug 1, 2025 · Updated 7 months ago
- ☆11 · Oct 11, 2023 · Updated 2 years ago
- FlashRNN - Fast RNN Kernels with I/O Awareness ☆177 · Oct 20, 2025 · Updated 5 months ago
- A simple Transformer where the softmax has been replaced with normalization ☆20 · Sep 11, 2020 · Updated 5 years ago
- Implementation of the proposed minGRU in Pytorch ☆319 · Dec 10, 2025 · Updated 3 months ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences" ☆70 · Apr 10, 2023 · Updated 2 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models ☆30 · May 31, 2022 · Updated 3 years ago
- A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering ☆43 · Nov 8, 2020 · Updated 5 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch ☆25 · Jan 6, 2021 · Updated 5 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation ☆90 · Oct 11, 2024 · Updated last year
- Pytorch reimplementation of Molecule Attention Transformer, which uses a transformer to tackle the graph-like structure of molecules ☆58 · Dec 2, 2020 · Updated 5 years ago
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind ☆179 · Sep 12, 2024 · Updated last year
- Here we will test various linear attention designs. ☆62 · Apr 25, 2024 · Updated last year