lucidrains/firefly-torch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/firefly-torch)

lucidrains / firefly-torch

Exploration into the Firefly algorithm in Pytorch

☆41

Alternatives and similar repositories for firefly-torch

Users that are interested in firefly-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / gotennet-pytorch
View on GitHub
Unofficial implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch
☆68Apr 7, 2025Updated last year
charlesfrye / cuda-substrings
View on GitHub
Because it's there.
☆16Sep 22, 2024Updated last year
lucidrains / genetic-algorithm-pytorch
View on GitHub
Toy genetic algorithm in Pytorch
☆58Apr 21, 2026Updated 3 months ago
lucidrains / GAF-microbatch-pytorch
View on GitHub
Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch
☆25Jan 21, 2025Updated last year
lucidrains / HoST-pytorch
View on GitHub
Implementation of Humanoid Standing Up, from the paper "Learning Humanoid Standing-up Control across Diverse Postures" out of Shanghai, i…
☆45May 3, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
lucidrains / transformer-directed-evolution
View on GitHub
Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster
☆72Jun 2, 2026Updated last month
lucidrains / evolutionary-policy-optimization
View on GitHub
Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University
☆110May 18, 2026Updated 2 months ago
lucidrains / streaming-deep-rl
View on GitHub
Explorations into the proposed Streaming Deep Reinforcement Learning, from University of Alberta
☆33May 18, 2026Updated 2 months ago
lucidrains / autoregressive-linear-attention-cuda
View on GitHub
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆46May 23, 2023Updated 3 years ago
lucidrains / mind-evolution
View on GitHub
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆59May 31, 2025Updated last year
lucidrains / light-recurrent-unit-pytorch
View on GitHub
Implementation of a Light Recurrent Unit in Pytorch
☆50Oct 6, 2024Updated last year
lucidrains / self-reasoning-tokens-pytorch
View on GitHub
Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto
☆57May 17, 2024Updated 2 years ago
kklemon / FlashPerceiver
View on GitHub
Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.
☆32Nov 4, 2024Updated last year
kyegomez / GPT3
View on GitHub
An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"
☆22Jun 29, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kyegomez / OpenStrawberry
View on GitHub
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆31Updated this week
lucidrains / llama-qrlhf
View on GitHub
Implementation of the Llama architecture with RLHF + Q-learning
☆170Feb 1, 2025Updated last year
lucidrains / grokfast-pytorch
View on GitHub
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆104Dec 22, 2024Updated last year
davisyoshida / abnormal-floats
View on GitHub
Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)
☆20Jun 22, 2023Updated 3 years ago
tinker495 / jax-baseline
View on GitHub
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆67Updated this week
eager-dev / eagerx_tutorials
View on GitHub
Tutorials on how to use EAGERx
☆16Aug 14, 2025Updated 11 months ago
lucidrains / flash-attention-jax
View on GitHub
Implementation of Flash Attention in Jax
☆229Mar 1, 2024Updated 2 years ago
nf-core / deepmodeloptim
View on GitHub
Stochastic Testing and Input Manipulation for Unbiased Learning Systems
☆31Updated this week
lucidrains / axial-positional-embedding
View on GitHub
Axial Positional Embedding for Pytorch
☆84Feb 25, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
lucidrains / neat
View on GitHub
Explorations into NEAT and some of its derivative research
☆41Updated this week
smonsays / contrastive-meta-learning
View on GitHub
Code accompanying the paper "A contrastive rule for meta-learning"
☆13Oct 31, 2024Updated last year
szc12153 / sparse_interpolated_experts
View on GitHub
Official implementation for Sparse MetA-Tuning (SMAT)
☆17Jul 29, 2025Updated last year
The-Swarm-Corporation / Mamba-R1
View on GitHub
Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…
☆25Oct 13, 2025Updated 9 months ago
sjchoi86 / 2022-1-intelligent-robotics
View on GitHub
☆19May 22, 2022Updated 4 years ago
lucidrains / mirasol-pytorch
View on GitHub
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
☆92Dec 22, 2023Updated 2 years ago
Jellyfish042 / RWKV-15Puzzle
View on GitHub
☆12Dec 14, 2024Updated last year
google-research / diffren
View on GitHub
☆26Jul 13, 2026Updated 2 weeks ago
lucidrains / hippoformer
View on GitHub
Unofficial implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers
☆53Apr 28, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
DemisEom / RNNT-pytorch
View on GitHub
Implementaion RNN tranceducer
☆23Jun 25, 2019Updated 7 years ago
rwightman / imagenet-12k
View on GitHub
ImageNet-12k subset of ImageNet-21k (fall11)
☆23Jun 13, 2023Updated 3 years ago
lucidrains / distilled-retriever-pytorch
View on GitHub
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32Dec 16, 2020Updated 5 years ago
crowsonkb / dice-mc
View on GitHub
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
☆33Jul 28, 2023Updated 3 years ago
lucidrains / zorro-pytorch
View on GitHub
Implementation of Zorro, Masked Multimodal Transformer, in Pytorch
☆98Oct 20, 2023Updated 2 years ago
lucidrains / PEER-pytorch
View on GitHub
Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind
☆137Nov 1, 2025Updated 8 months ago
fidel-schaposnik / muzero
View on GitHub
Tensorflow implementation of MuZero algorithm
☆11Aug 23, 2022Updated 3 years ago