ogchen / nanofold
A nano protein structure prediction model based on DeepMind's AlphaFold paper
☆23Updated 3 months ago
Related projects: ⓘ
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆51Updated last year
- Rust Implementation of micrograd☆51Updated 2 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆82Updated 3 weeks ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆94Updated 2 weeks ago
- σ-GPT: A New Approach to Autoregressive Models☆53Updated last month
- ☆27Updated 2 months ago
- ☆55Updated 9 months ago
- This repository contains the data and scripts necessary to reproduce the results presented in the paper: **"Scalable molecular simulation…☆35Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆21Updated 2 months ago
- ☆25Updated 4 months ago
- ☆29Updated 3 weeks ago
- ☆38Updated 8 months ago
- Collection of autoregressive model implementation☆62Updated 2 weeks ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆46Updated 5 months ago
- ☆73Updated 5 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆45Updated last month
- alternative way to calculating self attention☆18Updated 3 months ago
- Repository for StripedHyena, a state-of-the-art beyond Transformer architecture☆253Updated 6 months ago
- Implementation of Infini-Transformer in Pytorch☆100Updated last month
- PyTorch implementation of models from the Zamba2 series.☆63Updated last month
- Repository for code used in the xVal paper☆110Updated 5 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆53Updated last month
- Latent Large Language Models☆16Updated 3 weeks ago
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated this week
- A MAD laboratory to improve AI architecture designs 🧪☆84Updated 4 months ago
- ☆16Updated 2 weeks ago
- ☆19Updated last week
- Gpu benchmark☆35Updated 2 weeks ago
- ☆42Updated this week
- Simple and fast low-bit matmul kernels in CUDA☆48Updated this week