metagene-ai / metagene-pretrain
Pretraining Code for METAGENE-1
☆64Updated 3 months ago
Alternatives and similar repositories for metagene-pretrain:
Users who are interested in metagene-pretrain are comparing it to the repositories listed below:
- An aviary-based data science agent based on jupyter notebooks☆13Updated this week
- ☆61Updated last year
- Implementation of the Pairformer model used in AlphaFold 3☆12Updated this week
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 3 months ago
- ☆99Updated last month
- Repository to create traveling waves that integrate spatial information through time☆50Updated last month
- ☆34Updated 3 months ago
- look how they massacred my boy☆63Updated 6 months ago
- Agent framework for constructing language model agents and training on constructive tasks.☆72Updated this week
- ☆38Updated 9 months ago
- Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders☆168Updated 2 months ago
- Very minimal (and stateless) agent framework☆42Updated 3 months ago
- Collection of LLM completions for reasoning-gym task datasets☆19Updated this week
- reasoning model trained using GRPO towards rosetta REF2015 for protein stability☆64Updated this week
- Benchmark for LLM-based Agents in Computational Biology☆34Updated this week
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated last month
- Pretraining infrastructure for multi-hybrid AI model architectures☆149Updated this week
- Alternative way of calculating self-attention☆18Updated 11 months ago
- ☆47Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- ☆28Updated 2 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last week
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 6 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 6 months ago
- ☆27Updated 9 months ago
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 2 months ago
- QLoRA for Masked Language Modeling☆22Updated last year
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 10 months ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆52Updated last year
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated last year