metagene-ai / metagene-pretrainLinks
Pretraining Code for METAGENE-1
☆68Updated 11 months ago
Alternatives and similar repositories for metagene-pretrain
Users that are interested in metagene-pretrain are comparing it to the libraries listed below
Sorting:
- ☆34Updated last month
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 11 months ago
- ☆31Updated 10 months ago
- ☆62Updated 2 years ago
- Repository to create traveling waves integrate special information through time☆56Updated 4 months ago
- Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau☆152Updated this week
- Implementation of the Pairformer model used in AlphaFold 3☆14Updated last week
- alternative way to calculating self attention☆18Updated last year
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 9 months ago
- An aviary-based data science agent based on jupyter notebooks☆42Updated 2 months ago
- A nano protein structure prediction model based on DeepMind's AlphaFold paper☆32Updated last year
- Pretraining infrastructure for multi-hybrid AI model architectures☆195Updated 4 months ago
- Framework enabling modular interchange of language agents, environments, and optimizers☆116Updated this week
- A language agent gym with challenging scientific tasks☆219Updated this week
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated 6 months ago
- Benchmark for LLM-based Agents in Computational Biology☆65Updated 2 months ago
- ☆40Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 7 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 9 months ago
- aesthetic tensor visualiser☆27Updated 7 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 6 months ago
- ☆89Updated last month
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- ☆62Updated 9 months ago
- look how they massacred my boy☆63Updated last year
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆55Updated 2 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated last year
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆22Updated 5 months ago
- Repository for StripedHyena, a state-of-the-art beyond Transformer architecture☆407Updated last year
- NanoGPT (124M) quality in 2.67B tokens☆28Updated 2 months ago