metagene-ai / metagene-pretrain
Pretraining Code for METAGENE-1
☆67Updated 7 months ago
Alternatives and similar repositories for metagene-pretrain
Users who are interested in metagene-pretrain are comparing it to the libraries listed below.
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 7 months ago
- Pretraining infrastructure for multi-hybrid AI model architectures☆181Updated last month
- ☆28Updated last month
- Framework enabling modular interchange of language agents, environments, and optimizers☆104Updated last week
- ☆61Updated last year
- Implementation of the Pairformer model used in AlphaFold 3☆14Updated last week
- Benchmark for LLM-based Agents in Computational Biology☆50Updated 2 months ago
- ☆30Updated 6 months ago
- An aviary-based data science agent based on jupyter notebooks☆35Updated 2 months ago
- A nano protein structure prediction model based on DeepMind's AlphaFold paper☆30Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆67Updated last year
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 5 months ago
- Generic MCP Client to use any MCP tool in a chat☆44Updated 3 months ago
- Reasoning model trained using GRPO towards Rosetta REF2015 for protein stability☆89Updated last month
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆54Updated 2 years ago
- A language agent gym with challenging scientific tasks☆201Updated this week
- ☆56Updated 9 months ago
- Making folding experiments more accessible.☆56Updated last month
- Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders☆191Updated 7 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind☆56Updated 3 months ago
- Repository for creating traveling waves that integrate spatial information through time☆55Updated 3 weeks ago
- Repository for StripedHyena, a state-of-the-art beyond Transformer architecture☆390Updated last year
- Alternative way of calculating self-attention☆18Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 5 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆219Updated 3 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated 3 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 8 months ago
- Implementation of AlphaGenome, DeepMind's updated genomic attention model☆65Updated 3 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆143Updated 3 months ago
- aesthetic tensor visualiser☆24Updated 4 months ago