metagene-ai/metagene-pretrain

Pretraining Code for METAGENE-1

☆69

Alternatives and similar repositories for metagene-pretrain

Users who are interested in metagene-pretrain are comparing it to the repositories listed below.

facebookresearch / DIG-In
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20 · Jun 3, 2024 · Updated last year
not-lain / pxia
minimalistic AI library that resembles HF's transformers
☆13 · Dec 31, 2024 · Updated last year
PDBeurope / protvista-pdb
PDB ProtVista Viewer
☆11 · Jul 8, 2025 · Updated 8 months ago
LSSI-ETH / plmfit
PLMFit platform for TL on PLMs
☆20 · Aug 13, 2025 · Updated 6 months ago
nick11roberts / XD
☆12 · Jul 6, 2022 · Updated 3 years ago
si0wang / ViCrit
☆24 · Jun 18, 2025 · Updated 8 months ago
JustinAzoff / pynfdump
python wrapper for the nfdump cli application
☆21 · Apr 8, 2021 · Updated 4 years ago
charlesfrye / cuda-substrings
Because it's there.
☆16 · Sep 22, 2024 · Updated last year
huggingface / hf-nix
☆26 · Nov 18, 2025 · Updated 3 months ago
ivanleomk / modal-grpo
☆19 · Mar 16, 2025 · Updated 11 months ago
SalesforceAIResearch / text2data
☆22 · Feb 12, 2025 · Updated last year
JuliaGeodynamics / JustPIC.jl
Particle-in-Cell advection
☆21 · Mar 1, 2026 · Updated last week
matttreed / diloco-sim
☆23 · Jan 5, 2025 · Updated last year
microsoft / protnote
ProtNote is a multimodal deep learning model that leverages free-form text to enable both supervised and zero-shot protein function predi…
☆58 · Apr 29, 2025 · Updated 10 months ago
ai4protein / VenusMine
☆35 · Jun 17, 2025 · Updated 8 months ago
shoibloya / nuggt-research
☆21 · Jan 25, 2025 · Updated last year
facebookresearch / ExploreToM
Code for ExploreToM
☆91 · Jun 25, 2025 · Updated 8 months ago
amyxlu / cheap-proteins
Joint embedding of protein sequence and structure with discrete and continuous compressions of protein folding model latent spaces. http:…
☆150 · Apr 16, 2025 · Updated 10 months ago
shukai1997 / VSDS-VD
benchmarking AI-powered docking methods from the perspective of virtual screening
☆32 · Dec 26, 2024 · Updated last year
OpenMOSS / Lorsa
☆29 · Nov 9, 2025 · Updated 4 months ago
MeLeLBGU / SaGe
Code for SaGe subword tokenizer (EACL 2023)
☆27 · Nov 30, 2024 · Updated last year
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,191 · Jan 30, 2025 · Updated last year
Arcadia-Science / 2023-nr-clustering
Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.
☆23 · Nov 7, 2025 · Updated 4 months ago
skapadia3214 / groq-shell-assistant
☆23 · Jul 24, 2024 · Updated last year
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆373 · Dec 12, 2024 · Updated last year
NousResearch / StripedHyenaTrainer
☆62 · Dec 8, 2023 · Updated 2 years ago
etowahadams / interprot
[ICML 25] We train and evaluate SAEs to identify interpretable features in pLMs and show their potential for scientific discovery.
☆124 · Jan 16, 2026 · Updated last month
blt2114 / MotifBench
A standardized protein design benchmark for motif-scaffolding problems
☆76 · Jan 27, 2026 · Updated last month
jmikedupont2 / ai-ticket
The AI- and human-powered ticket system for managing AI-based code generation with tickets
☆30 · Jan 22, 2026 · Updated last month
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆76 · Oct 19, 2024 · Updated last year
theopensystemslab / planx-new
Plan✕ is a platform for creating and publishing digital planning services
☆17 · Updated this week
prescient-design / lobster
Lbster: Language models for Biological Sequence Transformation and Evolutionary Representation
☆148 · Updated this week
AIRI-Institute / PROSTATA
☆35 · Sep 21, 2023 · Updated 2 years ago
XiaoduoAILab / XmodelLM
XmodelLM
☆38 · Nov 19, 2024 · Updated last year
bouralab / Prop3D
A protein structure dataset that combines 3D atomic coordinates with biophysical and evolutionary properties for every atom in every "cle…
☆38 · Jul 25, 2024 · Updated last year
HazyResearch / lolcats
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
☆252 · Jan 31, 2025 · Updated last year
fhalab / LevSeq
Pipeline for Nanopore sequencing: demultiplexing, variant calling, and quality visualization with error handling.
☆11 · Jan 25, 2026 · Updated last month
jerhadf / pulse-survey-bot
A bot to automatically take surveys on the Pulse website
☆14 · Mar 6, 2022 · Updated 4 years ago
AlvarEhr / PAROL6-Python-API-Gemini-Vision-Public
☆17 · Sep 3, 2025 · Updated 6 months ago