Code repository for study ''Evaluating the representational power of pre-trained DNA language models for regulatory genomics"
☆24Jun 26, 2024Updated last year
Alternatives and similar repositories for LLM_eval
Users that are interested in LLM_eval are comparing it to the libraries listed below
Sorting:
- ☆44Feb 11, 2026Updated 2 weeks ago
- ☆24Jun 5, 2025Updated 8 months ago
- Evolution-inspired data augmentations for PyTorch-based models for regulatory genomics☆25Jun 3, 2025Updated 8 months ago
- Jax code for functional genomics ML☆14Mar 5, 2025Updated 11 months ago
- ☆15Jan 28, 2026Updated last month
- Pytorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement.☆97Nov 13, 2025Updated 3 months ago
- RNA-seq prediction with deep convolutional neural networks.☆226Aug 28, 2025Updated 6 months ago
- PSI-MOD ontology for modified and unmodified amino acid residues☆14Jan 8, 2026Updated last month
- ☆23Jul 8, 2025Updated 7 months ago
- Genome annotation pre-publication results☆25Updated this week
- ☆10Jun 9, 2025Updated 8 months ago
- Map query sequences to the assemblies of all pre-June 2023 bacteria (https://ftp.ebi.ac.uk/pub/databases/AllTheBacteria/Releases/0.2/) on…☆12May 22, 2024Updated last year
- [ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.☆17Jun 19, 2025Updated 8 months ago
- ☆13Apr 23, 2025Updated 10 months ago
- Biological sequence analysis for the modern age.☆263Feb 22, 2026Updated last week
- ☆16Feb 15, 2026Updated 2 weeks ago
- ☆13Jan 7, 2026Updated last month
- QuickProt: A Fast and Accurate Homology-Based Protein Annotation Tool for Non-Model Organisms to Advance Comparative Genomics☆16Jan 12, 2026Updated last month
- ☆12Jun 7, 2024Updated last year
- Library to extract embeddings for DNA sequences using BioFM genomics foundation model☆19Aug 13, 2025Updated 6 months ago
- Taxonomy classification of viral sequences / contigs☆12Jul 15, 2025Updated 7 months ago
- Build DeepSea training dataset from raw data☆31Jul 6, 2023Updated 2 years ago
- Orthrus is a mature RNA model for RNA property prediction. It uses a mamba encoder backbone, a variant of state-space models specifical…☆89Dec 10, 2025Updated 2 months ago
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of …☆40Mar 27, 2025Updated 11 months ago
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆60Aug 2, 2024Updated last year
- ☆19Dec 20, 2025Updated 2 months ago
- An LC-MS/MS glycan and glycopeptide search engine☆13Sep 15, 2025Updated 5 months ago
- Benchmarking DNA Language Models on Biologically Meaningful Tasks☆129Oct 31, 2024Updated last year
- Generation of disentangled microenvironment-induced and intrinsic gene expression vectors from spatial transcriptomics data☆24Feb 19, 2026Updated last week
- Robust individual and aggregate checksums for nucleotide sequences☆17Feb 8, 2026Updated 3 weeks ago
- A command-line tool to mitigate homology-based data leakage in sequence-to-expression models☆19Oct 27, 2025Updated 4 months ago
- Analyses related to the Borzoi paper.☆25Dec 14, 2025Updated 2 months ago
- Calculate genome wide average nucleotide identity (gwANI) for a multiFASTA alignment☆16Dec 5, 2018Updated 7 years ago
- ☆19Feb 15, 2025Updated last year
- A deep learning approach to predicting transcription initiation from sequence at single nucleotide resolution☆14Updated this week
- ☆21Jan 13, 2026Updated last month
- ☆16Feb 14, 2025Updated last year
- PAF (pairwise alignment format) validator based on extended CIGAR strings☆15Aug 10, 2025Updated 6 months ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year