HPC-SJTU / xfoldLinks
Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction
☆20Updated 5 months ago
Alternatives and similar repositories for xfold
Users that are interested in xfold are comparing it to the libraries listed below
Sorting:
- Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction☆50Updated 10 months ago
- A Throughput-Optimized Pipeline Parallel Inference System for Large Language Models☆44Updated 3 months ago
- 🧪 Ultrafast bisulfite☆37Updated last year
- The Zaychik Power Controller server☆13Updated last year
- OpenCAEPoro for ASC 2024☆37Updated last year
- Repository for HPCGame 1st Problems.☆68Updated last year
- Puzzles for learning Triton, play it with minimal environment configuration!☆559Updated last month
- ☆263Updated last week
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆298Updated 10 months ago
- Solution of Programming Massively Parallel Processors☆50Updated last year
- performance engineering☆30Updated last year
- Intel® Tensor Processing Primitives extension for Pytorch*☆17Updated last month
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆24Updated 2 years ago
- A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …☆285Updated 5 months ago
- ☆47Updated last year
- Examples of CUDA implementations by Cutlass CuTe☆246Updated 4 months ago
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning☆15Updated last year
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆154Updated 3 years ago
- A Easy-to-understand TensorOp Matmul Tutorial☆389Updated last month
- High performance Transformer implementation in C++.☆140Updated 9 months ago
- Documentation for HPC course☆157Updated 4 months ago
- A torch compile backend for multi-targets☆40Updated this week
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆33Updated 2 weeks ago
- A disitributed implementation of alphafold3 base on xfold and tpp-pytorch-extension☆12Updated 5 months ago
- ucas hpc course code☆15Updated 2 years ago
- ☆131Updated 2 weeks ago
- Wiki fo HPC☆123Updated 3 months ago
- Distributed Compiler based on Triton for Parallel Systems☆1,214Updated 3 weeks ago
- FlagGems is an operator library for large language models implemented in the Triton Language.☆749Updated this week
- Summary of some awesome work for optimizing LLM inference☆134Updated last week