HPC-SJTU / xfoldLinks
Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction
☆21Updated 8 months ago
Alternatives and similar repositories for xfold
Users that are interested in xfold are comparing it to the libraries listed below
Sorting:
- Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction☆55Updated last year
- A Throughput-Optimized Pipeline Parallel Inference System for Large Language Models☆46Updated last month
- Learning TileLang with 10 puzzles!☆118Updated 2 weeks ago
- 🧪 Ultrafast bisulfite☆38Updated last year
- Intel® Tensor Processing Primitives extension for Pytorch*☆18Updated 3 weeks ago
- The Zaychik Power Controller server☆13Updated last year
- LeetGPU Solutions☆107Updated 4 months ago
- Repository for HPCGame 1st Problems.☆70Updated 2 years ago
- performance engineering☆30Updated last year
- A disitributed implementation of alphafold3 base on xfold and tpp-pytorch-extension☆12Updated 8 months ago
- A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …☆313Updated 8 months ago
- ☆37Updated last week
- Puzzles for learning Triton, play it with minimal environment configuration!☆624Updated last month
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆24Updated 2 years ago
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆42Updated this week
- ☆288Updated last week
- OpenCAEPoro for ASC 2024☆38Updated 2 years ago
- Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM☆75Updated 6 months ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆324Updated last year
- Solution of Programming Massively Parallel Processors☆49Updated 2 years ago
- High performance Transformer implementation in C++.☆151Updated last year
- Summary of some awesome work for optimizing LLM inference☆173Updated 2 months ago
- The dataset and baseline code for ASC23 LLM inference optimization challenge.☆32Updated 2 years ago
- Flash Attention from Scratch on CUDA Ampere☆129Updated 5 months ago
- Examples of CUDA implementations by Cutlass CuTe☆270Updated 7 months ago
- A lightweight design for computation-communication overlap.☆219Updated 3 weeks ago
- Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…☆283Updated 11 months ago
- gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling☆53Updated last month
- Ascend TileLang adapter☆217Updated this week
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆30Updated 9 months ago