phonism/genesis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/phonism/genesis)

phonism / genesis

Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing.

☆35

Alternatives and similar repositories for genesis

Users that are interested in genesis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

warlockee / oxRL
View on GitHub
A lightweight post-training framework for LLMs and VLMs. 51 algorithms, 38 verified models. Scales with DeepSpeed, vLLM, and Ray.
☆19May 6, 2026Updated 2 months ago
vdcores / vdcores
View on GitHub
Virtual Decoupled Cores: Composable Programming Framework and Runtime for Async GPUs
☆20Updated this week
hku-systems / naspipe
View on GitHub
☆14Jan 12, 2022Updated 4 years ago
drbh / yamoe
View on GitHub
🔀 yet another mixture of experts
☆23Jun 5, 2026Updated last month
Qwesh157 / conv_op_optimization
View on GitHub
This project is about convolution operator optimization on GPU, include GEMM based (Implicit GEMM) convolution.
☆44Sep 29, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lzhangbv / acpsgd
View on GitHub
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10Apr 28, 2023Updated 3 years ago
jdai019 / domain-adaptation-lesion-assessment
View on GitHub
☆14Apr 1, 2025Updated last year
chengzeyi / piflux
View on GitHub
(WIP) Parallel inference for black-forest-labs' FLUX model.
☆19Nov 18, 2024Updated last year
wangrunji0408 / rjrouter
View on GitHub
[AFK] Hardware router in Chisel (THU Network Joint Lab 2020)
☆14Oct 8, 2020Updated 5 years ago
dhcode-cpp / easy-dualpipe
View on GitHub
Pipeline-Parallel Lecture: Simplest Dualpipe Implementation.
☆31Sep 17, 2025Updated 10 months ago
ConvolutedDog / gpgpu-sim-comments
View on GitHub
GPGPU-Sim 中文注释版代码，包含 GPGPU-Sim 模拟器的最新版代码，经过中文注释，以帮助中文用户更好地理解和使用该模拟器。
☆30Dec 18, 2024Updated last year
harvard-cns / Harvard-CNS-Seminar
View on GitHub
Reading seminar in Harvard Cloud Networking and Systems Group
☆16Aug 29, 2022Updated 3 years ago
kabir2505 / tiny-mixtral
View on GitHub
☆44May 4, 2025Updated last year
menglinjian / Deep-FTRL-ORW
View on GitHub
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11Dec 1, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
habanero-lab / APPy
View on GitHub
APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…
☆29Mar 22, 2026Updated 4 months ago
sonnyli / flash_attention_from_scratch
View on GitHub
Flash Attention from Scratch on CUDA Ampere
☆186Sep 1, 2025Updated 10 months ago
junyuan-chen / LCPsolve.jl
View on GitHub
A solver for linear complementarity problems
☆12Dec 16, 2021Updated 4 years ago
gudiandian / ElasticFlow
View on GitHub
☆17May 10, 2024Updated 2 years ago
xiaguan / pegaflow
View on GitHub
PegaFlow is a high-performance KV cache offloading solution for vLLM v1 on single-node multi-GPU setups.
☆25Jan 7, 2026Updated 6 months ago
gokce-d / StackelbergMFG_epidemics
View on GitHub
This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b…
☆13Nov 22, 2021Updated 4 years ago
forket86 / ANNEA
View on GitHub
ANN-based Expectations Algorithm applied to the Neoclassical Investment Model
☆10Mar 15, 2023Updated 3 years ago
open-lm-engine / accelerated-model-architectures
View on GitHub
A bunch of kernels that might make stuff slower 😉
☆91Updated this week
liangyuwang / Tiny-DeepSpeed
View on GitHub
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
☆53Aug 20, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
coderlemon17 / LemonScripts
View on GitHub
Here is the repo for public scripts.
☆12Jul 16, 2022Updated 4 years ago
eddiegaoo / Apt-Serve
View on GitHub
☆21Jun 9, 2025Updated last year
oliverYoung2001 / UltraAttn
View on GitHub
SC'25 UltraAttn: Efficiently Parallelizing Attention through Hierarchical Context-Tiling
☆16Aug 14, 2025Updated 11 months ago
lastweek / FlashRL
View on GitHub
☆16Mar 24, 2026Updated 4 months ago
illinois-nsai / dede
View on GitHub
DeDe (OSDI '25): an optimization framework for large-scale resource allocation
☆15May 18, 2026Updated 2 months ago
amckay / OptStab
View on GitHub
Replication material for "Optimal Automatic Stabilizers"
☆11Aug 9, 2021Updated 4 years ago
vegaction / nanorllm
View on GitHub
mini project for nanorllm
☆64Mar 31, 2026Updated 3 months ago
marsggbo / NAS-LID
View on GitHub
[AAAI2023] NAS-LID: Efficient Neural Architecture Search with Local Intrinsic Dimension
☆17Dec 20, 2022Updated 3 years ago
jesusfv / Parallel_Computing
View on GitHub
☆10Jan 25, 2018Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
DeepLink-org / dlinfer
View on GitHub
☆74Updated this week
JulienPascal / bc-MC_Operator
View on GitHub
This repository contains the code to generate results from the paper "Artificial Neural Networks to solve dynamic programming problems: a…
☆10May 24, 2024Updated 2 years ago
liangyuRain / ForestColl
View on GitHub
☆20Jun 1, 2026Updated last month
xinhaoc / ferret
View on GitHub
Autonomous CUDA kernel optimization agent with structured task specs and per-config scoring
☆17Jun 17, 2026Updated last month
weishengying / tiny-flash-attention
View on GitHub
使用 cutlass 实现 flash-attention 精简版，具有教学意义
☆59Aug 12, 2024Updated last year
anh-tong / nanoGPT-equinox
View on GitHub
nanoGPT using Equinox
☆15Mar 3, 2023Updated 3 years ago
ScalingIntelligence / caesar
View on GitHub
Throughput-oriented multi-turn inference engine for KernelBench [ICML '25]
☆24May 27, 2025Updated last year