OpenRL-Lab / PyTorch_TutorialLinks

PyTorch使用技巧和教程

☆11

Alternatives and similar repositories for PyTorch_Tutorial

Users that are interested in PyTorch_Tutorial are comparing it to the libraries listed below

Sorting:

Rose-STL-Lab / Teleportation-Optimization
[ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries
☆29Updated last year
buttercutter / Mamba_SSM
A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)
☆22Updated last year
Caiyun-AI / MUDDFormer
☆73Updated 2 months ago
zeke-xie / stable-weight-decay-regularization
[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.
☆60Updated last year
blairstar / The_Art_of_DPM
An In-depth Analysis of Diffusion Probability Model
☆116Updated 8 months ago
test-time-training / ttt-lm-kernels
Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆72Updated last year
bojone / FSQ
Keras implement of Finite Scalar Quantization
☆78Updated last year
ICT-ANS / StarLight
☆61Updated last year
dzy3 / KCD
Pytorch implementation of our paper accepted by ECCV2022 -- Knowledge Condensation Distillation https://arxiv.org/abs/2207.05409
☆30Updated 2 years ago
WailordHe / DenseSSM
A repository for DenseSSMs
☆87Updated last year
ml-research / self-expanding-neural-networks
Self-Expanding Neural Networks
☆38Updated last year
Letian2003 / MM_INF
An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…
☆28Updated last month
deepglint / RWKV-CLIP
[EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner
☆139Updated 2 months ago
radarFudan / mamba
☆18Updated 8 months ago
feizc / Diffusion-RWKV
Scaling RWKV-Like Architectures for Diffusion Models
☆136Updated last year
zhixuan-lin / forgetting-transformer
[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"
☆116Updated 2 weeks ago
pprp / Awesome-Efficient-MoE
Efficient Mixture of Experts for LLM Paper List
☆84Updated 7 months ago
transformer-vq / transformer_vq
☆196Updated last year
360CVGroup / Inner-Adaptor-Architecture
LMM solved catastrophic forgetting, AAAI2025
☆44Updated 3 months ago
vedaldi / micro_llama
A tiny, didactical implementation of LLAMA 3
☆41Updated 7 months ago
keshik6 / grafting
Exploring Diffusion Transformer Designs via Grafting
☆46Updated last month
TencentARC / mllm-npu
mllm-npu: training multimodal large language models on Ascend NPUs
☆91Updated 10 months ago
alihassanijr / TorchKMeans
A torch-based implementation of K-Means and K-Means++
☆17Updated 4 years ago
ZhangAIPI / YOPO_MLLM_Pruning
Pruning the VLLMs
☆97Updated 7 months ago
lucidrains / LVMAE-pytorch
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
☆52Updated 8 months ago
SuperBruceJia / Poster_Template
Poster Templates (PPT and LaTeX)
☆66Updated 2 years ago
OpenRL-Lab / Ray_Tutorial
Tutorial for Ray
☆28Updated last year
tobna / WhatTransformerToFavor
Github repository for the paper Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers.
☆30Updated 4 months ago
wutaiqiang / MoSLoRA
☆109Updated last year
lucasjinreal / LLaVA-Magvit2
LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.
☆37Updated last year