OpenRL-Lab / PyTorch_TutorialLinks
PyTorch使用技巧和教程
☆11Updated 2 years ago
Alternatives and similar repositories for PyTorch_Tutorial
Users that are interested in PyTorch_Tutorial are comparing it to the libraries listed below
Sorting:
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries☆29Updated last year
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆22Updated last year
- ☆73Updated 2 months ago
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆60Updated last year
- An In-depth Analysis of Diffusion Probability Model☆116Updated 8 months ago
- Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆72Updated last year
- Keras implement of Finite Scalar Quantization☆78Updated last year
- ☆61Updated last year
- Pytorch implementation of our paper accepted by ECCV2022 -- Knowledge Condensation Distillation https://arxiv.org/abs/2207.05409☆30Updated 2 years ago
- A repository for DenseSSMs☆87Updated last year
- Self-Expanding Neural Networks☆38Updated last year
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆28Updated last month
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆139Updated 2 months ago
- ☆18Updated 8 months ago
- Scaling RWKV-Like Architectures for Diffusion Models☆136Updated last year
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆116Updated 2 weeks ago
- Efficient Mixture of Experts for LLM Paper List☆84Updated 7 months ago
- ☆196Updated last year
- LMM solved catastrophic forgetting, AAAI2025☆44Updated 3 months ago
- A tiny, didactical implementation of LLAMA 3☆41Updated 7 months ago
- Exploring Diffusion Transformer Designs via Grafting☆46Updated last month
- mllm-npu: training multimodal large language models on Ascend NPUs☆91Updated 10 months ago
- A torch-based implementation of K-Means and K-Means++☆17Updated 4 years ago
- Pruning the VLLMs☆97Updated 7 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆52Updated 8 months ago
- Poster Templates (PPT and LaTeX)☆66Updated 2 years ago
- Tutorial for Ray☆28Updated last year
- Github repository for the paper Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers.☆30Updated 4 months ago
- ☆109Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year