ml-researcher / VAE
☆11Updated 2 years ago
Alternatives and similar repositories for VAE:
Users that are interested in VAE are comparing it to the libraries listed below
- ☆23Updated 2 years ago
- [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)☆119Updated 11 months ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆80Updated last year
- A paper list about diffusion models for natural language processing.☆181Updated last year
- ICLR2024 statistics☆47Updated last year
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆42Updated 3 years ago
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16Updated 9 months ago
- Keras implement of Finite Scalar Quantization☆69Updated last year
- A repository of useful research/skill-upgrading talks or acticles in NLP/CV/AI Area (in Chinese).☆72Updated 6 months ago
- ☆46Updated last month
- A repository for DenseSSMs☆86Updated 10 months ago
- Crawl & visualize ICLR papers and reviews.☆18Updated 2 years ago
- Mixture of Attention Heads☆41Updated 2 years ago
- Idempotent Generative Network's unofficial pytorch implementation☆45Updated last year
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆55Updated last year
- Tips for paper writing and researches 科技论文写作经验记录和总结☆130Updated 3 years ago
- A light-weight script for maintaining a LOT of machine learning experiments.☆91Updated 2 years ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆33Updated 3 months ago
- ICLR 2022 Paper submission trend analysis from https://openreview.net/group?id=ICLR.cc/2022/Conference☆86Updated 2 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆24Updated last year
- 服务器 GPU 监控程序,当 GPU 属性满足预设条件时通过微信发送提示消息☆29Updated 3 years ago
- ☆38Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆26Updated last year
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆35Updated 4 months ago
- ☆73Updated 2 years ago
- 抢占显卡☆62Updated 4 months ago
- Lion and Adam optimization comparison☆57Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆48Updated 8 months ago
- ☆54Updated last year
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆59Updated last year