ml-researcher / VAE
☆11Updated 2 years ago
Alternatives and similar repositories for VAE:
Users that are interested in VAE are comparing it to the libraries listed below
- ☆23Updated 2 years ago
- ICLR2024 statistics☆47Updated last year
- ICLR2023 statistics☆60Updated last year
- Keras implement of Finite Scalar Quantization☆71Updated last year
- A paper list about diffusion models for natural language processing.☆182Updated last year
- ICLR 2022 Paper submission trend analysis from https://openreview.net/group?id=ICLR.cc/2022/Conference☆86Updated 2 years ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆42Updated 3 years ago
- Lion and Adam optimization comparison☆60Updated 2 years ago
- A repository of useful research/skill-upgrading talks or acticles in NLP/CV/AI Area (in Chinese).☆77Updated 8 months ago
- [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)☆119Updated last year
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆82Updated 2 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆24Updated last year
- Gpu 任务排队☆2Updated last year
- Mixture of Attention Heads☆43Updated 2 years ago
- The implementation of mixup and mainfold mixup method with standard models(PreActRes, WideRes, Dense) in Cifar10, Cifar100 and SVHN datas…☆44Updated 3 years ago
- A curated list of vision-and-language pre-training (VLP). :-)☆58Updated 2 years ago
- A light-weight script for maintaining a LOT of machine learning experiments.☆91Updated 2 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated 4 months ago
- differentiable top-k operator☆21Updated 3 months ago
- ☆25Updated 2 months ago
- 服务器 GPU 监控程序,当 GPU 属性满足预设条件时通过微信发送提示消息☆30Updated 3 years ago
- diffusion generative model☆181Updated 2 years ago
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆12Updated 10 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆37Updated 5 months ago
- Recent Advances on MLLM's Reasoning Ability☆24Updated this week
- A torch-based implementation of K-Means and K-Means++☆17Updated 4 years ago
- More light-weight pytorch experiment management library!☆63Updated last year
- ☆16Updated 3 years ago
- Crawl & visualize ICLR papers and reviews.☆18Updated 2 years ago
- ☆25Updated last year