cyd3r / notify-free-gpu
A telegram bot that sends you a message when the GPU is in use
☆9Updated 7 months ago
Alternatives and similar repositories for notify-free-gpu:
Users that are interested in notify-free-gpu are comparing it to the libraries listed below
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆39Updated 2 months ago
- ☆31Updated 7 months ago
- The official repo of continuous speculative decoding☆21Updated 2 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆31Updated 2 years ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆47Updated last month
- ☆52Updated last week
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆23Updated last month
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆18Updated 9 months ago
- A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P…☆71Updated last year
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆76Updated 9 months ago
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction☆26Updated 8 months ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆121Updated 7 months ago
- ☆41Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆24Updated 11 months ago
- [ICML 2024] LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery☆61Updated 7 months ago
- Generative Equilibrium Transformer☆17Updated last year
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆12Updated 7 months ago
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations"☆31Updated last year
- Official repository of InLine attention (NeurIPS 2024)☆35Updated last month
- Triton implement of bi-directional (non-causal) linear attention☆35Updated last week
- FlexAttention w/ FlashAttention3 Support☆27Updated 3 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated last year
- Clean and anonymized code to be submitted to conferences☆12Updated 4 months ago
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆24Updated last year
- [CVPR 2024] Official repository for "Tactile-Augmented Radiance Fields".☆53Updated 3 months ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆26Updated 3 weeks ago
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)☆45Updated 3 months ago
- ☆17Updated last year