lewisjs4 / csce311
☆11Updated this week
Alternatives and similar repositories for csce311:
Users that are interested in csce311 are comparing it to the libraries listed below
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆10Updated 2 years ago
- Advanced Programming Language☆8Updated last year
- Training diffusion model with CIFAR10 dataset(insight from 13 papers)☆14Updated last week
- Bad Quant Recruiters☆22Updated 4 months ago
- CVPR2023: Vector Quantization with Self-Attention for Quality-Independent Representation Learning.☆14Updated 8 months ago
- ☆39Updated last year
- Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D G…☆15Updated 2 years ago
- Official Implementation Of The Paper: `DeciMamba: Exploring the Length Extrapolation Potential of Mamba'☆23Updated 6 months ago
- Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)☆44Updated last year
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆32Updated last month
- Transformers trained on Tiny ImageNet☆51Updated 2 years ago
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆21Updated 11 months ago
- Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"☆70Updated 2 weeks ago
- ☆26Updated this week
- ☆221Updated 5 months ago
- Official Repo for EdgeQAT☆13Updated 3 months ago
- [ICLR 2025] AdaFisher: Adaptive Second Order Optimization via Fisher Information☆30Updated last week
- [ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti…☆60Updated 10 months ago
- Fine-tuning Vision Transformers on various classification datasets☆103Updated 5 months ago
- ImageNet2012 download & arrangement☆13Updated 2 years ago
- ☆38Updated 4 months ago
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆14Updated 6 months ago
- Layer-wise Pruning of Transformer Heads for Efficient Language Modeling☆21Updated 2 years ago
- Jane Street quant interview/test☆99Updated 7 years ago
- Public quant internship repository, maintained by NUFT but available for everyone.☆1,359Updated 4 months ago
- [ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple.☆20Updated 11 months ago
- [ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation☆37Updated 8 months ago
- This is a repo covers ai research papers pseudocodes☆14Updated last year
- CKA (Centered Kernel Alignment) implemented in PyTorch☆12Updated last week
- ☆22Updated 7 months ago