jacobmarks / awesome-neurips-2023
Conference schedule, top papers, and analysis of the data for NeurIPS 2023!
☆103Updated 9 months ago
Related projects: ⓘ
- ☆55Updated 3 months ago
- PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning☆119Updated 2 months ago
- Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel☆221Updated this week
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆44Updated 3 months ago
- ☆124Updated 10 months ago
- LoRA and DoRA from Scratch Implementations☆179Updated 6 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆186Updated 3 months ago
- ☆136Updated 7 months ago
- A More Fair and Comprehensive Comparison between KAN and MLP☆122Updated last month
- Decomposing and Editing Predictions by Modeling Model Computation☆97Updated 3 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆46Updated 3 weeks ago
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆168Updated 4 months ago
- E5-V: Universal Embeddings with Multimodal Large Language Models☆148Updated 2 months ago
- Learning from synthetic data - code and models☆293Updated 8 months ago
- Awesome list of papers that extend Mamba to various applications.☆124Updated 2 weeks ago
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆166Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆87Updated 8 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆115Updated last month
- https://slds-lmu.github.io/seminar_multimodal_dl/☆162Updated last year
- ☆45Updated 2 months ago
- Video descriptions of research papers relating to foundation models and scaling☆29Updated last year
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…☆273Updated 8 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆90Updated 3 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆104Updated 6 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆153Updated last week
- ☆189Updated 10 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆120Updated last week
- Official implementation for "Targeted Cause Discovery with Data-Driven Learning"☆20Updated 3 weeks ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆29Updated this week
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆56Updated last month