jacobmarks / awesome-neurips-2023
Conference schedule, top papers, and analysis of the data for NeurIPS 2023!
☆108Updated last year
Alternatives and similar repositories for awesome-neurips-2023:
Users that are interested in awesome-neurips-2023 are comparing it to the libraries listed below
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆152Updated 11 months ago
- ☆64Updated 2 months ago
- PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning☆136Updated 5 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆119Updated 4 months ago
- LoRA and DoRA from Scratch Implementations☆191Updated 9 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆149Updated last month
- ☆198Updated last year
- Awesome list of papers that extend Mamba to various applications.☆128Updated 2 months ago
- Annotated version of the Mamba paper☆460Updated 9 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆207Updated 6 months ago
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆178Updated 7 months ago
- Implementation of Infini-Transformer in Pytorch☆106Updated 2 months ago
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…☆288Updated 10 months ago
- ☆130Updated last year
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆248Updated 7 months ago
- https://slds-lmu.github.io/seminar_multimodal_dl/☆164Updated last year
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)☆163Updated 2 months ago
- Self-Supervised Learning in PyTorch☆130Updated 9 months ago
- ☆28Updated 7 months ago
- ☆38Updated 4 months ago
- ☆154Updated 10 months ago
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind☆171Updated 3 months ago
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024☆137Updated 6 months ago
- Video descriptions of research papers relating to foundation models and scaling☆29Updated last year
- Learning from synthetic data - code and models☆303Updated 11 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆111Updated 4 months ago
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc…☆147Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆176Updated last year
- Reading list for research topics in state-space models☆246Updated this week
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆174Updated last month