state-spaces / mamba
Mamba SSM architecture
☆12,542 Updated last month
Related projects:
- Fast and memory-efficient exact attention ☆13,401 Updated this week
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch. ☆2,533 Updated 6 months ago
- Latest Advances on Multimodal Large Language Models ☆11,722 Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆15,839 Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ☆10,327 Updated last month
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆6,008 Updated 3 months ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆7,687 Updated this week
- Kolmogorov Arnold Networks ☆14,545 Updated this week
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆9,663 Updated 3 weeks ago
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN). ☆3,866 Updated last month
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆19,545 Updated 3 weeks ago
- An open source implementation of CLIP. ☆9,782 Updated last month
- A playbook for systematically maximizing the performance of deep learning models. ☆26,385 Updated 3 months ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py… ☆19,651 Updated 3 weeks ago
- Train transformer language models with reinforcement learning. ☆9,288 Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆19,294 Updated last month
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆8,791 Updated last month
- A concise but complete full-attention transformer with a set of promising experimental features from various papers ☆4,573 Updated last week
- Ongoing research training transformer models at scale ☆9,949 Updated this week
- A collection of resources and papers on Diffusion Models ☆10,758 Updated last month
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model ☆2,812 Updated last month
- ☆10,072 Updated 3 months ago
- This repository contains demos I made with the Transformers library by HuggingFace. ☆9,050 Updated last month
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆8,351 Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ☆6,029 Updated this week
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆10,713 Updated 3 weeks ago
- ☆4,006 Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆26,822 Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ☆31,479 Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image ☆24,723 Updated last month