lightly-ai / lightly-studioLinks
Curate, Annotate, and Manage Your Data in LightlyStudio.
☆682Updated this week
Alternatives and similar repositories for lightly-studio
Users that are interested in lightly-studio are comparing it to the libraries listed below
Sorting:
- 🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.…☆346Updated last month
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,147Updated 2 weeks ago
- All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.☆1,290Updated this week
- Code release for "LLMs can see and hear without any training"☆457Updated 9 months ago
- Open source AI/ML capabilities for the FiftyOne ecosystem☆155Updated 2 weeks ago
- ☆390Updated 3 months ago
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual A…☆584Updated 2 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆348Updated 2 months ago
- [NeurIPS'23] Emergent Correspondence from Image Diffusion☆751Updated last year
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024☆1,810Updated 2 months ago
- Code for the Molmo Vision-Language Model☆870Updated last year
- [ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning☆454Updated last week
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,505Updated last week
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆201Updated 9 months ago
- Efficient Track Anything☆775Updated last year
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,507Updated last month
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆498Updated 2 weeks ago
- This repo contains the code for 1D tokenizer and generator☆1,109Updated 10 months ago
- Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024☆1,629Updated last year
- An open source implementation of CLIP (With TULIP Support)☆165Updated 8 months ago
- Code for Scaling Language-Free Visual Representation Learning (WebSSL)☆245Updated 9 months ago
- ConceptAttention: A method for interpreting multi-modal diffusion transformers.☆416Updated 3 weeks ago
- CVPR 2025 Workshop on CVEU.☆42Updated 7 months ago
- ☆860Updated 2 weeks ago
- Learning from synthetic data - code and models☆327Updated 2 years ago
- Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]☆525Updated 2 years ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆290Updated 8 months ago
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024☆144Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆393Updated last year
- [TMLR 2025 J2C] TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models☆51Updated last month