lightly-ai / lightly-studioLinks
Curate, Annotate, and Manage Your Data in LightlyStudio.
☆637Updated this week
Alternatives and similar repositories for lightly-studio
Users that are interested in lightly-studio are comparing it to the libraries listed below
Sorting:
- Code release for "LLMs can see and hear without any training"☆454Updated 6 months ago
- ☆365Updated last month
- 🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.…☆347Updated last month
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,776Updated 2 months ago
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual A…☆555Updated last week
- ConceptAttention: A method for interpreting multi-modal diffusion transformers.☆354Updated 3 weeks ago
- An open source implementation of CLIP (With TULIP Support)☆163Updated 6 months ago
- Official Implementation for our NeurIPS 2024 paper, "Don't Look Twice: Run-Length Tokenization for Faster Video Transformers".☆229Updated 8 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,400Updated last month
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024☆1,765Updated last week
- Learning from synthetic data - code and models☆325Updated last year
- [TMLR 2025 J2C] TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models☆49Updated last month
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆189Updated 7 months ago
- Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]☆525Updated last year
- Benchmarking physical understanding in generative video models☆221Updated last month
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆274Updated 6 months ago
- Code for Scaling Language-Free Visual Representation Learning (WebSSL)☆245Updated 7 months ago
- Open source AI/ML capabilities for the FiftyOne ecosystem☆151Updated this week
- LongLive: Real-time Interactive Long Video Generation☆855Updated last month
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model☆127Updated 3 months ago
- [ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning☆408Updated this week
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆338Updated this week
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆129Updated last month
- Sparse Linear Concept Embeddings☆122Updated 8 months ago
- Official implementation of Inductive Moment Matching☆565Updated 4 months ago
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,403Updated last week
- ☆702Updated 2 weeks ago
- Code for the Molmo Vision-Language Model☆815Updated 11 months ago
- Official inference repo for FLUX.2 models☆1,058Updated this week
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆387Updated 2 months ago