lightly-ai / lightly-studioLinks
Curate, Annotate, and Manage Your Data in LightlyStudio.
☆570Updated this week
Alternatives and similar repositories for lightly-studio
Users that are interested in lightly-studio are comparing it to the libraries listed below
Sorting:
- ☆349Updated this week
- Code release for "LLMs can see and hear without any training"☆451Updated 6 months ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,718Updated last month
- Efficient Track Anything☆658Updated 10 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,320Updated 2 weeks ago
- Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]☆524Updated last year
- Open source AI/ML capabilities for the FiftyOne ecosystem☆151Updated this week
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,376Updated 3 weeks ago
- 🌍 WorldGen - Generate Any 3D Scene in Seconds☆783Updated 5 months ago
- 🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.…☆344Updated 2 weeks ago
- Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024☆1,596Updated last year
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual A…☆537Updated 3 weeks ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆484Updated 10 months ago
- AllTracker is a model for tracking all pixels in a video.☆361Updated 2 months ago
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…☆1,704Updated last month
- All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.☆1,067Updated this week
- [NeurIPS'23] Emergent Correspondence from Image Diffusion☆736Updated last year
- Official Implementation for our NeurIPS 2024 paper, "Don't Look Twice: Run-Length Tokenization for Faster Video Transformers".☆228Updated 7 months ago
- Code for the Molmo Vision-Language Model☆793Updated 10 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆333Updated last year
- Code release for https://kovenyu.com/WonderWorld/☆666Updated 6 months ago
- LL3M writes Python code that generates 3D assets in Blender.☆483Updated 3 weeks ago
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model☆125Updated 3 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆742Updated 5 months ago
- An open source implementation of CLIP (With TULIP Support)☆163Updated 5 months ago
- ☆157Updated last year
- [ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning☆402Updated last month
- Learning from synthetic data - code and models☆324Updated last year
- Benchmarking physical understanding in generative video models☆214Updated last week
- ☆180Updated 5 months ago