Lightning-Universe / InVideo-search_app
☆14 · Updated 9 months ago
Related projects
Alternatives and complementary repositories for InVideo-search_app
- Hugging Face's Zapier Integration 🤗⚡️ ☆47 · Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆77 · Updated last year
- The open source implementation of "NeVA: NeMo Vision and Language Assistant" ☆18 · Updated last year
- Make-A-Video Latent Diffusion Model ☆18 · Updated last year
- ☆52 · Updated 2 months ago
- This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning. ☆230 · Updated 7 months ago
- ☆58 · Updated 8 months ago
- Aim for the moon. If you miss, you may hit a star. ☆160 · Updated last year
- Gradio Client in Rust. ☆23 · Updated last month
- ☆57 · Updated last month
- The Next Generation Multi-Modality Superintelligence ☆70 · Updated 2 months ago
- Webpage for DreamBooth ☆39 · Updated last year
- Document your code repositories with LLMs ☆23 · Updated 10 months ago
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm… ☆121 · Updated this week
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project ☆51 · Updated 2 years ago
- ☆62 · Updated last month
- A plug-and-play pipeline that utilizes Segment Anything to segment datasets with rich detail for downstream fine-tuning on vision mod… ☆21 · Updated 9 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 · Updated 5 months ago
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 ☆52 · Updated last month
- Voyage AI Official Python Library ☆41 · Updated 2 weeks ago
- Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA ☆176 · Updated 11 months ago
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆122 · Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆153 · Updated last year
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, … ☆91 · Updated last year
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi… ☆163 · Updated last year
- DiffusionWithAutoscaler ☆29 · Updated 7 months ago
- Create topological graph for image segments. ☆18 · Updated last month
- A Gradio component that can be used to annotate images with bounding boxes. ☆31 · Updated 3 weeks ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆34 · Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images. ☆23 · Updated 10 months ago