Netflix / videoannotatorLinks
☆48Updated last year
Alternatives and similar repositories for videoannotator
Users that are interested in videoannotator are comparing it to the libraries listed below
Sorting:
- ☆75Updated 8 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆64Updated this week
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- ☆68Updated last year
- ☆78Updated 8 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- ☆58Updated last year
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated 10 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 8 months ago
- MetaCLIP module for use with Autodistill.☆21Updated last year
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year
- ☆62Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Updated last year
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆82Updated this week
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆112Updated last month
- An plug in and play pipeline that utilizes segment anything to segment datasets with rich detail for downstream fine-tuning on vision mod…☆21Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Jockey is a conversational video agent.☆81Updated 3 weeks ago
- Tokun to can tokens☆17Updated last week
- ☆41Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆62Updated 10 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 10 months ago
- ☆13Updated 6 months ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆14Updated 11 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Updated 2 years ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆96Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆20Updated 8 months ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆17Updated last year