Netflix / videoannotator
☆42Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for videoannotator
- ☆68Updated last month
- ☆62Updated last month
- ☆55Updated 4 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- ☆41Updated last month
- Video-LlaVA fine-tune for CinePile evaluation☆38Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated last month
- ☆11Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆61Updated 2 weeks ago
- ☆59Updated 5 months ago
- ☆35Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆47Updated 2 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆43Updated 2 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆26Updated 6 months ago
- ☆59Updated last month
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆22Updated this week
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆32Updated 2 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated 10 months ago
- E5-V: Universal Embeddings with Multimodal Large Language Models☆173Updated 4 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆55Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- ☆20Updated 9 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆38Updated last month
- ☆44Updated 6 months ago
- Simple CogVLM client script☆14Updated 11 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago
- ☆57Updated last month