☆23Jan 12, 2024Updated 2 years ago
Alternatives and similar repositories for NurViD-benchmark
Users that are interested in NurViD-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆45Nov 30, 2024Updated last year
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆62Jul 5, 2025Updated 11 months ago
- ☆18Jun 25, 2022Updated 3 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆16Aug 30, 2023Updated 2 years ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition☆58Aug 13, 2024Updated last year
- Code for Chinese grammatical error correction based on knowledge distillation☆11Aug 16, 2022Updated 3 years ago
- ☆41Mar 26, 2025Updated last year
- ☆11Jul 14, 2023Updated 2 years ago
- [EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models☆98Dec 13, 2024Updated last year
- [AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation☆17Nov 13, 2022Updated 3 years ago
- [CVPR 2023] Official PyTorch implementation of the paper "GAP: Post-Processing Temporal Action Detection"☆18Aug 31, 2023Updated 2 years ago
- This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and cha…☆22Aug 26, 2022Updated 3 years ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆104Aug 22, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Medical Matting☆29Feb 21, 2023Updated 3 years ago
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP models☆38Mar 23, 2025Updated last year
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆24Apr 10, 2025Updated last year
- Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments (ECCV 2022)☆26Nov 12, 2024Updated last year
- Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track☆25Nov 17, 2025Updated 6 months ago
- There are compilations of surgery-related tasks, datasets, and papers.☆182Apr 3, 2026Updated 2 months ago
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- ☆14Jan 5, 2022Updated 4 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆31Sep 5, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 11 months ago
- ☆10Oct 20, 2022Updated 3 years ago
- Combating Mode Collapse via Manifold Entropy Estimation☆11Apr 21, 2023Updated 3 years ago
- The official start-up code for paper "FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark."☆67Jan 21, 2025Updated last year
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated 2 years ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated 2 years ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆20Sep 15, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS)☆50Apr 18, 2023Updated 3 years ago
- ☆57Jan 7, 2023Updated 3 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 3 years ago
- ☆13Nov 28, 2021Updated 4 years ago
- Embodied Instruction Following in Unknown Environments☆17Dec 8, 2025Updated 6 months ago
- Code for the paper "Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking"☆15Apr 12, 2025Updated last year