Let's make a video clip
☆97Jul 29, 2022Updated 3 years ago
Alternatives and similar repositories for video-clip
Users that are interested in video-clip are comparing it to the libraries listed below.
- Aggregating embeddings over time☆32Jan 19, 2023Updated 3 years ago
- Easily compute clip embeddings from video frames☆149Oct 31, 2023Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆100Mar 11, 2023Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 5 years ago
- Aim for the moon. If you miss, you may hit a star.☆167Feb 14, 2023Updated 3 years ago
- Simple python template☆43Apr 25, 2024Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem☆105Aug 9, 2025Updated 10 months ago
- Easily create large video dataset from video urls☆659Jul 30, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25May 14, 2026Updated last month
- ☆65Oct 4, 2023Updated 2 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆682Aug 14, 2024Updated last year
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆24Oct 13, 2025Updated 8 months ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Jun 1, 2022Updated 4 years ago
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models☆159Dec 9, 2024Updated last year
- ☆13Sep 15, 2021Updated 4 years ago
- Code for the HowTo100M paper☆303Mar 10, 2020Updated 6 years ago
- Train vision models using JAX and 🤗 transformers☆102Dec 14, 2025Updated 6 months ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆126Mar 14, 2022Updated 4 years ago
- This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…☆26Dec 23, 2023Updated 2 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆33Mar 21, 2023Updated 3 years ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆308Apr 3, 2024Updated 2 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 3 years ago
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 8 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,807Updated this week
- SVIT: Scaling up Visual Instruction Tuning☆168Jun 20, 2024Updated last year
- Research Into Learning to Generate Game Levels through Play☆30Feb 20, 2020Updated 6 years ago
- ☆129Jul 30, 2024Updated last year
- [NeurIPS 2022] VisDA 2022 Challenge Toolkit☆20Oct 1, 2022Updated 3 years ago
- Code release for DriveGAN (CVPR 2021)☆100Nov 11, 2021Updated 4 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - fork with video pseudo3d☆99Mar 5, 2023Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,759Dec 8, 2023Updated 2 years ago
- Tools to isolate speaker and transcribe unstructured audio clips☆11Dec 4, 2022Updated 3 years ago
- ☆87Mar 24, 2022Updated 4 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆335Aug 9, 2022Updated 3 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆50Dec 18, 2023Updated 2 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆376May 19, 2022Updated 4 years ago