Let's make a video clip
☆97Jul 29, 2022Updated 3 years ago
Alternatives and similar repositories for video-clip
Users that are interested in video-clip are comparing it to the libraries listed below.
- Aggregating embeddings over time☆32Jan 19, 2023Updated 3 years ago
- Easily compute clip embeddings from video frames☆146Oct 31, 2023Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆100Mar 11, 2023Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 4 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆78Dec 5, 2022Updated 3 years ago
- Aim for the moon. If you miss, you may hit a star.☆165Feb 14, 2023Updated 3 years ago
- Simple python template☆43Apr 25, 2024Updated last year
- Efficiently read embedding in streaming from any filesystem☆105Aug 9, 2025Updated 8 months ago
- Easily create large video dataset from video urls☆653Jul 30, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Optimized library for large-scale extraction of frames and audio from video.☆202Sep 11, 2023Updated 2 years ago
- ☆65Oct 4, 2023Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆678Aug 14, 2024Updated last year
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- A framework making it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's r1☆24Oct 13, 2025Updated 6 months ago
- Repository of the RANLP 2023 paper "Exploring the Landscape of Natural Language Processing Research".☆13Oct 20, 2024Updated last year
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Jun 1, 2022Updated 3 years ago
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models☆158Dec 9, 2024Updated last year
- Code for the HowTo100M paper☆298Mar 10, 2020Updated 6 years ago
- Train vision models using JAX and 🤗 transformers☆101Dec 14, 2025Updated 4 months ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆126Mar 14, 2022Updated 4 years ago
- This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…☆26Dec 23, 2023Updated 2 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Mar 21, 2023Updated 3 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆306Apr 3, 2024Updated 2 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 3 years ago
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 7 years ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly☆11Feb 17, 2025Updated last year
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,793Mar 25, 2026Updated 3 weeks ago
- SVIT: Scaling up Visual Instruction Tuning☆166Jun 20, 2024Updated last year
- ☆15Aug 3, 2021Updated 4 years ago
- Research Into Learning to Generate Game Levels through Play☆30Feb 20, 2020Updated 6 years ago
- ☆129Jul 30, 2024Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,714Dec 8, 2023Updated 2 years ago
- Tools to isolate speaker and transcribe unstructured audio clips☆11Dec 4, 2022Updated 3 years ago
- ☆87Mar 24, 2022Updated 4 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆49Dec 18, 2023Updated 2 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆377May 19, 2022Updated 3 years ago