Let's make a video clip
☆96Jul 29, 2022Updated 3 years ago
Alternatives and similar repositories for video-clip
Users that are interested in video-clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Aggregating embeddings over time☆32Jan 19, 2023Updated 3 years ago
- Easily compute clip embeddings from video frames☆145Oct 31, 2023Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆100Mar 11, 2023Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆78Dec 5, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Aim for the moon. If you miss, you may hit a star.☆164Feb 14, 2023Updated 3 years ago
- Simple python template☆43Apr 25, 2024Updated last year
- Efficiently read embedding in streaming from any filesystem☆105Aug 9, 2025Updated 7 months ago
- Easily create large video dataset from video urls☆653Jul 30, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- ☆65Oct 4, 2023Updated 2 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆677Aug 14, 2024Updated last year
- ☆10Nov 11, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models☆158Dec 9, 2024Updated last year
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Jun 1, 2022Updated 3 years ago
- Code for the HowTo100M paper☆298Mar 10, 2020Updated 6 years ago
- Train vision models using JAX and 🤗 transformers☆101Dec 14, 2025Updated 3 months ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆126Mar 14, 2022Updated 4 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Mar 21, 2023Updated 3 years ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆304Apr 3, 2024Updated last year
- A step-by-step tutorial about how to use Distributed Data Parallel feature of PyTorch☆16Nov 20, 2020Updated 5 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,781Mar 20, 2026Updated last week
- SVIT: Scaling up Visual Instruction Tuning☆166Jun 20, 2024Updated last year
- ☆15Aug 3, 2021Updated 4 years ago
- ☆129Jul 30, 2024Updated last year
- Tools to isolate speaker and transcribe unstructured audio clips☆11Dec 4, 2022Updated 3 years ago
- [NeurIPS 2022] VisDA 2022 Challenge Toolkit☆20Oct 1, 2022Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,698Dec 8, 2023Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆337Aug 9, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆87Mar 24, 2022Updated 4 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - fork with video pseudo3d☆99Mar 5, 2023Updated 3 years ago
- waymo open data utils☆11Aug 29, 2020Updated 5 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆223May 26, 2024Updated last year
- Command-line tool for downloading and extending the RedCaps dataset.☆49Dec 18, 2023Updated 2 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆380May 19, 2022Updated 3 years ago
- https://github.com/PRBonn/kiss-icp☆11Dec 6, 2022Updated 3 years ago