Comparing speed of different implementations of reading video into numpy arrays
☆47Nov 19, 2022Updated 3 years ago
Alternatives and similar repositories for benchmarking_video_reading_python
Users who are interested in benchmarking_video_reading_python are comparing it to the libraries listed below
- Low-latency live streaming PoC☆11Jul 30, 2019Updated 6 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Aug 20, 2022Updated 3 years ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 9 months ago
- (unofficial) - customized fork of DETR, optimized for intelligent obj detection on 'real world' custom datasets☆12Aug 22, 2020Updated 5 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆16Jan 30, 2024Updated 2 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Automatic Speech Recognition at the University of Edinburgh.☆16Mar 14, 2021Updated 4 years ago
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- A speech recognition system implemented with a transformer model☆18Aug 11, 2020Updated 5 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- PyTorch implementation of NMT models along with custom tokenizers, models, and datasets☆21Aug 1, 2022Updated 3 years ago
- Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading (CVPR 2022)☆13Jun 18, 2022Updated 3 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 3 years ago
- MWPToolkit is an open-source framework for math word problem(MWP) solvers.☆28Jan 7, 2022Updated 4 years ago
- Multi-sense word embeddings from visual co-occurrences☆25Sep 5, 2019Updated 6 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021☆66Oct 21, 2021Updated 4 years ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Mar 11, 2025Updated 11 months ago
- Official repository for "Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection", ACL Findings 2024.☆14Apr 25, 2025Updated 10 months ago
- Reranking for Multi-objective Optimized Recommender Systems☆11Aug 3, 2023Updated 2 years ago
- Few shot recognition using CLIP's OpenAI architecture.☆36Aug 2, 2021Updated 4 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
- Project focused on enhancing the quality of low-fidelity endoscopy images using Generative Adversarial Networks (GANs) implemented in PyT…☆17Jun 5, 2025Updated 8 months ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- ☆13Aug 28, 2024Updated last year
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- ☆11Sep 25, 2022Updated 3 years ago
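- Chinese edition of a comprehensive Python resource list, covering web frameworks, web crawlers, web content extraction, template engines, databases, data visualization, image processing, text processing, natural language processing, machine learning, logging, code analysis, and more☆11May 24, 2016Updated 9 years ago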
- ☆12Jan 29, 2023Updated 3 years ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 6 months ago
- Official PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- An efficient video loader for deep learning with smart shuffling that's super easy to digest☆2,427Jul 17, 2024Updated last year
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- html template library that supports streaming for javascript runtimes☆15May 4, 2025Updated 9 months ago
- A C++ module for Node.js providing access to the system font catalog.☆11Sep 13, 2022Updated 3 years ago