[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
☆46Jan 27, 2024Updated 2 years ago
Alternatives and similar repositories for PKOL
Users that are interested in PKOL are comparing it to the libraries listed below.
- [IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”☆86Aug 14, 2024Updated last year
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated last year
- Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"☆118Aug 5, 2024Updated last year
- Awesome multi-modal large language paper/project, collections of popular training strategies, e.g., PEFT, LoRA.☆27Aug 2, 2024Updated last year
- ☆17Apr 5, 2023Updated 3 years ago
- Top solution for the 2025 CCF BDCI DeepSearch track☆90Apr 15, 2026Updated 2 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated 2 years ago
- This repository is the official implementation of [Natural Color Fool: Towards Boosting Black-box Unrestricted Attacks (NeurIPS'22)](http…☆26Feb 13, 2023Updated 3 years ago
- This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.☆17Aug 6, 2022Updated 3 years ago
- ☆19Jul 22, 2024Updated last year
- Aurora Weather☆24Dec 8, 2016Updated 9 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated 2 years ago
- Our Team (green hand) 6th Solution for CVPR-2021 AIC-VI: Unrestricted Adversarial Attacks on ImageNet☆26Jan 25, 2022Updated 4 years ago
- The extension of "Patch-wise Attack for Fooling Deep Neural Network (ECCV2020)", and we aim to boost the success rates of targeted attack…☆28Mar 14, 2022Updated 4 years ago
- A list of papers in NeurIPS 2022 related to adversarial attack and defense / AI security.☆77Dec 5, 2022Updated 3 years ago
- [CVPR 2023]Official Pytorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"☆62Jun 8, 2023Updated 3 years ago
- [ACL 2025] The official implementation of the paper "PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free".☆77Dec 4, 2025Updated 6 months ago
- This repository contains code for the paper "Fine-Grained Predicates Learning for Scene Graph Generation (CVPR 2022)".☆26Jun 7, 2024Updated 2 years ago
- Talk to ChatGPT and Generate image via any Matrix client!☆16Apr 25, 2023Updated 3 years ago
- [TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"☆16May 12, 2025Updated last year
- FT-Data Ranker: Fine-Tuning Data Processing Competition for LLMs (1B-Model Track & 7B-Model Track) FT-Data Ranker: LLM fine-tuning data competition -- 1B-model track competition…☆15Dec 6, 2023Updated 2 years ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆51Jul 1, 2025Updated 11 months ago
- Patch-wise iterative attack (accepted by ECCV 2020) to improve the transferability of adversarial examples.☆94Mar 13, 2022Updated 4 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆60Jul 23, 2024Updated last year
- List of resources for video retrieval.☆20Mar 17, 2022Updated 4 years ago
- ☆29Jun 27, 2022Updated 3 years ago
- Rich Visual Knowledge-based Augmentation Network for Visual Question Answering☆10Dec 6, 2019Updated 6 years ago
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆39Mar 9, 2025Updated last year
- ChineseCLIP using online learning☆14Nov 7, 2022Updated 3 years ago
- ☆11Jul 4, 2024Updated last year
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆12Sep 21, 2023Updated 2 years ago
- Not All Patches Are Equal: Hierarchical Dataset Condensation for Single Image Super-Resolution☆10May 7, 2024Updated 2 years ago
- A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long…☆22Sep 12, 2025Updated 9 months ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 5 years ago
- ☆38Jul 13, 2020Updated 5 years ago
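- Scrapes hotel reviews from the Ctrip website, then performs data preprocessing and sentiment-classification-based analysis, using processing techniques such as jieba word segmentation of reviews; analysis and prediction techniques such as sentiment lexicons, feature extraction, and machine learning models; and visualization techniques such as word clouds and heat maps☆13Jul 15, 2022Updated 3 years ago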
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- W2VV++: A fully deep learning solution for ad-hoc video search☆29Jul 25, 2024Updated last year