The Document of WenLan API, which was used to obtain image and text feature.
☆41Jan 10, 2023Updated 3 years ago
Alternatives and similar repositories for WenLan-api-document
Users that are interested in WenLan-api-document are comparing it to the libraries listed below
Sorting:
- Bling's Object detection tool☆56Jan 9, 2023Updated 3 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 2 years ago
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation☆21Nov 27, 2024Updated last year
- Open domain Chinese dialogue corpus and datasets.☆16Jan 8, 2022Updated 4 years ago
- 利用resnet_18来对虹膜图像进行模糊清晰二分类☆10Apr 8, 2018Updated 7 years ago
- This project is aim to develope a 2D "chicken dinner"☆15Aug 18, 2018Updated 7 years ago
- [ICLR 2025] Think Then React: Towards Unconstrained Action-to-Reaction Motion Generation☆19Mar 21, 2025Updated 11 months ago
- Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images"☆213Dec 18, 2023Updated 2 years ago
- TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …☆17Feb 12, 2025Updated last year
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- Paper, dataset and code list for multimodal dialogue.☆22Jan 2, 2025Updated last year
- ☆21Aug 26, 2025Updated 6 months ago
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling☆22Aug 4, 2024Updated last year
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- PL0 Compiler 编译原理 C 语言 实现的 PL/0 编译器 flex & bison☆50Dec 26, 2019Updated 6 years ago
- PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".☆24Sep 19, 2021Updated 4 years ago
- ☆25Apr 16, 2024Updated last year
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆60Jan 20, 2021Updated 5 years ago
- A fine tune version of Stable Diffusion model on self-translate 10k diffusiondb Chinese Corpus and "extend" it☆32Mar 29, 2023Updated 2 years ago
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- A Chinese lyric corpus which contains nearly 50,000 lyrics from 500 artists☆39Jan 22, 2018Updated 8 years ago
- ☆36Feb 19, 2023Updated 3 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- ☆12Sep 25, 2023Updated 2 years ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- pytorch implementation of mvp: a multi-stage vision-language pre-training framework☆34Mar 1, 2023Updated 3 years ago
- A tool to easily modify a Stable Diffusion VAE☆84Mar 11, 2023Updated 2 years ago
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- 知予人工智能:从学习者到研究者☆13Jan 20, 2025Updated last year
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- Revisiting Anchor Mechanisms for Temporal Action Localization (TIP 2020)☆36Sep 26, 2021Updated 4 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- Official implementation of Lightweight Human Pose Estimation Using Loss Weighted by Target Heatmap that was honorably mentioned as Best P…☆11Dec 17, 2023Updated 2 years ago
- Scripts for KGIRNet model for ESWC☆10Jul 6, 2023Updated 2 years ago
- Python class to explore the ImageNet database☆16Jan 12, 2012Updated 14 years ago
- ☆12Oct 28, 2024Updated last year