A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models
☆22Feb 16, 2025Updated last year
Alternatives and similar repositories for Image2Poem
Users that are interested in Image2Poem are comparing it to the libraries listed below
Sorting:
- ☆14Aug 28, 2024Updated last year
- Unofficial implementation of 'Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator'☆10Dec 10, 2024Updated last year
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆15Mar 26, 2025Updated 11 months ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- Gemma3的comfyui版本☆10Sep 6, 2025Updated 5 months ago
- 【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆11Mar 21, 2025Updated 11 months ago
- ☆11Apr 10, 2019Updated 6 years ago
- A matlab package for analyzing chaotic properties of time series data☆11Jun 29, 2018Updated 7 years ago
- Image search based on convolutional neural network feature extraction.☆14May 11, 2018Updated 7 years ago
- Finetune the controlnet+stable diffusion model using diffuser☆11Sep 18, 2023Updated 2 years ago
- Text Detection by RetinaNet with PyTorch (Code will be released soon)☆10Dec 1, 2018Updated 7 years ago
- Fine Tuning Stable Diffusion on Chinese Landscape Painting Generation(基于扩散模型的中国山水画生成)☆10Apr 10, 2023Updated 2 years ago
- ☆26Jan 8, 2026Updated last month
- Automated Image Forgery Detection through Classification of JPEG Ghosts☆12Oct 3, 2023Updated 2 years ago
- graphs from Draw.io☆13Sep 26, 2024Updated last year
- Create After Effects scripts in Python.☆13Jan 29, 2021Updated 5 years ago
- 项目描述:项目主要是在 GEC6818 开发板上实现一个综合娱乐系统,包括消灭星星,电子钢琴,2048 游戏,mp4等功能,分为游戏客户端和游戏服务端,游戏客户端具体实现 通过 vector 容器存放游戏棋盘,通过棋盘存放的数据将对应数字的 BMP 图片打印到 GEC681…☆10Feb 13, 2022Updated 4 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- Multiple Attractors simulation with customization☆14Feb 22, 2026Updated last week
- Pytorch、Numpy实现NMS、Soft-NMS代码☆12Mar 22, 2021Updated 4 years ago
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated 11 months ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- Personality prediction from Human Connectome Project fMRI data using BrainNetCNN deep learning model (Kawahara et al. 2016). Project with…☆14Jan 4, 2021Updated 5 years ago
- [ACL 2025 Main] Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a d…☆16Jul 5, 2025Updated 8 months ago
- Alibaba Cloud German AI Challenge 2018, 17th place solution. https://tianchi.aliyun.com/competition/entrance/231683/introduction☆10Jun 17, 2019Updated 6 years ago
- Collect VLM models that can be tried online.☆14Apr 15, 2024Updated last year
- Unofficial Implementation of CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes☆10Sep 8, 2018Updated 7 years ago
- Why do deep convolutional networks generalize so poorly to small image transformations?☆11Jun 23, 2019Updated 6 years ago
- ☆12Feb 13, 2025Updated last year
- Official implementation for P2SAM (ACM MM 2024)☆14Dec 7, 2024Updated last year
- 基于LLaVA1.6微调的Xray识别的多模态大模型☆10Oct 22, 2024Updated last year
- Image Processing and Manipulation using python OpenCV☆12Jan 20, 2019Updated 7 years ago
- Flops counter for convolutional networks in pytorch framework☆11Oct 30, 2019Updated 6 years ago
- miemienet is a C++ AI deep learning inference framework.Supports PPYOLOE、PICODET.☆12Nov 4, 2022Updated 3 years ago
- 天池大数据竞赛2017—广东政务数据创新大赛—智能算法赛☆10Apr 1, 2018Updated 7 years ago
- 人脸检测服务, 用于输出适合人脸识别的 人脸数据集,通过 mtcnn cnn检测人脸,通过 hopenet 开源项目确定人脸是姿态,拿到头部姿态欧拉角,通过 拉普拉斯算子 拿到人脸模糊度,通过对mtcnn 三级网络和置信度,欧 拉角阈值,模糊度设置阈值筛选合适人脸☆14May 17, 2024Updated last year
- yolov5行人检测,rk3588,rknlite2部署☆12Aug 6, 2023Updated 2 years ago
- The IP-Adapter training scripts and inference for Flux Model, which is implemented based on X-Lab☆17Oct 1, 2024Updated last year
- Video-Language Alignment via Spatio–Temporal Graph Transformer; ArXiv: https://arxiv.org/abs/2407.11677☆14Jul 24, 2024Updated last year