A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models
☆21Feb 16, 2025Updated last year
Alternatives and similar repositories for Image2Poem
Users that are interested in Image2Poem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆22Mar 10, 2025Updated last year
- Official implementation for P2SAM (ACM MM 2024)☆14Dec 7, 2024Updated last year
- [ACM'MM 2025] UAV Street-Satellite matching workshop Challenging paper, SkyLink: Unifying Street-Satellite Geo-Localization via UAV-Media…☆25Dec 9, 2025Updated 5 months ago
- ☆15Aug 28, 2024Updated last year
- pre-training llama3 using chinese☆13May 1, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A matlab package for analyzing chaotic properties of time series data☆11Jun 29, 2018Updated 7 years ago
- ☆12Dec 13, 2022Updated 3 years ago
- A TensorFlow implementation of FlowQA☆15Nov 24, 2018Updated 7 years ago
- A ComfyUI extension for StyleShot.☆16Apr 23, 2025Updated last year
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Jun 8, 2023Updated 2 years ago
- ☆17May 21, 2024Updated 2 years ago
- 前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。☆14Nov 8, 2023Updated 2 years ago
- ☆17May 17, 2022Updated 4 years ago
- 中文文本合成 for OCR☆12Mar 14, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- WebRED is a large and diverse manually annotated dataset for extracting relationships from a variety of text found on the World Wide Web.☆22Mar 11, 2021Updated 5 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated last year
- BUPT 2023 智能计算系统 课程代码☆34Jan 16, 2024Updated 2 years ago
- ☆22May 7, 2025Updated last year
- ☆20Nov 30, 2021Updated 4 years ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆43Jun 9, 2024Updated last year
- 基于yolov5+pyqt的甲骨文图形化检测工具☆44Nov 4, 2024Updated last year
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)☆30Apr 27, 2022Updated 4 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆32Jul 27, 2020Updated 5 years ago
- Solution to Stanford CS224 assignments☆16Mar 7, 2019Updated 7 years ago
- Image encryption using chaotic maps (Arnold map) ,Mandelbrot set and DNA encryption☆16Feb 14, 2023Updated 3 years ago
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆16May 3, 2023Updated 3 years ago
- ☆31Apr 2, 2022Updated 4 years ago
- ☆42Aug 21, 2021Updated 4 years ago
- 基于stable-diffusion的虚拟换装方法☆11Apr 27, 2024Updated 2 years ago
- Text Detection by RetinaNet with PyTorch (Code will be released soon)☆10Dec 1, 2018Updated 7 years ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆25Feb 2, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆38Mar 18, 2026Updated 2 months ago
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆17Mar 26, 2025Updated last year
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆59Aug 14, 2023Updated 2 years ago
- 如何在github上传本地项目代码(新手使用)☆22Sep 30, 2019Updated 6 years ago
- ☆23Mar 24, 2016Updated 10 years ago
- comfyui的m3net插件,m3net是不错的显著性检测模型,抠图上效果不错,我开源了一个训练的电商的模型,供大家试玩☆12Aug 16, 2024Updated last year
- ☆697May 18, 2026Updated last week