A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models
☆21Feb 16, 2025Updated last year
Alternatives and similar repositories for Image2Poem
Users that are interested in Image2Poem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- Official implementation for P2SAM (ACM MM 2024)☆14Dec 7, 2024Updated last year
- Reproducible and flexible LLM evaluations for scientific reasoning.☆27Jul 23, 2025Updated 8 months ago
- [ACM'MM 2025] UAV Street-Satellite matching workshop Challenging paper, SkyLink: Unifying Street-Satellite Geo-Localization via UAV-Media…☆24Dec 9, 2025Updated 4 months ago
- ☆25May 16, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Jun 19, 2024Updated last year
- ☆11May 28, 2024Updated last year
- ☆15Aug 28, 2024Updated last year
- [ICCV 2025] Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models☆35Mar 20, 2026Updated 3 weeks ago
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆26Jul 13, 2025Updated 9 months ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 9 months ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- [ECCV2024] Immunizing text-to-image Models against Malicious Adaptation☆18Jan 17, 2025Updated last year
- [IJCAI-24] Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks☆10Sep 2, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- [AAAI'25 Oral] "RFL: Simplifying Chemical Structure Recognition with Ring-Free Language".☆20Jun 14, 2025Updated 10 months ago
- Using machine learning techniques for prediction and modelling non linear dynamic systems.☆10Jun 29, 2018Updated 7 years ago
- The official dataset of the flowvqa project.☆21Mar 26, 2024Updated 2 years ago
- A matlab package for analyzing chaotic properties of time series data☆11Jun 29, 2018Updated 7 years ago
- ☆12Dec 13, 2022Updated 3 years ago
- ☆25Apr 15, 2025Updated last year
- ☆23Oct 14, 2024Updated last year
- Contrastive Learning for Conversion Rate Prediction☆24Sep 4, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A TensorFlow implementation of FlowQA☆15Nov 24, 2018Updated 7 years ago
- Simple news iOS app with SwiftUI☆24Jul 31, 2021Updated 4 years ago
- 前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。☆14Nov 8, 2023Updated 2 years ago
- 中文文本合成 for OCR☆12Mar 14, 2023Updated 3 years ago
- ☆39May 22, 2025Updated 10 months ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Jan 12, 2023Updated 3 years ago
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"☆21Jun 16, 2023Updated 2 years ago
- ☆18Sep 29, 2022Updated 3 years ago
- ☆16Nov 8, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated last year
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model☆55May 31, 2025Updated 10 months ago
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation☆68Jul 22, 2025Updated 8 months ago
- Simple, but accurate drawing app for iOS☆25May 14, 2022Updated 3 years ago
- ☆22May 7, 2025Updated 11 months ago
- ☆20Nov 30, 2021Updated 4 years ago
- 项目描述:项目主要是在 GEC6818 开发板上实现一个综合娱乐系统,包括消灭星星,电子钢琴,2048 游戏,mp4等功能,分为游戏客户端和游戏服务端,游戏客户端具体实现 通过 vector 容器存放游戏棋盘,通过棋盘存放的数据将对应数字的 BMP 图片打印到 GEC681…☆10Feb 13, 2022Updated 4 years ago