Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only provides object detections or a few simple sentences.
☆12Jan 15, 2025Updated last year
Alternatives and similar repositories for PLEDGE--Paragraph-LEvel-image-Description-GEneration
Users that are interested in PLEDGE--Paragraph-LEvel-image-Description-GEneration are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TrustAi website☆13Sep 1, 2024Updated last year
- ☆14Jan 6, 2024Updated 2 years ago
- Code for paper "Unsupervised Noise adaptation using Data Simulation"☆14May 16, 2024Updated 2 years ago
- Town Pass is an open-source project by Taipei City that promotes citizen participation and innovation in smart city development. It enabl…☆41Apr 6, 2026Updated 2 months ago
- NCKU Operations Research Applications course - GitHub Writings☆21Jun 25, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Latex Workshop (2024 Spring)☆11Oct 20, 2024Updated last year
- ML Application of Algorithmic Trading☆24Oct 16, 2021Updated 4 years ago
- [CVPR-2023] Re-thinking Model Inversion Attacks Against Deep Neural Networks☆43Nov 12, 2023Updated 2 years ago
- Style transfer in text using cycle-consistent WGANs☆17Jul 11, 2018Updated 7 years ago
- writing style transfer using cycle gan☆22Apr 18, 2026Updated 2 months ago
- A Python Package for Convex Regression and Frontier Estimation☆37Feb 22, 2025Updated last year
- Code for CAET5☆23Jun 12, 2023Updated 3 years ago
- Sample code for Dependency Injection Principles, Practices, and Patterns☆164Aug 10, 2023Updated 2 years ago
- Experimenting with different regression losses. Implemented in Pytorch.☆147Jan 29, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 4 years ago
- ☆91May 17, 2025Updated last year
- Breeze ASR 25 是一款先進的自動語音辨識(ASR)模型,基於 Whisper-large-v2 微調而成,特別針對台灣華語以及華語與英語混用的情境進行優化。Breeze ASR 25 is an advanced ASR model fine-tuned fro…☆154Jul 1, 2025Updated last year
- Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-…☆34Sep 14, 2017Updated 8 years ago
- This project is to develop tools for investment decision-making and make investment analysis using data science techniques.☆55Jan 3, 2023Updated 3 years ago
- A simple GPT-3 interface to automate core legal writing tasks☆13Mar 8, 2023Updated 3 years ago
- ☆177May 21, 2023Updated 3 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 4 years ago
- [CVPR 2023] Deep Feature In-painting for Unsupervised Anomaly Detection in X-ray Images☆105Mar 31, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- “互联网新闻情感分析”赛题,是CCF大数据与计算智能大赛赛题之一。对新闻情绪进行分类,0代表正面情绪、1代表中性情绪、2代表负面情绪。☆151Sep 17, 2019Updated 6 years ago
- Style Transfer for Texts☆41Aug 14, 2020Updated 5 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- A pathway and collection of resources to learning Jax from beginning to advance.☆11Jan 2, 2021Updated 5 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Non-local Modeling for Image Quality Assessment☆13Dec 20, 2023Updated 2 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 8 years ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10May 1, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Jun 23, 2026Updated last week
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Automated Question-Answering Over Knowledge Graphs in O&M of Wind Turbines☆14Aug 16, 2022Updated 3 years ago
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…☆17May 27, 2023Updated 3 years ago
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated 2 years ago