Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only provides object detections or a few simple sentences.
☆12Jan 15, 2025Updated last year
Alternatives and similar repositories for PLEDGE--Paragraph-LEvel-image-Description-GEneration
Users that are interested in PLEDGE--Paragraph-LEvel-image-Description-GEneration are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TrustAi website☆13Sep 1, 2024Updated last year
- Optimal Planning for NTU YouBike Assignment with Operation Research and Machine Learning Techniques☆11Aug 28, 2024Updated last year
- Apply pre-trained models to help quickly grasp investment news, including three tasks, 1. summarizationm 2. sentiment analysis 3. domain …☆14Sep 1, 2024Updated last year
- A python3 library for evaluating caption's BLEU, Meteor, CIDEr, SPICE,ROUGE_L,WMD score. Fork from https://github.com/ruotianluo/coco-cap…☆22Nov 25, 2020Updated 5 years ago
- Deep Learning to Predict MLB Outcomes☆10Aug 23, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆14Jan 6, 2024Updated 2 years ago
- 使用 Go 和 ReactJS 构建聊天应用系列教程☆49Dec 11, 2022Updated 3 years ago
- A Hierarchical Approach for Generating Descriptive Image Paragraphs☆10Mar 27, 2020Updated 6 years ago
- NCKU Operations Research Applications course - GitHub Writings☆22Jun 25, 2020Updated 5 years ago
- Latex Workshop (2024 Spring)☆11Oct 20, 2024Updated last year
- [CVPR-2023] Re-thinking Model Inversion Attacks Against Deep Neural Networks☆43Nov 12, 2023Updated 2 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Aug 10, 2023Updated 2 years ago
- Python 金融市場賺大錢聖經:寫出你的專屬指標 - 進階技術補充☆18Sep 5, 2021Updated 4 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.☆31Feb 16, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Style transfer in text using cycle-consistent WGANs☆17Jul 11, 2018Updated 7 years ago
- writing style transfer using cycle gan☆21Jul 25, 2024Updated last year
- C++ Trading Algorithm Backtest Environment☆97Sep 21, 2018Updated 7 years ago
- ☆36Dec 13, 2023Updated 2 years ago
- A Python Package for Convex Regression and Frontier Estimation☆37Feb 22, 2025Updated last year
- Sample code for Dependency Injection Principles, Practices, and Patterns☆165Aug 10, 2023Updated 2 years ago
- Experimenting with different regression losses. Implemented in Pytorch.☆148Jan 29, 2019Updated 7 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- ☆86May 17, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-…☆34Sep 14, 2017Updated 8 years ago
- This project is to develop tools for investment decision-making and make investment analysis using data science techniques.☆53Jan 3, 2023Updated 3 years ago
- A simple GPT-3 interface to automate core legal writing tasks☆13Mar 8, 2023Updated 3 years ago
- ☆173May 21, 2023Updated 2 years ago
- Download subreddit comments☆97Feb 23, 2022Updated 4 years ago
- Colab notebook to finetune GLIDE.☆12Mar 22, 2022Updated 4 years ago
- Style Transfer for Texts☆41Aug 14, 2020Updated 5 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- A pathway and collection of resources to learning Jax from beginning to advance.☆11Jan 2, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10May 1, 2025Updated 11 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 11 months ago
- ☆14Aug 30, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year