Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37Mar 20, 2023Updated 3 years ago
Alternatives and similar repositories for CaptionIMG
Users that are interested in CaptionIMG are comparing it to the libraries listed below
Sorting:
- Neural network for creating distortion while keeping embeddings as close as possible☆20Feb 6, 2024Updated 2 years ago
- ☆35Jan 23, 2024Updated 2 years ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Jul 12, 2024Updated last year
- This repository contains a framework for converting monocular videos into side-by-side (SBS) 3D videos. It utilizes a combination of imag…☆91Feb 11, 2024Updated 2 years ago
- Code of the paper "Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analys…☆27Dec 13, 2023Updated 2 years ago
- ☆37Jan 20, 2024Updated 2 years ago
- ☆11Oct 8, 2023Updated 2 years ago
- [Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition☆119Aug 29, 2025Updated 6 months ago
- "Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"☆38Nov 13, 2024Updated last year
- Python tools for easily translating your blog content to podcasts & YouTube☆211Sep 4, 2024Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Oct 20, 2023Updated 2 years ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆38Jul 19, 2024Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Mar 27, 2024Updated last year
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- Optimal Transport in the Big Data Era☆118Nov 4, 2024Updated last year
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files☆30Oct 23, 2023Updated 2 years ago
- ☆20Jan 8, 2024Updated 2 years ago
- Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…☆16Jan 16, 2024Updated 2 years ago
- AI Prediction api of the MusicLang package☆291Mar 25, 2024Updated last year
- MyFranchise for Madden 24 PC☆10Dec 4, 2023Updated 2 years ago
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)☆34May 8, 2024Updated last year
- 2D road segmentation using lidar data during training☆43Dec 21, 2023Updated 2 years ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆69Sep 6, 2024Updated last year
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆114Sep 13, 2024Updated last year
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Sep 24, 2023Updated 2 years ago
- Run a large AI town, locally, via RWKV !☆160Dec 6, 2023Updated 2 years ago
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning☆86Dec 14, 2023Updated 2 years ago
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis☆104Jan 18, 2024Updated 2 years ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆60Mar 20, 2024Updated 2 years ago
- Low-latency Space-time Supersampling for Real-time Rendering☆33Feb 1, 2024Updated 2 years ago
- 🔥 React library of AI components 🔥☆140Sep 2, 2024Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆272Jan 10, 2026Updated 2 months ago
- An element merging game powered by AI☆16Mar 14, 2025Updated last year
- [AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues☆61May 2, 2025Updated 10 months ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆43Jun 9, 2024Updated last year
- ☆151Jan 31, 2024Updated 2 years ago
- AI-to-AI Testing | Simulation framework for LLM-based applications☆136Nov 7, 2023Updated 2 years ago
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model☆193Jul 30, 2024Updated last year