☆66Jun 16, 2023Updated 2 years ago
Alternatives and similar repositories for assistgpt
Users that are interested in assistgpt are comparing it to the libraries listed below
Sorting:
- This is the project page for the HOSNeRF☆16Dec 11, 2023Updated 2 years ago
- A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation☆87Sep 27, 2025Updated 5 months ago
- The code repository of UniRL☆51May 30, 2025Updated 9 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆29Aug 28, 2023Updated 2 years ago
- ☆16Sep 27, 2023Updated 2 years ago
- [CVPR 2024] ViT-Lens: Towards Omni-modal Representations☆190Feb 3, 2025Updated last year
- [ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding☆376May 8, 2024Updated last year
- ☆14Aug 21, 2025Updated 6 months ago
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆17Aug 24, 2022Updated 3 years ago
- ☆73May 10, 2024Updated last year
- ☆83Aug 1, 2023Updated 2 years ago
- [ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation☆75Dec 28, 2021Updated 4 years ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆146Dec 26, 2024Updated last year
- This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.☆41Mar 7, 2023Updated 3 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆256May 9, 2024Updated last year
- [ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video☆23Jan 8, 2024Updated 2 years ago
- BlockchainGPT: An intuitive, chat-based platform to manage your blockchain environments using natural language processing capabilities.☆11Jul 6, 2023Updated 2 years ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆54Apr 15, 2024Updated last year
- Finetune any model on HF in less than 30 seconds☆56Jan 31, 2026Updated last month
- Here we collect trick questions and failed tasks for open source LLMs to improve them.☆32Apr 20, 2023Updated 2 years ago
- A curated list of vision-and-language pre-training (VLP). :-)☆62Jul 6, 2022Updated 3 years ago
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆111Apr 16, 2025Updated 10 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆61Oct 1, 2024Updated last year
- ☆39Aug 26, 2025Updated 6 months ago
- [CoRL 2022] This repository contains code for generating relevancies, training, and evaluating Semantic Abstraction.☆115Mar 9, 2023Updated 3 years ago
- ☆43Feb 16, 2026Updated 3 weeks ago
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆29Jul 14, 2022Updated 3 years ago
- [CVPR 2025] Video Narration as Vocabulary & Video as Long Document☆589Mar 13, 2025Updated 11 months ago
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 2 years ago
- Heart Steps 2.0 Application☆10Sep 1, 2023Updated 2 years ago
- The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.☆33Nov 3, 2020Updated 5 years ago
- A template to run Lanchain Powered App using Chainlit Front UI☆13Aug 1, 2023Updated 2 years ago
- Official repo for MM-REACT☆968Jan 31, 2024Updated 2 years ago
- [Image 2 Text Para] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.☆825Apr 28, 2023Updated 2 years ago
- Project automates AI news gathering and Blog post Writing. Our AI agent collects insights and news about any topic From Internet and Wri…☆36Apr 16, 2024Updated last year
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆42Mar 23, 2024Updated last year