Project to provide driver guidance through object recognition in the vehicle driving environment: Display bounding boxes on objects in images/videos or display guide text on the screen.
☆20Aug 25, 2024Updated last year
Alternatives and similar repositories for prometheus5_project_AIDrivingGuide
Users that are interested in prometheus5_project_AIDrivingGuide are comparing it to the libraries listed below
Sorting:
- numpy implementation of deep learning models including Transformer (With 6 exercise)☆12Feb 24, 2024Updated 2 years ago
- Simple, Unified Repository for Retrieval-based Voice Conversion☆16Jul 3, 2024Updated last year
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Jun 18, 2025Updated 8 months ago
- ☆14Dec 31, 2024Updated last year
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆15May 13, 2025Updated 9 months ago
- [ICCV2023] PyTorch implementation of ''Spatial-Aware Token for Weakly Supervised Object Localization''.☆23Oct 24, 2023Updated 2 years ago
- TRT for WSOL☆30Oct 31, 2023Updated 2 years ago
- This is the official code for the EMNLP 2023 paper "GLEN: Generative Retrieval via Lexical Index Learning".☆29Aug 25, 2025Updated 6 months ago
- This is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos☆43Nov 5, 2025Updated 4 months ago
- ViTOL☆32Jun 28, 2022Updated 3 years ago
- Practically and asymptotically accurate conditional sampling from diffusion generative models without conditional training☆47Nov 24, 2024Updated last year
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆59Feb 12, 2025Updated last year
- [ECCV2022] Official Pytorch Implementation of Object Discovery via Contrastive Learning for Weakly Supervised Object Detection☆50Jan 16, 2024Updated 2 years ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆86Aug 6, 2025Updated 7 months ago
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"☆203Jun 10, 2025Updated 8 months ago
- Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.☆142Feb 16, 2023Updated 3 years ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆214Jun 26, 2025Updated 8 months ago
- ☆306May 29, 2025Updated 9 months ago
- [CVPR 2022] Official CoTTA Code for our paper Continual Test-Time Domain Adaptation☆305Jun 17, 2024Updated last year
- Evaluating Weakly Supervised Object Localization Methods Right (CVPR 2020)☆335Sep 20, 2022Updated 3 years ago
- This is a repo to track the latest autoregressive visual generation papers.☆431Jun 25, 2025Updated 8 months ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)☆482Oct 18, 2024Updated last year
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…☆478Oct 21, 2024Updated last year
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆545Sep 15, 2023Updated 2 years ago
- BertViz: Visualize Attention in Transformer Models☆7,932Jan 8, 2026Updated 2 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,393May 31, 2024Updated last year
- Official inference repo for FLUX.1 models☆25,246Jul 31, 2025Updated 7 months ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,707Feb 18, 2026Updated 2 weeks ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,420Feb 26, 2026Updated last week
- Ultralytics YOLO 🚀☆54,059Updated this week
- YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite☆56,937Feb 20, 2026Updated 2 weeks ago
- Deepfakes Software For All☆55,021Updated this week
- script loader for Google Apps Script☆19Feb 7, 2012Updated 14 years ago
- Bootstrap 4 shinydashboard using AdminLTE3☆456Aug 25, 2025Updated 6 months ago
- ☆20Dec 8, 2024Updated last year
- 3D Voxel game using Unity3D☆11Aug 12, 2015Updated 10 years ago
- This repository contains implementations and illustrative code to accompany DeepMind publications☆14,728Feb 20, 2026Updated 2 weeks ago
- Software program that emulates a PPP, SLIP, or CSLIP connection to the Internet via a shell account☆24Dec 11, 2019Updated 6 years ago
- ljTools 是一套处理数据的常用函数工具包,简化数据处理。具有高度的易用性和复用性,用户无需关注各种繁琐的实现细节,一条语句即可构建出需要的结果。 包括:日期类型,获取年月日、获取星期、将日期转为时间戳、将时间戳转为日期; Number类型,数字转化为带三位逗号的字符串…☆12Nov 1, 2018Updated 7 years ago