NKotani / PointAnywhereLinks
This repository is the project page for "Point Anywhere: Directed Object Estimation from Omnidirectional Images", including source code and dataset descriptions.
☆12Updated 2 years ago
Alternatives and similar repositories for PointAnywhere
Users that are interested in PointAnywhere are comparing it to the libraries listed below
Sorting:
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆18Updated 10 months ago
- ☆25Updated 2 years ago
- Multiple Transformation Function Estimation for Image Enhancement☆22Updated last year
- ☆12Updated last year
- Neural network for creating distortion while keeping embeddings as close as possible☆20Updated last year
- ☆16Updated last year
- XmodelLM☆38Updated last year
- ☆29Updated 2 years ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆59Updated 11 months ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated last year
- ☆11Updated 2 years ago
- Visual RAG using less than 300 lines of code.☆29Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- [IJCAI'23] Complete Instances Mining for Weakly Supervised Instance Segmentation☆38Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- Browser automation for creating new pages in WordPress☆13Updated 6 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- ☆16Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆38Updated last week
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files☆30Updated 2 years ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆41Updated last year
- ☆17Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- Adaptive Inter-Class Similarity Distillation for Semantic Segmentation (MTAP 2025)☆27Updated 3 weeks ago
- 3D Traffic Light & Sign Dataset☆21Updated 8 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated 2 years ago
- code for training and using chess embeddings models☆13Updated last year
- Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…☆16Updated last year
- Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…☆12Updated 2 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year