☆57Mar 7, 2026Updated last week
Alternatives and similar repositories for awesome_ai_paper
Users that are interested in awesome_ai_paper are comparing it to the libraries listed below
Sorting:
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 4 months ago
- Adaptive Multimodal Learning for Remote Sensing Data Fusion☆13Dec 22, 2024Updated last year
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆31May 16, 2024Updated last year
- ☆17Jul 30, 2024Updated last year
- ☆18Jun 10, 2025Updated 9 months ago
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"☆10Jul 24, 2022Updated 3 years ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆47Aug 21, 2024Updated last year
- ☆17Jun 17, 2023Updated 2 years ago
- Currently collecting some awesome Manus replays. Feel free to share your use cases.☆18Mar 9, 2025Updated last year
- ☆33Nov 26, 2025Updated 3 months ago
- Code release for VTW (AAAI 2025 Oral)☆66Nov 4, 2025Updated 4 months ago
- ☆14Dec 28, 2022Updated 3 years ago
- ☆50Sep 26, 2025Updated 5 months ago
- Implementation of Adverserial autoencoders☆11Dec 10, 2020Updated 5 years ago
- Code for ICCV2023 paper: Homography Guided Temporal Fusion for Road Line and Marking Segmentation☆14Oct 13, 2024Updated last year
- ☆32Sep 19, 2025Updated 6 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 6 months ago
- Survey on LLM Inference via Search (TMLR 2025)☆14May 6, 2025Updated 10 months ago
- ☆27Apr 28, 2025Updated 10 months ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos☆19Mar 3, 2025Updated last year
- Official implementation for "Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition" (AAAI 2022 Oral).☆17Mar 11, 2025Updated last year
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆37Dec 17, 2024Updated last year
- PyTorch Implementation of "BOOTPLACE: Bootstrapped Object Placement with Detection Transformers", CVPR 2025☆24Aug 8, 2025Updated 7 months ago
- ☆13Apr 23, 2025Updated 10 months ago
- ☆22Feb 13, 2026Updated last month
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆19Mar 2, 2025Updated last year
- ☆28Oct 21, 2025Updated 4 months ago
- ☆10Jun 18, 2024Updated last year
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF☆11May 20, 2024Updated last year
- PyTorch implementation of paper "Dataset Distillation via Factorization" in NeurIPS 2022.☆67Nov 28, 2022Updated 3 years ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 6 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆32Feb 10, 2026Updated last month
- ☆92Feb 23, 2026Updated 3 weeks ago
- AODV in OPNET 14.5☆17Dec 14, 2019Updated 6 years ago
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆19Apr 22, 2025Updated 10 months ago
- Data-Efficient Multimodal Fusion on a Single GPU☆68May 7, 2024Updated last year
- Internal Testing Chain☆11Sep 21, 2022Updated 3 years ago