Exploration of Adept's multimodal Fuyu-8B model. 🤓 🔍
☆27Nov 7, 2023Updated 2 years ago
Alternatives and similar repositories for Fuyu-8B---Exploration
Users that are interested in Fuyu-8B---Exploration are comparing it to the libraries listed below.
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- ☆19May 14, 2024Updated last year
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Jan 9, 2024Updated 2 years ago
- imagetokenizer is a Python package that helps you encode visuals and generate visual token ids from a codebook; supports both image and video…☆41Jun 22, 2024Updated last year
- ☆86Feb 5, 2024Updated 2 years ago
- ☆28Feb 26, 2026Updated last month
- ☆18Nov 25, 2023Updated 2 years ago
- Ground-Aware Point Cloud Semantic Segmentation for Autonomous Driving. ACM Multimedia 2019.☆12Sep 19, 2019Updated 6 years ago
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- ☆30Aug 7, 2025Updated 7 months ago
- ☆12Jul 19, 2023Updated 2 years ago
- M-HalDetect Dataset Release☆28Nov 4, 2023Updated 2 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- Source code for paper "Local Spectral Graph Convolution for Point Set Feature Learning"☆10Jul 11, 2018Updated 7 years ago
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- CVPR 2022: Point Cloud Color Constancy☆10Mar 17, 2023Updated 3 years ago
- ☆112Jan 8, 2025Updated last year
- Dynamic Distribution Pruning for Efficient Network Architecture Search☆47Jun 24, 2019Updated 6 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- neural sketch project, currently in generative regex, list transformation (deepcoder), and text editing (robustfill) domains☆24Mar 4, 2020Updated 6 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)☆341Jan 8, 2024Updated 2 years ago
- [MICCAI' 22] Semi-Supervised Medical Image Classification with Temporal Knowledge-Aware Regularization☆14Jun 27, 2022Updated 3 years ago
- Implementation for Face Illumination Transfer through Edge-preserving Filters CVPR11☆13Dec 21, 2017Updated 8 years ago
- C2-Matching for CUDA11☆11Aug 24, 2023Updated 2 years ago
- Official code of "NAS acceleration via proxy data", IJCAI21☆10May 29, 2022Updated 3 years ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆25May 28, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆15Jan 22, 2025Updated last year
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Apr 6, 2021Updated 4 years ago
- Real-time video understanding and interaction through text, audio, image and video with a large multi-modal model. A real-time video understanding and interaction framework using large multimodal models, through text…☆26Jan 26, 2024Updated 2 years ago
- ☆13Aug 9, 2022Updated 3 years ago
- Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs☆98Jan 16, 2025Updated last year
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆112Aug 21, 2025Updated 7 months ago
- Master's semester project at EPFL: implement a depth map fusion algorithm for structured light.☆11Jan 13, 2017Updated 9 years ago
- Cython iterative farthest point sampling implementation☆12Mar 10, 2020Updated 6 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆10Mar 13, 2023Updated 3 years ago