Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.
☆85May 29, 2024Updated last year
Alternatives and similar repositories for YoloGemma
Users that are interested in YoloGemma are comparing it to the libraries listed below
Sorting:
- X Developer Challenge☆12Apr 25, 2024Updated last year
- [WINNER SOLUTION] soccernet monocular depth estimation solution☆13Sep 3, 2025Updated 6 months ago
- My personal website☆11Dec 22, 2024Updated last year
- A context-based chatbot following the ACR appropriateness guidelines☆11Aug 30, 2023Updated 2 years ago
- Le Tigre integrates speech recognition, vision, and text-to-speech capabilities to offer a comprehensive multimodal AI solution. It can p…☆12Jun 5, 2024Updated last year
- Code, notebooks, and other material for FuturePath AI's training course on Generative AI☆12Apr 24, 2025Updated 10 months ago
- A browser-based AI inference network.☆126Jul 28, 2024Updated last year
- Official repository for the paper "Fast Predictive Uncertainty for Classification with Bayesian Deep Networks". Accepted at UAI 2022. htt…☆12May 25, 2022Updated 3 years ago
- Collect VLM models that can be tried online.☆14Apr 15, 2024Updated last year
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆30Dec 19, 2023Updated 2 years ago
- Janus is an opensource IA for Star Citizen☆11Dec 23, 2023Updated 2 years ago
- a better menu☆14Mar 14, 2024Updated last year
- minimal diffusion transformer in pytorch.☆17Oct 6, 2024Updated last year
- A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest…☆463Feb 13, 2026Updated 3 weeks ago
- An experimental and alternative approach to Finetuning and RAG.☆34Dec 9, 2023Updated 2 years ago
- A customizable GPT in a single page, using OpenAI models text-embedding-ada-002, tts-1, whisper-1, dall-e-3, and gpt-4-vision-preview☆14Jul 9, 2024Updated last year
- A frontend that is compatible to the school-bud-e-backend.☆22Oct 23, 2025Updated 4 months ago
- Automatically research and outbound companies with Exa API and google sheets app scripts.☆17Jun 24, 2024Updated last year
- Recursive Visual Programming (ECCV 2024)☆18Nov 20, 2024Updated last year
- 👾 DX-focused decentralized zero-knowledge framework 🛸☆39Apr 18, 2024Updated last year
- Data Questionnaire Agent Chatbot☆71Feb 5, 2026Updated last month
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆496Jul 23, 2025Updated 7 months ago
- WIP - Allows you to create DSPy pipelines using ComfyUI☆205Dec 1, 2024Updated last year
- Lightweight open-source perplexity☆62May 6, 2024Updated last year
- [ICME'23, oral] CCLAP: Controllable Chinese Landscape Painting Generation☆19Apr 20, 2025Updated 10 months ago
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 11 months ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Aug 20, 2024Updated last year
- Better WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆23Oct 29, 2024Updated last year
- Code repository for Liquid Time-stochasticity networks (LTSs)☆23Apr 26, 2023Updated 2 years ago
- A framework for orchestrating AI agents using a mermaid graph☆76May 16, 2024Updated last year
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Aug 23, 2024Updated last year
- ☆282Jun 4, 2024Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆74Aug 14, 2024Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆46Mar 29, 2024Updated last year
- auto fine tune of models with synthetic data☆78Feb 14, 2024Updated 2 years ago
- ☆16Nov 13, 2023Updated 2 years ago
- ☆37Jan 25, 2026Updated last month
- Generative cellular automaton-like learning environments for RL.☆20Jan 30, 2025Updated last year
- ☆19Aug 7, 2024Updated last year