adithya-s-k / YoloGemmaView external linksLinks
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.
☆85May 29, 2024Updated last year
Alternatives and similar repositories for YoloGemma
Users that are interested in YoloGemma are comparing it to the libraries listed below
Sorting:
- X Developer Challenge☆11Apr 25, 2024Updated last year
- A Catalog lists instruction sets, models available for Indic language☆10Mar 14, 2024Updated last year
- [WINNER SOLUTION] soccernet monocular depth estimation solution☆13Sep 3, 2025Updated 5 months ago
- Le Tigre integrates speech recognition, vision, and text-to-speech capabilities to offer a comprehensive multimodal AI solution. It can p…☆12Jun 5, 2024Updated last year
- Code, notebooks, and other material for FuturePath AI's training course on Generative AI☆12Apr 24, 2025Updated 9 months ago
- A browser-based AI inference network.☆126Jul 28, 2024Updated last year
- Official repository for the paper "Fast Predictive Uncertainty for Classification with Bayesian Deep Networks". Accepted at UAI 2022. htt…☆12May 25, 2022Updated 3 years ago
- Collect VLM models that can be tried online.☆14Apr 15, 2024Updated last year
- Langchain Usecases☆16May 26, 2024Updated last year
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆30Dec 19, 2023Updated 2 years ago
- Janus is an opensource IA for Star Citizen☆11Dec 23, 2023Updated 2 years ago
- minimal diffusion transformer in pytorch.☆16Oct 6, 2024Updated last year
- A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest…☆462Updated this week
- Verbosity control for AI agents☆66May 23, 2024Updated last year
- An experimental and alternative approach to Finetuning and RAG.☆34Dec 9, 2023Updated 2 years ago
- Automatically research and outbound companies with Exa API and google sheets app scripts.☆17Jun 24, 2024Updated last year
- A customizable GPT in a single page, using OpenAI models text-embedding-ada-002, tts-1, whisper-1, dall-e-3, and gpt-4-vision-preview☆14Jul 9, 2024Updated last year
- Lightweight open-source perplexity☆62May 6, 2024Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Aug 20, 2024Updated last year
- Run, deploy and monitor CLI agents in secure cloud sandboxes.☆41Updated this week
- [ICME'23, oral] CCLAP: Controllable Chinese Landscape Painting Generation☆19Apr 20, 2025Updated 9 months ago
- A framework for orchestrating AI agents using a mermaid graph☆76May 16, 2024Updated last year
- ☆19Mar 16, 2025Updated 11 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆93Jan 23, 2026Updated 3 weeks ago
- ☆283Jun 4, 2024Updated last year
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Aug 23, 2024Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆71Aug 14, 2024Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆46Mar 29, 2024Updated last year
- auto fine tune of models with synthetic data☆78Feb 14, 2024Updated 2 years ago
- ☆18Jun 22, 2024Updated last year
- Simple orchestration for EC2 spot containers☆19Sep 27, 2024Updated last year
- Implementation of Bitune: Bidirectional Instruction-Tuning☆23Jun 19, 2025Updated 7 months ago
- ☆19Sep 12, 2024Updated last year
- ☆16Nov 13, 2023Updated 2 years ago
- ☆21Oct 14, 2024Updated last year
- Deep research agents using MiniMax M2.1 interleaved thinking☆197Dec 23, 2025Updated last month
- Prompt, run, edit, and deploy full-stack web applications using any LLM you want!☆26Nov 21, 2024Updated last year
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated 11 months ago