mbzuai-oryx / BiMediX2
Bio-Medical EXpert LMM with English and Arabic Language Capabilities
☆63 Updated last month
Alternatives and similar repositories for BiMediX2:
Users who are interested in BiMediX2 are comparing it to the repositories listed below.
- From scratch implementation of a vision language model in pure PyTorch ☆192 Updated 8 months ago
- Agent benchmark for medical diagnosis ☆154 Updated last month
- Notebooks for fine tuning pali gemma ☆90 Updated last month
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code. ☆44 Updated 7 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆32 Updated 3 weeks ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond) ☆104 Updated 4 months ago
- Bilingual Medical Mixture of Experts LLM ☆28 Updated 2 months ago
- Qwen2 VL Fine Tuning using Llama Factory ☆12 Updated 4 months ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data. ☆35 Updated 3 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆79 Updated 8 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆37 Updated 3 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳 ☆251 Updated last month
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face. ☆19 Updated last year
- A Large Language-Vision Assistant for Pathology Image Understanding (BIBM-2024) ☆40 Updated 2 weeks ago
- ☆70 Updated last week
- Official repository of paper titled "UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalitie… ☆68 Updated last month
- ☆42 Updated 4 months ago
- (WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, B… ☆81 Updated 4 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆101 Updated this week
- ☆98 Updated 2 months ago
- ☆43 Updated 4 months ago
- Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆10 Updated 4 months ago
- Medical RAG QA App using Meditron 7B LLM, Qdrant Vector Database, and PubMedBERT Embedding Model. ☆50 Updated last year
- [Arxiv-2024] CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation ☆137 Updated 3 weeks ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin… ☆23 Updated 2 weeks ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆169 Updated this week
- The application of multimodal RAG for Sustainable finance ☆17 Updated 6 months ago
- Medical Mixture of Experts LLM using Mergekit. ☆20 Updated 10 months ago