lucidrains / AMIE-pytorch
Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind
โ60Updated 7 months ago
Alternatives and similar repositories for AMIE-pytorch:
Users that are interested in AMIE-pytorch are comparing it to the libraries listed below
- Implementation of ๐ป Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorchโ88Updated last year
- A repository to house some personal attempts to beat some state-of-the-art for medical datasetsโ98Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorchโ24Updated 3 months ago
- โ43Updated 7 months ago
- Implementation of Infini-Transformer in Pytorchโ110Updated 4 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amountโฆโ53Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)โ50Updated last month
- Utilities for Training Very Large Modelsโ58Updated 7 months ago
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)โ19Updated 6 months ago
- โ49Updated last year
- โ63Updated 7 months ago
- โ31Updated 4 months ago
- Implementation of Bitune: Bidirectional Instruction-Tuningโ19Updated 11 months ago
- โ30Updated 11 months ago
- โ41Updated 9 months ago
- Holistic evaluation of multimodal foundation modelsโ47Updated 8 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.โ29Updated last month
- Explorations into adversarial losses on top of autoregressive loss for language modelingโ35Updated 2 months ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Modelsโ25Updated 3 weeks ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonettoโ55Updated 11 months ago
- Implementation of a holodeck, written in Pytorchโ17Updated last year
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-expertsโ118Updated 6 months ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorchโ100Updated last year
- Easily run PyTorch on multiple GPUs & machinesโ45Updated last month
- โ53Updated last year
- Contrastive Language-Image Pretrainingโ38Updated 9 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"โ23Updated 2 weeks ago
- Triton Implementation of HyperAttention Algorithmโ47Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataโ21Updated 9 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.โ23Updated last year