LL3M: Large Language and Multi-Modal Model in Jax
☆74Apr 23, 2024Updated last year
Alternatives and similar repositories for LL3M
Users that are interested in LL3M are comparing it to the libraries listed below
Sorting:
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…☆10May 8, 2024Updated last year
- ☆13Jun 18, 2025Updated 8 months ago
- Implementation of Diffusion Transformers and Rectified Flow in Jax☆27Jul 9, 2024Updated last year
- ☆24Sep 25, 2024Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- ☆15May 11, 2025Updated 9 months ago
- ☆643Feb 15, 2024Updated 2 years ago
- ☆231Dec 18, 2023Updated 2 years ago
- Multi-Agent AI App from Scratch in python without any depedency of framework☆15Jan 7, 2025Updated last year
- ☆12Mar 25, 2024Updated last year
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- This Streamlit application creates an interactive Data Visualization Assistant that can understand Natural Language Queries and generate …☆17Jan 13, 2025Updated last year
- TPU에서 한국어용 LLM 추론을 위한 Jax/Flax 구현체입니다.☆12Jun 12, 2023Updated 2 years ago
- CHI 2021 Paper Website☆10Jan 13, 2021Updated 5 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…☆13Feb 18, 2023Updated 3 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Sep 19, 2024Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated last year
- A library to extract plaintexts from the JSON dump file of namu wiki☆26Oct 6, 2022Updated 3 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- ☆16Aug 19, 2023Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆91Apr 30, 2024Updated last year
- Official repo for StableLLAVA☆95Dec 22, 2023Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 2 months ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆14Sep 1, 2022Updated 3 years ago
- Soft Mixture of Experts Vision Transformer, addressing MoE limitations as highlighted by Puigcerver et al., 2023.☆15Aug 13, 2023Updated 2 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆42Jun 6, 2024Updated last year
- Official Code of Decoupled Graph Convolution (DGC)☆16Jan 31, 2026Updated last month
- ☆17Mar 30, 2024Updated last year
- PANENE: Progressive Approximate NEarest NEighbors☆20Feb 12, 2025Updated last year
- Official implementation of SEED-LLaMA (ICLR 2024).☆642Sep 21, 2024Updated last year
- [ICLR 2024 Oral] Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness.☆17Jan 19, 2024Updated 2 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Jan 16, 2023Updated 3 years ago
- Official code for "Attribution Guided Factorization for Neural Networks Visualization"☆16Oct 23, 2022Updated 3 years ago
- ☆20Jan 22, 2024Updated 2 years ago
- ☆18Apr 24, 2024Updated last year
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- ☆14Jul 26, 2023Updated 2 years ago