LL3M: Large Language and Multi-Modal Model in Jax
☆74Apr 23, 2024Updated 2 years ago
Alternatives and similar repositories for LL3M
Users that are interested in LL3M are comparing it to the libraries listed below.
- Implementation of Diffusion Transformers and Rectified Flow in Jax☆27Jul 9, 2024Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Jan 16, 2023Updated 3 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆78Dec 5, 2022Updated 3 years ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 5 months ago
- ☆648Feb 15, 2024Updated 2 years ago
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…☆10May 8, 2024Updated 2 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- ☆13Aug 19, 2024Updated last year
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆106Mar 6, 2025Updated last year
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Oct 27, 2023Updated 2 years ago
- Datastructure for data science☆23Apr 12, 2024Updated 2 years ago
- A Jax/Flax implementation for Korean LLM inference on TPU.☆12Jun 12, 2023Updated 2 years ago
- Multi-Agent AI app built from scratch in Python without any framework dependency☆15Jan 7, 2025Updated last year
- CHI 2021 Paper Website☆10Jan 13, 2021Updated 5 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆14Sep 1, 2022Updated 3 years ago
- ☆24Sep 25, 2024Updated last year
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Dec 22, 2022Updated 3 years ago
- ☆12May 3, 2018Updated 8 years ago
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆93Apr 30, 2024Updated 2 years ago
- torch7 wrapper for knn CUDA code☆10Dec 1, 2014Updated 11 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- ☆16Aug 19, 2023Updated 2 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 11 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆42Jun 6, 2024Updated 2 years ago
- Framework for Algorithmic Correctness Testing of Operators☆16Mar 9, 2026Updated 2 months ago
- Code and data for "Does Spatial Cognition Emerge in Frontier Models?"☆30Apr 18, 2025Updated last year
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆24Mar 18, 2025Updated last year
- Perplexity Lite using Langgraph, Tavily, and GPT-4.☆25May 1, 2024Updated 2 years ago
- A library to extract plaintexts from the JSON dump file of namu wiki☆26Oct 6, 2022Updated 3 years ago
- A Note On Over-Smoothing for Graph Neural Network☆20Jun 29, 2020Updated 5 years ago
- Progressive Uniform Manifold Approximation and Projection (EuroVis 2020 short)☆13Feb 15, 2023Updated 3 years ago
- [ECCV'22 Poster] Explicit Image Caption Editing☆22Nov 30, 2022Updated 3 years ago
- ☆16Apr 9, 2021Updated 5 years ago
- Official repo for StableLLAVA☆95Dec 22, 2023Updated 2 years ago
- Jax/Flax implementation of DeiT and DeiT-III (ViT)☆19Dec 21, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆709Jan 26, 2026Updated 4 months ago
- Official implementation of SEED-LLaMA (ICLR 2024).☆641Sep 21, 2024Updated last year
- ☆24Jun 12, 2023Updated 2 years ago