Composition of Multimodal Language Models From Scratch
☆15Aug 16, 2024Updated last year
Alternatives and similar repositories for vlm
Users that are interested in vlm are comparing it to the libraries listed below.
- From scratch implementation of a vision language model in pure PyTorch☆258May 6, 2024Updated 2 years ago
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct☆31Feb 23, 2025Updated last year
- implement GPT-OSS 20B & 120B C++ inference from scratch on AMD GPUs☆172Oct 25, 2025Updated 6 months ago
- This repository demonstrates the utilization of UNETR for brain tumor segmentation.☆11Feb 23, 2024Updated 2 years ago
- Building a VLM model starts from the basic module.☆18Apr 7, 2024Updated 2 years ago
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 2 months ago
- A Python Scraper Designed to Scrape PDFs from Libgen and Scihub☆23May 13, 2024Updated last year
- built a 124M param GPT☆23Jan 28, 2025Updated last year
- ☆26Nov 11, 2024Updated last year
- ☆27Aug 5, 2024Updated last year
- ☆31Apr 29, 2026Updated last week
- A machine learning library focused on deep learning☆11Jun 2, 2015Updated 10 years ago
- Build a simple, basic multimodal large model from scratch. 🤖☆48Jun 19, 2024Updated last year
- ☆18Mar 20, 2017Updated 9 years ago
- Artificial Intelligence for Robotics: Udacity CS373 homework. This repository contains all the homework…☆12Nov 12, 2017Updated 8 years ago
- Training to optimize OCR recognition of rare Chinese characters☆16Feb 16, 2023Updated 3 years ago
- Few-Shot Prompting - Chain-of-Thought (CoT) Prompting - Hallucinations - Self-Consistency - Generated Knowledge Prompting - Tree of …☆30Nov 15, 2023Updated 2 years ago
- A PyTorch implementation of a text-to-video GAN☆12Jul 26, 2019Updated 6 years ago
- ☆16Mar 5, 2023Updated 3 years ago
- The repo of the paper: Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medic…☆11May 26, 2023Updated 2 years ago
- A Beginner's Guide to Monetizing Your Python AI Chatbot☆16Apr 22, 2025Updated last year
- CRUD Word documents with Python☆13Feb 5, 2026Updated 3 months ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆79Apr 7, 2025Updated last year
- [NeurIPS'25 Spotlight🔥]Official implementation of Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Ma…☆32Apr 13, 2026Updated 3 weeks ago
- ☆10Mar 28, 2022Updated 4 years ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated 2 years ago
- ☆16Apr 9, 2024Updated 2 years ago
- From a+b to sparsemax(QK^T)V in Triton!☆33Jun 19, 2025Updated 10 months ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆38Jan 7, 2024Updated 2 years ago
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- ☆10Nov 8, 2019Updated 6 years ago
- AI Demo project: a collection of hands-on examples for developers who want to learn and explore artificial intelligence (AI) technologies.☆25Jan 3, 2026Updated 4 months ago
- Perform automatic skull-stripping for neuroimage analysis☆13Apr 23, 2026Updated last week
- ☆10Mar 19, 2024Updated 2 years ago
- imdby is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companie…☆11May 8, 2025Updated 11 months ago
- ☆10May 19, 2022Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 9 months ago
- ☆12Jul 17, 2024Updated last year