From scratch implementation of a vision language model in pure PyTorch
☆257May 6, 2024Updated last year
Alternatives and similar repositories for seemore
Users that are interested in seemore are comparing it to the libraries listed below.
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago
- From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)☆796Oct 30, 2024Updated last year
- A comprehensive hands-on project for learning GPU programming with CUDA and HIP, covering fundamental concepts through advanced optimizat…☆35Nov 20, 2025Updated 4 months ago
- Official code for the Paper "RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance"☆116Jun 4, 2025Updated 9 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆87May 29, 2024Updated last year
- Fine tune Gemma 3 on an object detection task☆100Jul 14, 2025Updated 8 months ago
- ☆242Jan 2, 2025Updated last year
- ☆18Jul 7, 2025Updated 8 months ago
- a family of highly capable yet efficient large multimodal models☆193Aug 23, 2024Updated last year
- Reference implementation of Mistral AI 7B v0.1 model.☆28Dec 25, 2023Updated 2 years ago
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Famous Vision Language Models and Their Architectures☆1,210Jan 11, 2026Updated 2 months ago
- An automated data pipeline scaling RL to pretraining levels☆74Oct 11, 2025Updated 5 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Dec 9, 2024Updated last year
- Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"☆41Dec 27, 2024Updated last year
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 11 months ago
- Train LLM on Hugging Face infra☆71Nov 13, 2025Updated 4 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,738Oct 27, 2025Updated 5 months ago
- A python package for text sanitization with differential privacy☆39Dec 25, 2025Updated 3 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆411Nov 11, 2025Updated 4 months ago
- A simple GUI utility for gathering LIMA-like chat data.☆23Oct 6, 2025Updated 5 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,901Jan 9, 2026Updated 2 months ago
- llama3 implementation one matrix multiplication at a time☆15,255May 23, 2024Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆956Nov 16, 2025Updated 4 months ago
- A metric suite leveraging the logical inference capabilities of LLMs, for radiology report generation both with and without grounding☆94Jan 16, 2026Updated 2 months ago
- ☆68Jun 20, 2024Updated last year
- This project is under development.☆23Aug 20, 2023Updated 2 years ago
- AI eXplainable Inference & Search. Open Sourcing on-premise, ultra-fast latency intelligence to all.☆37Feb 28, 2025Updated last year
- ☆138Sep 29, 2024Updated last year
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆615Jun 11, 2024Updated last year
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,545Mar 9, 2026Updated 2 weeks ago
- Tokenizer for Text to Speech (TTS) models☆13Jan 16, 2025Updated last year
- Gemma 2 optimized for your local machine.☆380Aug 7, 2024Updated last year
- Video+code lecture on building nanoGPT from scratch☆67Jun 14, 2024Updated last year
- Quick exploration into fine tuning florence 2☆340Sep 19, 2024Updated last year
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…☆23Apr 1, 2024Updated last year