From scratch implementation of a vision language model in pure PyTorch
☆258May 6, 2024Updated 2 years ago
Alternatives and similar repositories for seemore
Users that are interested in seemore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- World's Smallest Vision-Language Model☆33Apr 7, 2024Updated 2 years ago
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago
- From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)☆804Oct 30, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆88May 29, 2024Updated 2 years ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆20Nov 22, 2023Updated 2 years ago
- Fine tune Gemma 3 on an object detection task☆106Jul 14, 2025Updated 10 months ago
- ☆251Jan 2, 2025Updated last year
- a family of highly capabale yet efficient large multimodal models☆193Aug 23, 2024Updated last year
- Reference implementation of Mistral AI 7B v0.1 model.☆28Dec 25, 2023Updated 2 years ago
- An automated data pipeline scaling RL to pretraining levels☆77Oct 11, 2025Updated 7 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Dec 9, 2024Updated last year
- Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"☆42Dec 27, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for paper https://arxiv.org/abs/2501.00522☆15Apr 28, 2025Updated last year
- ☆46May 24, 2025Updated last year
- A python package for text sanitization with differential privacy☆46Dec 25, 2025Updated 5 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,876Oct 27, 2025Updated 7 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆418Nov 11, 2025Updated 6 months ago
- A simple GUI utility for gathering LIMA-like chat data.☆23Oct 6, 2025Updated 7 months ago
- Train LLM on Hugging Face infra☆72Apr 2, 2026Updated last month
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,919Jan 9, 2026Updated 4 months ago
- llama3 implementation one matrix multiplication at a time☆15,234May 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆956Nov 16, 2025Updated 6 months ago
- tiny vision language model☆9,707Apr 20, 2026Updated last month
- Cookbook for Crafting Good Code☆57Mar 19, 2024Updated 2 years ago
- This project is under development.☆23Aug 20, 2023Updated 2 years ago
- A metric suite leveraging the logical inference capabilities of LLMs, for radiology report generation both with and without grounding☆97May 13, 2026Updated 2 weeks ago
- AI eXplainable Inference & Search. Open Sourcing on-premise, ultra-fast latency intelligence to all.☆38Feb 28, 2025Updated last year
- ☆138Sep 29, 2024Updated last year
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆642Jun 11, 2024Updated last year
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,628Apr 20, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- Tokenizer for Text to Speech (TTS) models☆13Jan 16, 2025Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆203Jun 1, 2025Updated 11 months ago
- Video+code lecture on building nanoGPT from scratch☆67Jun 14, 2024Updated last year
- Gemma 2 optimized for your local machine.☆385Aug 7, 2024Updated last year
- Quick exploration into fine tuning florence 2☆340Sep 19, 2024Updated last year
- FinCUGE Instruction dataset☆16Apr 29, 2023Updated 3 years ago