Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
☆48Oct 2, 2023Updated 2 years ago
Alternatives and similar repositories for VisionGPT2
Users that are interested in VisionGPT2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Indic-Conformer models for ASR☆20Jul 19, 2024Updated last year
- Simple repository for training small reasoning models☆50Feb 17, 2026Updated 2 months ago
- understanding language modeling by training a small GPT on Shakespeare plays.☆13Feb 15, 2023Updated 3 years ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆17Jul 27, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Jan 19, 2024Updated 2 years ago
- Tacotron 2 training notebook supporting Japanese, French, and Mandarin☆11Nov 19, 2022Updated 3 years ago
- Express.js ported to a Service Worker context☆18Mar 6, 2025Updated last year
- Tutorial on how to train a custom voice recognition model using Hugging face models.☆11Jul 2, 2023Updated 2 years ago
- Easy local FLUX.1 Inference☆10Aug 29, 2024Updated last year
- General information about DEEP BERLIN's AI for Good Hackathon 2020☆11Apr 14, 2020Updated 6 years ago
- Audio GIFs (.a.gif) -- "Sounds like a Bad Idea."☆11Jun 7, 2019Updated 6 years ago
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆19Mar 4, 2023Updated 3 years ago
- [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"☆13Jul 26, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆35Dec 16, 2025Updated 4 months ago
- Ollama with RAG and Chainlit is a chatbot project leveraging Ollama, RAG, and Chainlit. It uses Chromadb for vector storage, gpt4all for …☆14Feb 15, 2024Updated 2 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Experimental AI chat app☆23Jan 3, 2025Updated last year
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆16Oct 5, 2023Updated 2 years ago
- Spawn multiple cordova enabled webviews in one app☆12May 13, 2019Updated 6 years ago
- Early Detection of Alzheimer Disease using NLP & Deep Learning☆37May 22, 2019Updated 6 years ago
- Run Gemini Nano locally on chrome☆24Jun 27, 2024Updated last year
- Useful Python scripts written in 3! Uses a lot of modules and APIs.☆18Oct 1, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Various Vector Similarity Search examples☆13Dec 30, 2022Updated 3 years ago
- ☆15Jan 26, 2025Updated last year
- ☆20Jul 18, 2024Updated last year
- Examples to finetune encoder-only and encoder-decoder transformers for Japanese language in Hugging Face (Oct 2022)☆16Oct 6, 2023Updated 2 years ago
- A simple script to convert HTML files to images (PNG, JPG) or to PDF☆16May 21, 2022Updated 3 years ago
- Boot to Gecko aims to create a complete, standalone operating system for the open web.☆14Mar 9, 2015Updated 11 years ago
- ☆25May 23, 2025Updated 10 months ago
- A web based tool for visualization of the forward and reverse modes of automatic differentiation☆21May 2, 2024Updated last year
- ☆15Nov 12, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An SVM model for multi-class classification of Thyroid data.☆11Dec 9, 2019Updated 6 years ago
- Text only Bulletin board☆13Feb 16, 2024Updated 2 years ago
- Stream of consciousness nexus REST microservice☆19Sep 4, 2022Updated 3 years ago
- Easy Landmark Image Recognition with TensorFlow Hub DELF Module☆20Jun 21, 2018Updated 7 years ago
- Video+code lecture on building nanoGPT from scratch☆67Jun 14, 2024Updated last year
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆18Feb 20, 2024Updated 2 years ago
- Created to house D3.js (and others as time goes by) files used at the Pittsburgh Data Visualization meetup☆15Jan 14, 2021Updated 5 years ago