Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
☆49Oct 2, 2023Updated 2 years ago
Alternatives and similar repositories for VisionGPT2
Users that are interested in VisionGPT2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A camera pose estimation programme☆12Feb 3, 2014Updated 12 years ago
- ☆11Sep 18, 2023Updated 2 years ago
- An inverse kinematics solver written with KDL and URDF libraries☆14Apr 25, 2018Updated 8 years ago
- FinCUGE Instruction dataset☆16Apr 29, 2023Updated 3 years ago
- This project is a 2D simulation focused on learning and implementing differential drive kinematics and PID control from scratch using Pyg…☆12Oct 3, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Mesh compression algorithm☆17Jul 12, 2022Updated 3 years ago
- Indic-Conformer models for ASR☆19Jul 19, 2024Updated last year
- ExMeshCNN: An Explainable Convolutional Neural Network Architecture for 3D Shape Analysis (KDD 2022)☆12Updated this week
- Inprocess : Installable version of FreeCAD OpenSCAD workbench☆16Feb 24, 2026Updated 3 months ago
- An implementation of LLMzip using GPT-2☆14Aug 7, 2023Updated 2 years ago
- HoloLens projects.☆15Oct 14, 2017Updated 8 years ago
- Image captioning with a locally stored Large Language Model (LLM)☆15Updated this week
- Easy local FLUX.1 Inference☆10Aug 29, 2024Updated last year
- Express.js ported to a Service Worker context☆17Mar 6, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Converting Assembly back to C Code using Transformers.☆28Feb 3, 2024Updated 2 years ago
- Tutorial on how to train a custom voice recognition model using Hugging face models.☆11Jul 2, 2023Updated 2 years ago
- The Codec 2 speech codec, compiled to WASM using Emscripten.☆13Apr 27, 2023Updated 3 years ago
- ☆10Jun 23, 2023Updated 2 years ago
- This project is from the Airbnb Recruitment Challenge on Kaggle. The challenge is to solve a multi-class classification problem of predic…☆11Feb 22, 2022Updated 4 years ago
- General information about DEEP BERLIN's AI for Good Hackathon 2020☆11Apr 14, 2020Updated 6 years ago
- Audio GIFs (.a.gif) -- "Sounds like a Bad Idea."☆11Jun 7, 2019Updated 7 years ago
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆19Mar 4, 2023Updated 3 years ago
- Collection of datasets for network research.☆15Jul 26, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆65Jan 13, 2025Updated last year
- Alpha-Zero Connect Four NN trained via self play☆27Mar 7, 2025Updated last year
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 7 years ago
- ezMPEG is an easy-to-use and easy-to-understand MPEG1 video encoder API☆11Mar 26, 2017Updated 9 years ago
- This repository contains the code for our paper "Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering" [EMNLP…☆15Oct 8, 2024Updated last year
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- ☆12May 12, 2025Updated last year
- Experimental AI chat app☆23Jan 3, 2025Updated last year
- ML from scratch in Jax☆12Aug 20, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- H264 encoder + MP4 output for the web☆15Dec 4, 2020Updated 5 years ago
- ☆13Oct 18, 2024Updated last year
- Train a model for Image Caption from ViT and GPT pretrained model☆18Mar 25, 2023Updated 3 years ago
- A Game Engine for J2ME Platform☆10Mar 13, 2015Updated 11 years ago
- ☆15Jan 26, 2025Updated last year
- This is an attempt to fine-tune the Llama model for Central Kurdish.☆15May 24, 2023Updated 3 years ago
- Examples to finetune encoder-only and encoder-decoder transformers for Japanese language in Hugging Face (Oct 2022)☆16Oct 6, 2023Updated 2 years ago