This repository demonstrates the data preparation and fine-tuning the IDEFICS Vision Language Model.
☆26May 16, 2024Updated 2 years ago
Alternatives and similar repositories for Fine-tune-IDEFICS-Vision-Language-Model
Users that are interested in Fine-tune-IDEFICS-Vision-Language-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of Google Colab notebooks documenting a cruise from Buenos Aires to Antarctica and back through Chile, aboard the Holland Am…☆10Jan 13, 2025Updated last year
- ☆13Aug 30, 2024Updated last year
- Practice Notebook for AI Course☆13Mar 1, 2025Updated last year
- This repository is an implementation of virtually trying on different outfits by image editing and inpainting using Imagen 3 model.☆25Apr 4, 2025Updated last year
- An AR-based Shopping App☆13Jun 14, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scrape South African news☆12May 22, 2023Updated 3 years ago
- Codebase, data and models for the Headline Grouping paper at NAACL2021☆12Oct 2, 2022Updated 3 years ago
- LMT (LayeredMemoryTrader) is a multi-agent trading system using LLMs with human-style short/mid/long memory debates.☆32Jul 24, 2025Updated 10 months ago
- Facebook Messenger Chatbot built with Python + Flask for Starters.☆13Oct 15, 2017Updated 8 years ago
- end-to-end information extraction pipeline built by LayoutLMV2, pretrained model from HuggingFace☆11Aug 15, 2023Updated 2 years ago
- automated insights for tabular data☆10Feb 10, 2025Updated last year
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Apr 18, 2021Updated 5 years ago
- A demo and tutorial for Council that implements a financial analyst agent.☆11Jun 21, 2024Updated last year
- An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2.☆15May 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆43Mar 8, 2026Updated 3 months ago
- Using Siamese LSTM to classify repeated quora questions. Attempted pretrained bert embeddings, Word2Vec and training own embeddings toget…☆10Aug 28, 2020Updated 5 years ago
- 3D Mesh Generation from 2D Images in Python☆13Feb 12, 2024Updated 2 years ago
- This repo contains the code for the tutorial for using the CrewAI agent framework to generate Sales Reports based on Salesforce data☆13Mar 16, 2024Updated 2 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆11Aug 20, 2024Updated last year
- A web based 3d editor inspired by blender, demo app for the vibe coding book☆55Feb 22, 2026Updated 3 months ago
- repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commo…☆19May 20, 2025Updated last year
- This AI tool leverages different LLM services to generate product information from a given image. Simply upload an image of a product and…☆15Jun 25, 2024Updated last year
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Proyecto final de marca personal. ->☆11Sep 17, 2021Updated 4 years ago
- How Media Cloud approaches extracting metadata from online news stories☆17Apr 15, 2026Updated 2 months ago
- The code repo for Youtube tutorial series about using Python asyncio with OpenCV to grab frames from video cameras concurrently☆16Oct 3, 2021Updated 4 years ago
- A simple package of face detection☆14Nov 27, 2020Updated 5 years ago
- detecting the meotions using by analysing the sound of the person unsing python☆11Oct 7, 2019Updated 6 years ago
- a rust crate for easily implementing faster-whisper stt into your rust programs.☆24Oct 20, 2025Updated 7 months ago
- Implementations of GANs in Tensorflow 2.x☆16Feb 12, 2022Updated 4 years ago
- GHUStereo models are novel real-time stereo matching architectures with a low computation complexity characterized by compact cost volum…☆32Dec 14, 2025Updated 6 months ago
- ☆16Oct 13, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Harvard University's CS50 : Problem Set 1-6 completed(all C-based programs)☆10Dec 6, 2016Updated 9 years ago
- Autoencoder-based image compression using pictures of the surface of Mars.☆18Jun 12, 2020Updated 6 years ago
- ☆12Apr 9, 2021Updated 5 years ago
- Bitcoin Hourly OHLCV with 70+ Technical Indicators | Daily Updated Dataset for ML & Trading Analysis☆26Updated this week
- Demo project with very fast blurring image through Apple Accelerate Framework☆12May 23, 2018Updated 8 years ago
- This repository contains resources, labs, and notes from a comprehensive Generative AI course, covering topics such as Natural Language P…☆21Nov 17, 2025Updated 7 months ago
- A simple hack to extract the Subject-Verb-Object from the phrase structure parse tree generated by stanford parser☆16Nov 8, 2012Updated 13 years ago