deepmancer / vlm-toolboxLinks
Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation
☆12Updated 11 months ago
Alternatives and similar repositories for vlm-toolbox
Users that are interested in vlm-toolbox are comparing it to the libraries listed below
Sorting:
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 3 years ago
- ☆22Updated 2 months ago
- ☆21Updated 3 years ago
- Create a source of truth for ML model results and browse it on Papers with Code☆34Updated 4 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 3 years ago
- Interface for GenAI-Arena [NeurIPS24]☆17Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- The official Python library for Openlayer, the Continuous Model Improvement Platform for AI. 📈☆16Updated this week
- A tiny package supporting distributed computation of COCO metrics for PyTorch models.☆15Updated 2 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"☆31Updated 5 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- Generating Training Data Made Easy☆43Updated 5 years ago
- ☆28Updated last year
- Easy, efficient and Pythonic data loading of Parquet files for PyTorch-based libraries☆24Updated 5 years ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Updated 3 years ago
- Zero Shot Image Classification but more, Supports Multilingual labelling and a variety of CNN based models for a vision backbone by using…☆49Updated 3 years ago
- Solution of Kaggle competition: Feedback Prize - Evaluating Student Writing☆16Updated 3 years ago
- ☆15Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- Scripts for text classification with llama and bert☆32Updated 6 months ago
- Repository for Multimodal AutoML Benchmark☆66Updated 4 years ago
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆47Updated 3 years ago
- Bi-encoder entity linking architecture☆52Updated last year
- ☆73Updated 6 months ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Updated last year
- [Intemarché] Sales forecasting challenge☆11Updated 4 years ago
- ☆10Updated 2 years ago