deekshaaneja / Qwen2-VL
☆11Updated 7 months ago
Alternatives and similar repositories for Qwen2-VL
Users that are interested in Qwen2-VL are comparing it to the libraries listed below
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆70Updated 8 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 7 months ago
- ☆43Updated 3 months ago
- ☆41Updated 11 months ago
- ☆31Updated last year
- Testing PaliGemma 2 fine-tuning on a reasoning dataset☆18Updated 5 months ago
- ☆41Updated 5 months ago
- A hybrid search implementation combining text search and vector search☆20Updated last year
- A novel multi-modality (Vision) RAG architecture☆27Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- ☆22Updated last year
- 7 query strategies for navigating knowledge graphs with LlamaIndex☆43Updated last year
- Fine-Tuning LLM and embedding models☆27Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆39Updated 2 weeks ago
- ☆32Updated last year
- ☆29Updated 9 months ago
- Deploy a light and full OpenAI API for production with vLLM, supporting /v1/embeddings with all embedding models.☆42Updated 10 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆30Updated 8 months ago
- Flow Chart Image-to-Code Generation☆32Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆126Updated last year
- ☆24Updated 4 months ago
- ☆63Updated 8 months ago
- Embedding models from Jina AI☆60Updated last year
- ☆46Updated 8 months ago
- ☆20Updated 2 months ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆73Updated 2 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated last year