This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision models and pretrained language models for visual question answering (VQA) task in Vietnamese.
☆15Dec 31, 2024Updated last year
Alternatives and similar repositories for OpenViVQA
Users that are interested in OpenViVQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Jul 10, 2022Updated 3 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆19Apr 9, 2025Updated last year
- Vietnamese Social Media Censorship Application☆15Sep 6, 2023Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- Integrate Blender with the Text2Motion platform to generate 3D animations from text prompts using Generative AI.☆22Mar 27, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆37May 11, 2024Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …☆10Aug 14, 2021Updated 4 years ago
- Top 9 private leaderboard & Top 17 public leaderboard☆10Dec 1, 2022Updated 3 years ago
- LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS☆13Mar 5, 2024Updated 2 years ago
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆13Dec 31, 2024Updated last year
- This repository is related to all about Machine Learning, Deep Learning, Computer Vision, NLP, and Research Projects☆16Apr 28, 2026Updated 3 weeks ago
- ☆16Feb 5, 2024Updated 2 years ago
- TabMini: A Benchmark Suite for Evaluating and Analyzing the Data Efficiency of Tabular Classifiers☆10Mar 31, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆16Jun 22, 2023Updated 2 years ago
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆16Apr 21, 2025Updated last year
- Few-Shot Preference Optimization (FSPO) personalizes LLMs by reframing reward modeling as a meta-learning problem, enabling rapid adaptat…☆16Feb 27, 2025Updated last year
- ☆15Sep 8, 2018Updated 7 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆65Nov 28, 2021Updated 4 years ago
- ☆13Apr 17, 2018Updated 8 years ago
- Trains small LMs. Designed for training on SimpleStories☆14Sep 15, 2025Updated 8 months ago
- Contains the code for my Imperial College London Master's thesis on text summarization☆10Oct 25, 2022Updated 3 years ago
- ☆14Sep 7, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code to reproduce results of our experiments using LoRe☆17Apr 8, 2026Updated last month
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- OffensEval2020 Shared Task☆17Apr 5, 2021Updated 5 years ago
- Chia sẻ kinh nghiệm dùng Python (3.6), và các thuật toán được học trong môn "Các thuật toán thông minh nhân tạo và ứng dụng"☆16Apr 16, 2019Updated 7 years ago
- Making of cuda kernel☆16May 27, 2025Updated 11 months ago
- Source code of the paper "Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task" …☆22Dec 8, 2022Updated 3 years ago
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?☆13Aug 16, 2023Updated 2 years ago
- Go library for the WebSub protocol, allowing you to run your own scalable WebSub Hub☆19Oct 20, 2024Updated last year
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆32Feb 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications☆23Sep 4, 2021Updated 4 years ago
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"☆20Dec 13, 2024Updated last year
- Sugar functions for manipulating paths in rust.☆26Feb 24, 2026Updated 3 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Jul 1, 2024Updated last year
- Calculating FLOPs of Pre-trained Models in NLP☆18Mar 29, 2021Updated 5 years ago
- A TinyStories LM with SAEs and transcoders☆14Apr 3, 2025Updated last year
- An Enhanced Version of Piper especially for Vietnamese :)☆28Apr 24, 2025Updated last year