This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"
☆119Jun 27, 2025Updated 8 months ago
Alternatives and similar repositories for Pangea
Users who are interested in Pangea are comparing it to the repositories listed below.
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation☆71Oct 17, 2025Updated 4 months ago
- ☆16Jul 23, 2024Updated last year
- XmodelLM☆38Nov 19, 2024Updated last year
- Generate video with voice narration from PPT/PDF slides☆10Sep 4, 2023Updated 2 years ago
- ☆16Sep 17, 2024Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆37Dec 2, 2025Updated 3 months ago
- Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".☆30Jul 22, 2025Updated 7 months ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 8 months ago
- ☆43Jul 10, 2024Updated last year
- ☆11Dec 11, 2024Updated last year
- ☆58Feb 27, 2025Updated last year
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆46Sep 1, 2024Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆80Apr 23, 2025Updated 10 months ago
- 2023 ABCI Llama-2 continual learning project☆14Jan 22, 2024Updated 2 years ago
- AutoTag-YOLOv8 is an object detection project that uses the YOLOv8 model and leverages the power of SAM and DINGO models for automatic la…☆13May 3, 2023Updated 2 years ago
- AIRS-Bench: an AI Research Science benchmark for quantifying the end-to-end AI research abilities of LLM agents☆65Feb 27, 2026Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆354Jun 2, 2025Updated 9 months ago
- [ICCV 2023] The official code for "SILT: Shadow-aware Iterative Label Tuning for Learning to Detect Shadows from Noisy Labels"☆15May 1, 2024Updated last year
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)☆11Nov 15, 2023Updated 2 years ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆64May 21, 2025Updated 9 months ago
- ☆15Sep 10, 2023Updated 2 years ago
- A lightweight C++ LLaMA inference engine for mobile devices☆15Oct 28, 2023Updated 2 years ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Oct 31, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 9 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated last year
- ☆15Dec 15, 2025Updated 2 months ago
- English or Chinese GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago
- PyTorch implementation of models from the Zamba2 series.☆187Jan 23, 2025Updated last year
- An Open Large Reasoning Model for Real-World Solutions☆1,537Feb 13, 2026Updated 3 weeks ago
- 🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza…☆868Updated this week
- Neural theorem proving tutorial, version II☆40Apr 26, 2024Updated last year
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆41Dec 13, 2024Updated last year
- Agent-based implementation of RAG, incorporating AI agents into the RAG pipeline to orchestrate its components and perform additional act…☆19Feb 20, 2025Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- A basic pure PyTorch implementation of flash attention☆16Oct 28, 2024Updated last year