neulab / PangeaView external linksLinks
This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"
☆119Jun 27, 2025Updated 7 months ago
Alternatives and similar repositories for Pangea
Users that are interested in Pangea are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation☆70Oct 17, 2025Updated 3 months ago
- ☆16Jul 23, 2024Updated last year
- XmodelLM☆38Nov 19, 2024Updated last year
- generate video with voice narration from ppt/pdf Slides☆10Sep 4, 2023Updated 2 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- ☆11Dec 11, 2024Updated last year
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated last year
- ☆43Jul 10, 2024Updated last year
- ☆58Feb 27, 2025Updated 11 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆80Apr 23, 2025Updated 9 months ago
- AutoTag-YOLOv8 is an object detection project that uses the YOLOv8 model and leverages the power of SAM and DINGO models for automatic la…☆13May 3, 2023Updated 2 years ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆352Jun 2, 2025Updated 8 months ago
- [ICCV 2023] The official code for "SILT: Shadow-aware Iterative Label Tuning for Learning to Detect Shadows from Noisy Labels"☆15May 1, 2024Updated last year
- Official repository of Wavehax vocoder☆66Dec 20, 2025Updated last month
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆64May 21, 2025Updated 8 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated 11 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Oct 31, 2024Updated last year
- This repository provides a UI for the Hugging Face Transformers Agent. 🤗🕵️☆13May 16, 2023Updated 2 years ago
- a lightweight C++ LLaMA inference engine for mobile devices☆15Oct 28, 2023Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- PyTorch implementation of models from the Zamba2 series.☆186Jan 23, 2025Updated last year
- 🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza…☆845Feb 6, 2026Updated last week
- An Open Large Reasoning Model for Real-World Solutions☆1,533Feb 3, 2026Updated last week
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆40Dec 13, 2024Updated last year
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- Official implementation of our IWSLT 2023 paper "The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Tra…☆16Jul 14, 2023Updated 2 years ago
- Agent-based implementation of RAG, incorporating AI agents into the RAG pipeline to orchestrate its components and perform additional act…☆19Feb 20, 2025Updated 11 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆99Jun 23, 2024Updated last year
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆49Jul 15, 2025Updated 6 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆72Jan 13, 2025Updated last year
- official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…☆18Sep 5, 2024Updated last year
- This repository contains the code for our CVPR 2022 paper on "Non-isotropy Regularization for Proxy-based Deep Metric Learning".☆15Mar 10, 2023Updated 2 years ago
- Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation☆18Nov 12, 2025Updated 3 months ago
- Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach. This repository includes the implementation of…☆16Jun 1, 2024Updated last year
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆206Jun 23, 2025Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆125Aug 7, 2025Updated 6 months ago
- [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay☆43Jun 26, 2025Updated 7 months ago