swiss-ai / mmoreLinks
Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!
☆180Updated last week
Alternatives and similar repositories for mmore
Users that are interested in mmore are comparing it to the libraries listed below
Sorting:
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆218Updated 4 months ago
- Unified Schema-Based Information Extraction☆462Updated 2 weeks ago
- Simple UI for debugging correlations of text embeddings☆306Updated 7 months ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang☆96Updated last week
- Pretraining data reconstruction scripts for Apertus☆111Updated 2 months ago
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python.☆168Updated this week
- Granite 3.1 Language Models☆136Updated 6 months ago
- ☆269Updated 6 months ago
- Interroger à l'aveugle deux modèles de langage conversationnels sur des tâches exprimées en français et comparer les résultats.☆56Updated this week
- A Lightweight Library for AI Observability☆253Updated 10 months ago
- lossily compress representation vectors using product quantization☆59Updated 2 months ago
- Fully Open Language Models with Stellar Performance☆312Updated last month
- 🤗 Benchmark Large Language Models Reliably On Your Data☆423Updated last week
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆620Updated this week
- Datamodels for hugging face tokenizers☆86Updated last week
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆791Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆112Updated 2 weeks ago
- Your buddy in the (L)LM space.☆64Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆494Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆250Updated this week
- Blueprint by Mozilla.ai for answering questions about structured documents☆37Updated 9 months ago
- ☆138Updated 4 months ago
- ☆236Updated last month
- Codebase for FinePDFs☆159Updated 2 months ago
- Tech Report of the Apertus LLM Suite☆127Updated 3 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Utils for Unsloth https://github.com/unslothai/unsloth☆186Updated this week
- ☆217Updated 2 months ago