swiss-ai / mmoreLinks
Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!
☆174Updated last week
Alternatives and similar repositories for mmore
Users that are interested in mmore are comparing it to the libraries listed below
Sorting:
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆211Updated 4 months ago
- Pretraining data reconstruction scripts for Apertus☆109Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆485Updated 3 months ago
- ☆267Updated 5 months ago
- Fully Open Language Models with Stellar Performance☆310Updated last month
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆446Updated this week
- Codebase for FinePDFs☆156Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆722Updated this week
- Granite 3.1 Language Models☆133Updated 5 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- ☆235Updated 3 weeks ago
- lossily compress representation vectors using product quantization☆59Updated last month
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆54Updated 3 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆137Updated this week
- ☆212Updated last month
- Benchmark and optimize LLM inference across frameworks with ease☆150Updated 3 months ago
- ☆138Updated 4 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆58Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆339Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆241Updated last week
- Code for Bolmo: Byteifying the Next Generation of Language Models☆66Updated this week
- VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for cor…☆77Updated 2 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆286Updated 2 months ago
- Build datasets using natural language☆552Updated 3 months ago
- Unified Schema-Based Information Extraction☆387Updated this week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆103Updated 7 months ago
- ☆146Updated 2 weeks ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆272Updated this week
- A Lightweight Library for AI Observability☆252Updated 10 months ago