Pleias / pleias_ScholasticAILinks
☆71Updated 11 months ago
Alternatives and similar repositories for pleias_ScholasticAI
Users that are interested in pleias_ScholasticAI are comparing it to the libraries listed below
Sorting:
- Small python package to measure OCR quality and other related metrics.☆26Updated last year
- ☆67Updated last year
- Python library to use Pleias-RAG models☆68Updated 8 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆221Updated 5 months ago
- Pretraining data reconstruction scripts for Apertus☆113Updated 3 months ago
- Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…☆184Updated last week
- An easy way to chunk spaCy docs.☆22Updated last year
- Interroger à l'aveugle deux modèles de langage conversationnels sur des tâches exprimées en français et comparer les résultats.☆59Updated this week
- Plug-and-play document AI with zero-shot models.☆122Updated last week
- Hyperparam local dataset viewer☆27Updated this week
- Robust and fast topic models with sentence-transformers.☆88Updated last week
- Code for collecting, processing, and preparing datasets for the Common Pile☆249Updated 4 months ago
- Synthetic Text Dataset Generation for LLM projects☆55Updated 2 months ago
- Lightweight Nearest Neighbors with Flexible Backends☆330Updated last month
- lossily compress representation vectors using product quantization☆59Updated 3 months ago
- Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if t…☆85Updated last year
- ☆102Updated 7 months ago
- Tools for interactive visual exploration of semantic embeddings.☆42Updated last year
- VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for cor…☆76Updated 3 months ago
- SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions☆63Updated 9 months ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆198Updated 8 months ago
- Knowledge Graph Generator app☆34Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆68Updated 3 weeks ago
- RAG app for patent similarity search with chatgpt llm over google patents☆41Updated last year
- A simple tool that let's you explore different possible paths that an LLM might sample.☆200Updated 8 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated last week
- A Prodigy plugin for PDF annotation☆36Updated 5 months ago