Pleias / pleias_ScholasticAI
☆71Updated 9 months ago
Alternatives and similar repositories for pleias_ScholasticAI
Users who are interested in pleias_ScholasticAI are comparing it to the libraries listed below
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- ☆67Updated last year
- Python library to use Pleias-RAG models☆67Updated 6 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆205Updated 3 months ago
- Pretraining data reconstruction scripts for Apertus☆103Updated last month
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆52Updated this week
- Code for collecting, processing, and preparing datasets for the Common Pile☆243Updated 2 months ago
- Data2Neo is a library that simplifies the conversion of data in relational format to a graph knowledge database.☆28Updated last year
- PDF parser powered by grobid☆28Updated last year
- Plug-and-play, zero-shot document AI pipelines.☆117Updated last week
- Scientific Document Insight Q/A☆32Updated 2 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆81Updated last year
- VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for cor…☆77Updated last month
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- Libraries, Archives and Museums (LAM)☆88Updated 3 years ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆65Updated this week
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 8 months ago
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆191Updated 6 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆24Updated last week
- Robust and fast topic models with sentence-transformers.☆82Updated this week
- Tools for interactive visual exploration of semantic embeddings.☆39Updated last year
- A simple tool that lets you explore different possible paths that an LLM might sample.☆193Updated 6 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆65Updated last year
- LLM plugin for clustering embeddings☆82Updated last year
- A Prodigy plugin for PDF annotation☆36Updated 3 months ago
- PyLate efficient inference engine☆67Updated 2 months ago
- spaCy entry points for Curated Transformers☆32Updated 6 months ago
- Evaluation framework for document processing models and services.☆55Updated this week
- Pre-train Static Word Embeddings☆91Updated 2 months ago