Pleias / pleias_ScholasticAILinks
☆70Updated 8 months ago
Alternatives and similar repositories for pleias_ScholasticAI
Users that are interested in pleias_ScholasticAI are comparing it to the libraries listed below
Sorting:
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- ☆67Updated last year
- Python library to use Pleias-RAG models☆63Updated 5 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆18Updated 2 months ago
- Code for collecting, processing, and preparing datasets for the Common Pile☆234Updated last month
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆58Updated last week
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- PyLate efficient inference engine☆66Updated last month
- Libraries, Archives and Museums (LAM)☆87Updated 3 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated 3 weeks ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- PDF parser powered by grobid☆28Updated last year
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆18Updated 11 months ago
- Interroger à l'aveugle deux modèles de langage conversationnels sur des tâches exprimées en français et comparer les résultats.☆42Updated this week
- This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…☆18Updated 4 months ago
- An easy way to chunk spaCy docs.☆22Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆65Updated last year
- Plug-and-play, zero-shot document processing pipelines.☆107Updated last week
- Robust and fast topic models with sentence-transformers.☆80Updated this week
- A spaCy wrapper for GliNER☆122Updated 8 months ago
- Pretraining data reconstruction scripts for Apertus☆97Updated last week
- Tools for interactive visual exploration of semantic embeddings.☆38Updated last year
- ☆49Updated 8 months ago
- GLiNER model in a FastAPI microservice.☆45Updated 10 months ago
- Hyperparam local dataset viewer☆24Updated this week
- Knowledge Graph Generator app☆34Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆189Updated 2 months ago
- SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions☆53Updated 6 months ago
- Datamodels for hugging face tokenizers☆77Updated 3 weeks ago