Pleias / pleias_ScholasticAI
☆36 Updated last week
Alternatives and similar repositories for pleias_ScholasticAI:
Users who are interested in pleias_ScholasticAI are comparing it to the repositories listed below
- Small Python package to measure OCR quality and other related metrics. ☆21 Updated 10 months ago
- ☆67 Updated 10 months ago
- An easy way to chunk spaCy docs. ☆18 Updated 5 months ago
- A BERT-based application for reusable text classification at scale ☆37 Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆38 Updated 2 years ago
- Repository hosting the common code for the entity-fishing clients ☆9 Updated 7 months ago
- Knowledge Graph Generator app ☆30 Updated 9 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆62 Updated 2 months ago
- Fork of the LMSYS (FastChat) code for Compar:IA, the comparison arena for French-language LLMs ☆13 Updated this week
- The NLP Bias Identification Toolkit ☆36 Updated last year
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an… ☆26 Updated 4 months ago
- Layout Analysis Dataset with Segmonto (LADaS) ☆19 Updated last month
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03. ☆22 Updated 6 months ago
- Libraries, Archives and Museums (LAM) ☆82 Updated 2 years ago
- 🌸 Train floret vectors ☆18 Updated last year
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training ☆52 Updated 3 weeks ago
- C++ inference engine for running GLiNER (Generalist and Lightweight Named Entity Recognition) models ☆23 Updated last month
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data. ☆13 Updated 2 months ago
- Link raw affiliations to ROR IDs ☆25 Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆63 Updated 5 months ago
- Open source text annotation software created by the French supreme court, the 'Cour de cassation' ☆19 Updated this week
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆57 Updated 8 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆26 Updated 3 weeks ago
- NLP with Rust for Python 🦀🐍 ☆60 Updated 7 months ago
- Efficient BM25 with DuckDB 🦆 ☆36 Updated 3 weeks ago
- ☆21 Updated last year
- PDF parser powered by grobid ☆26 Updated 5 months ago
- Pre-train Static Word Embeddings ☆34 Updated this week