Pleias / pleias_ScholasticAILinks
☆67Updated 6 months ago
Alternatives and similar repositories for pleias_ScholasticAI
Users that are interested in pleias_ScholasticAI are comparing it to the libraries listed below
Sorting:
- ☆67Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Python library to use Pleias-RAG models☆61Updated 3 months ago
- Interroger à l'aveugle deux modèles de langage conversationnels sur des tâches exprimées en français et comparer les résultats.☆39Updated this week
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- Libraries, Archives and Museums (LAM)☆85Updated 2 years ago
- An easy way to chunk spaCy docs.☆21Updated 11 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated last week
- PDF parser powered by grobid☆28Updated last year
- Plug-and-play document processing pipelines with zero-shot models.☆86Updated 2 weeks ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆183Updated 2 months ago
- ☆27Updated last year
- Robust and fast topic models with sentence-transformers.☆76Updated last month
- Tools for interactive visual exploration of semantic embeddings.☆35Updated 11 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆30Updated 10 months ago
- Hyperparam local dataset viewer☆25Updated last week
- Pre-train Static Word Embeddings☆85Updated 2 months ago
- NLP with Rust for Python 🦀🐍☆64Updated 2 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆63Updated 2 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆56Updated 10 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- ☆79Updated 2 months ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆18Updated 9 months ago
- A spaCy wrapper for GliNER☆118Updated 6 months ago
- Generalist and Lightweight Model for Text Classification☆148Updated last month
- Code for collecting, processing, and preparing datasets for the Common Pile☆216Updated 2 weeks ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated 11 months ago
- ☆55Updated last year
- ☆49Updated 6 months ago