Pleias / pleias_ScholasticAI
☆61Updated 3 months ago
Alternatives and similar repositories for pleias_ScholasticAI:
Users who are interested in pleias_ScholasticAI are comparing it to the libraries listed below:
- Python library to use Pleias-RAG models☆40Updated last week
- ☆67Updated last year
- Small Python package to measure OCR quality and other related metrics.☆21Updated last year
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆15Updated 6 months ago
- A BERT-based application for reusable text classification at scale☆38Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 3 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated last month
- Plug-and-play document processing pipelines with zero-shot models. Batteries included.☆57Updated last week
- PDF parser powered by grobid☆26Updated 9 months ago
- Layout Analysis Dataset with Segmonto (LADaS)☆20Updated 3 months ago
- CLI that queries multiple language models in parallel using prompts from a CSV file☆26Updated this week
- Truly flash implementation of the DeBERTa disentangled attention mechanism.☆47Updated this week
- Knowledge Graph Generator app☆31Updated last year
- Using embeddings compressed by Product Quantization, in JavaScript☆31Updated last year
- Lightweight Nearest Neighbors with Flexible Backends☆272Updated 2 months ago
- Works-magnet: Retrieve and promote the scholarly works of your institution.☆23Updated 3 weeks ago
- ☆54Updated last year
- An easy way to chunk spaCy docs.☆20Updated 8 months ago
- Scrollership through 20m pubmed abstracts.☆26Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆17Updated 8 months ago
- The NLP Bias Identification Toolkit☆36Updated last year
- Extract networks of entities from journalistic reporting☆48Updated last year
- ☆39Updated this week
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 8 months ago
- Robust and fast topic models with sentence-transformers.☆48Updated this week
- A Prodigy plugin for PDF annotation☆31Updated last month
- ☆20Updated last year
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆29Updated 7 months ago
- An introduction to LLM Sampling☆77Updated 4 months ago