Pleias / pleias_ScholasticAI
☆62Updated 4 months ago
Alternatives and similar repositories for pleias_ScholasticAI
Users that are interested in pleias_ScholasticAI are comparing it to the libraries listed below
- Small python package to measure OCR quality and other related metrics.☆22Updated last year
- ☆67Updated last year
- Python library to use Pleias-RAG models☆53Updated last month
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆34Updated this week
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- An easy way to chunk spaCy docs.☆20Updated 9 months ago
- Montelimar - Extract text from anywhere☆77Updated last week
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆55Updated 2 weeks ago
- Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆42Updated last month
- Open-Source Synthetic Text Dataset Generation for LLM projects☆27Updated last week
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆16Updated 6 months ago
- Using embeddings compressed by Product Quantization, in JavaScript☆31Updated last year
- PDF parser powered by GROBID☆27Updated 10 months ago
- The NLP Bias Identification Toolkit☆36Updated last year
- Plug-and-play document processing pipelines with zero-shot models.☆64Updated 3 weeks ago
- Layout Analysis Dataset with Segmonto (LADaS)☆20Updated last week
- Collection of resources for RL and Reasoning☆25Updated 4 months ago
- Knowledge Graph Generator app☆31Updated last year
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆30Updated 8 months ago
- Adding Marimo to Datasette☆20Updated 2 months ago
- NLP with Rust for Python 🦀🐍☆62Updated 3 weeks ago
- Hosting examples of interactive datamapplot output☆21Updated this week
- An introduction to LLM Sampling☆78Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database.☆19Updated 11 months ago
- Tools for interactive visual exploration of semantic embeddings.☆34Updated 9 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax.☆59Updated last year
- ☆11Updated last year
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago