Pleias / pleias_ScholasticAI
☆68Updated 7 months ago
Alternatives and similar repositories for pleias_ScholasticAI
Users that are interested in pleias_ScholasticAI are comparing it to the libraries listed below
- ☆67Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Python library to use Pleias-RAG models☆62Updated 4 months ago
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆42Updated this week
- Libraries, Archives and Museums (LAM)☆85Updated 2 years ago
- Code for collecting, processing, and preparing datasets for the Common Pile☆227Updated last week
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆55Updated last week
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated last month
- An easy way to chunk spaCy docs.☆22Updated last year
- Plug-and-play, zero-shot document processing pipelines.☆101Updated this week
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆186Updated 4 months ago
- Robust and fast topic models with sentence-transformers.☆80Updated this week
- Knowledge Graph Generator app☆33Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆38Updated last year
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆18Updated 10 months ago
- VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for cor…☆76Updated 6 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆80Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆19Updated last year
- Discourse Analysis Tool Suite☆34Updated this week
- The NLP Bias Identification Toolkit☆37Updated 2 years ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆175Updated last month
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 6 months ago
- Layout Analysis Dataset with Segmonto (LADaS)☆21Updated 2 months ago
- An introduction to LLM Sampling☆79Updated 9 months ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- Using embeddings compressed by Product Quantization, in Javascript☆31Updated 2 years ago
- Evaluation framework for document processing models and services.☆34Updated this week
- Adding Marimo to Datasette☆20Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 10 months ago