☆18Feb 25, 2025Updated last year
Alternatives and similar repositories for Pleias-Rag
Users that are interested in Pleias-Rag are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆22May 8, 2026Updated last month
- Truth table generator, (basic) proof builder, and more, built with Next.js and Ohm☆24Dec 17, 2023Updated 2 years ago
- ☆18Jul 20, 2023Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Sep 17, 2022Updated 3 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆18Aug 17, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- Combination of the RapidFuzz library with Spacy PhraseMatcher☆11Sep 29, 2021Updated 4 years ago
- Tool for parsing English phonemes into syllables.☆10Jan 15, 2018Updated 8 years ago
- Libraries, Archives and Museums (LAM)☆89Oct 4, 2022Updated 3 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- Repository hosting the common code for the entity-fishing clients☆10May 18, 2026Updated 3 weeks ago
- In this project, we need to find out commercial products listed on Google that refer to the same entity across Amazon by comparing the si…☆11Nov 7, 2016Updated 9 years ago
- Download and load spaCy models on-the-fly☆15Feb 9, 2023Updated 3 years ago
- ☆10Nov 15, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Streamlit App that performs object detection and instance segmentation with Detectron2☆13Nov 4, 2020Updated 5 years ago
- Annotation tool (NER) for XML documents (TEI, EAD) - WIP☆11Jul 22, 2022Updated 3 years ago
- Script to get ACL Anthology☆16Jan 2, 2025Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- ☆23Jun 2, 2026Updated last week
- Collatinus Python Lemmatizer☆10Jun 1, 2021Updated 5 years ago
- The core data of the Iconclass Classification System☆17May 21, 2026Updated 3 weeks ago
- HuCit KB: a knowledge base of classical texts and citable text units.☆11Nov 17, 2021Updated 4 years ago
- Frozen Pretrained Transformers for Neural Sign Language Translation☆15Apr 23, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).☆17Jul 25, 2024Updated last year
- Blocks is a plugin for mdbook which preprocesses "Blocks" based markdown into beautiful Bootstrap components.☆11Jun 15, 2024Updated 2 years ago
- TEI Transviewer is an interface intended to the exploration of primary and secondary sources, at the document level, in historical or oth…☆14Jul 17, 2021Updated 4 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- ☆10Oct 2, 2024Updated last year
- ☆13Jun 16, 2021Updated 5 years ago
- Bilingual sentence similarity classifier using Tensorflow☆24Sep 26, 2019Updated 6 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- decontamination☆33Mar 4, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Loadable spellfix1 extension for sqlite as python package☆27Apr 21, 2024Updated 2 years ago
- German Alpaca Dataset (Cleaned + Translated)☆26Apr 6, 2023Updated 3 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated last year
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 9 months ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- A tool for turning mdbooks into slide shows☆23Feb 20, 2026Updated 3 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year