Python library to use Pleias-RAG models
☆68May 1, 2025Updated 10 months ago
Alternatives and similar repositories for Pleias-RAG-Library
Users that are interested in Pleias-RAG-Library are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Oct 14, 2024Updated last year
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated 11 months ago
- ☆15Apr 26, 2025Updated 11 months ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- decontamination☆27Mar 4, 2026Updated 3 weeks ago
- ☆53Jul 10, 2025Updated 8 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆83Feb 10, 2026Updated last month
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Jun 6, 2022Updated 3 years ago
- Generalist and Lightweight Model for Text Classification☆200Feb 17, 2026Updated last month
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆90Mar 13, 2026Updated 2 weeks ago
- Evaluation framework for document processing models and services.☆66Mar 19, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆17May 8, 2024Updated last year
- Fast Multimodal Semantic Deduplication & Filtering☆906Jan 20, 2026Updated 2 months ago
- ☆20Apr 24, 2025Updated 11 months ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- ModernBERT model optimized for Apple Neural Engine.☆31Jan 10, 2025Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆49Oct 20, 2025Updated 5 months ago
- ☆54Oct 13, 2025Updated 5 months ago
- Late Interaction Models Training & Retrieval☆754Mar 6, 2026Updated 3 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 10 months ago
- ☆13Dec 17, 2021Updated 4 years ago
- Pre-train Static Word Embeddings☆95Updated this week
- ☆95Jul 4, 2025Updated 8 months ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Jan 12, 2025Updated last year
- This codebase demonstrates various DSPy functionalities through practical examples.☆57Feb 16, 2025Updated last year
- ☆44Feb 11, 2026Updated last month
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction☆26May 22, 2024Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆64Feb 6, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 10 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆29Nov 18, 2025Updated 4 months ago
- ☆17Jan 5, 2023Updated 3 years ago
- code for training & evaluating Contextual Document Embedding models☆202May 14, 2025Updated 10 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 6 months ago
- benchmarks for LLM tokenizers☆17Feb 27, 2026Updated 3 weeks ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31May 11, 2020Updated 5 years ago