raphaelsty / LeNLP
NLP with Rust for Python
★63 · Updated last month
Alternatives and similar repositories for LeNLP
Users who are interested in LeNLP compare it to the libraries listed below.
- Pre-train Static Word Embeddings ★84 · Updated last month
- ★62 · Updated last week
- High-Performance Engine for Multi-Vector Search ★116 · Updated last month
- minimal pytorch implementation of bm25 (with sparse tensors) ★102 · Updated last year
- ★48 · Updated 5 months ago
- Python library to use Pleias-RAG models ★58 · Updated 2 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism. ★61 · Updated last month
- Tree-based indexes for neural-search ★32 · Updated last year
- Efficient BM25 with DuckDB ★51 · Updated 6 months ago
- Reference-Free automatic summarization evaluation with potential hallucination detection ★100 · Updated last year
- utilities for loading and running text embeddings with onnx ★44 · Updated 11 months ago
- Efficient few-shot learning with cross-encoders. ★54 · Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ★138 · Updated last month
- An introduction to LLM Sampling ★78 · Updated 6 months ago
- Trade any tensors over the network ★30 · Updated last year
- Efficiently computing & storing token n-grams from large corpora ★24 · Updated 9 months ago
- Training code for Sparse Autoencoders on Embedding models ★38 · Updated 4 months ago
- PyTorch implementation for MRL ★18 · Updated last year
- lossily compress representation vectors using product quantization ★57 · Updated 2 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ★42 · Updated last year
- Small python package to measure OCR quality and other related metrics. ★24 · Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ★80 · Updated last year
- Chat Markup Language conversation library ★55 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ★24 · Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ★137 · Updated 2 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including… ★66 · Updated last week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ★30 · Updated 9 months ago
- ★48 · Updated last year
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ★31 · Updated 10 months ago
- ★30 · Updated 7 months ago