raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
★62 Updated 2 weeks ago
Alternatives and similar repositories for LeNLP
Users that are interested in LeNLP are comparing it to the libraries listed below
- Tree-based indexes for neural-search ★32 Updated last year
- ★57 Updated 2 weeks ago
- Efficient BM25 with DuckDB 🦆 ★49 Updated 5 months ago
- Pre-train Static Word Embeddings ★70 Updated this week
- Truly flash implementation of DeBERTa disentangled attention mechanism. ★55 Updated 2 weeks ago
- Reference-Free automatic summarization evaluation with potential hallucination detection ★99 Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ★134 Updated last week
- ★43 Updated 3 months ago
- utilities for loading and running text embeddings with onnx ★44 Updated 9 months ago
- Lightweight tools for quick and easy LLM demos ★27 Updated 8 months ago
- Training code for Sparse Autoencoders on Embedding models ★38 Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ★24 Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors) ★101 Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ★17 Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files. ★37 Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ★80 Updated last year
- lossily compress representation vectors using product quantization ★54 Updated last month
- Plug-and-play document processing pipelines with zero-shot models. ★64 Updated 3 weeks ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ★31 Updated 9 months ago
- Python library to use Pleias-RAG models ★53 Updated last month
- An introduction to LLM Sampling ★78 Updated 5 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ★41 Updated last year
- ★48 Updated last year
- ★29 Updated 6 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ★29 Updated 8 months ago
- Efficiently computing & storing token n-grams from large corpora ★23 Updated 7 months ago
- Crispy reranking models by Mixedbread ★31 Updated 3 weeks ago
- Use sync mode Playwright interactively, inside a Jupyter notebook ★14 Updated 2 months ago
- ★9 Updated 7 months ago
- Library for fast text representation and classification. ★28 Updated last year