raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆61 · Updated 9 months ago
Alternatives and similar repositories for LeNLP:
Users that are interested in LeNLP are comparing it to the libraries listed below
- Efficient BM25 with DuckDB 🦆 ☆44 · Updated 3 months ago
- Tree-based indexes for neural-search ☆29 · Updated last year
- Pre-train Static Word Embeddings ☆49 · Updated 2 weeks ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆125 · Updated 3 months ago
- Utilities for loading and running text embeddings with ONNX ☆44 · Updated 7 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Updated last year
- Reference-free automatic summarization evaluation with potential hallucination detection ☆100 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 · Updated last year
- Lightweight tools for quick and easy LLM demos ☆26 · Updated 5 months ago
- Tools to make language models a bit easier to use ☆39 · Updated 2 weeks ago
- An introduction to LLM Sampling ☆77 · Updated 3 months ago
- Minimal PyTorch implementation of BM25 (with sparse tensors) ☆97 · Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 · Updated last year
- ☆47 · Updated last year
- Training code for Sparse Autoencoders on Embedding models ☆35 · Updated 3 weeks ago
- ☆52 · Updated 6 months ago
- Using modal.com to process FineWeb-edu data ☆20 · Updated 2 weeks ago
- Library for fast text representation and classification. ☆28 · Updated last year
- A pipeline for using API calls to agnostically convert unstructured data into structured training data ☆29 · Updated 6 months ago
- Generalist and Lightweight Model for Text Classification ☆91 · Updated this week
- QLoRA for Masked Language Modeling ☆21 · Updated last year
- Genalog is an open source, cross-platform Python package allowing generation of synthetic document images with custom degradations and te… ☆42 · Updated last year
- ☆48 · Updated last year
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread ☆18 · Updated 11 months ago
- Chat Markup Language conversation library ☆55 · Updated last year
- Efficient few-shot learning with cross-encoders. ☆49 · Updated last year