raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆59Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for LeNLP
- Efficient BM25 with DuckDB 🦆☆29Updated last month
- Tree-based indexes for neural-search☆28Updated 8 months ago
- ☆108Updated this week
- ☆66Updated this week
- Late Interaction Models Training & Retrieval☆165Updated this week
- Tools to make language models a bit easier to use☆30Updated this week
- ☆44Updated last week
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆90Updated 8 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆98Updated 10 months ago
- Training code for Sparse Autoencoders on Embedding models☆33Updated 3 weeks ago
- Generalist and Lightweight Model for Text Classification☆49Updated last week
- ☆48Updated last year
- code for training & evaluating Contextual Document Embedding models☆117Updated this week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆77Updated 8 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 8 months ago
- Chat Markup Language conversation library☆54Updated 10 months ago
- ☆36Updated 3 months ago
- ☆27Updated last month
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- Lightweight tools for quick and easy LLM demo's☆26Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆61Updated 2 weeks ago
- An introduction to LLM Sampling☆64Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- Simple examples using Argilla tools to build AI☆40Updated this week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆28Updated last month
- ☆40Updated 2 weeks ago