dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,449Updated last month
Alternatives and similar repositories for WordLlama
Users that are interested in WordLlama are comparing it to the libraries listed below
- Fast State-of-the-Art Static Embeddings☆1,917Updated 2 weeks ago
- LLM Analytics☆696Updated last year
- High-performance retrieval engine for unstructured data☆1,533Updated 3 weeks ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,276Updated 8 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,845Updated 2 months ago
- clean & curate your data with LLMs.☆490Updated last year
- A hub for various industry-specific schemas to be used with VLMs.☆537Updated 6 months ago
- OCR Benchmark☆595Updated last month
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆626Updated 8 months ago
- Fast Semantic Text Deduplication & Filtering☆848Updated last month
- Fully neural approach for text chunking☆401Updated last month
- ☆747Updated last year
- ☆1,451Updated 9 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,579Updated 6 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,035Updated 9 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,413Updated this week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,453Updated 10 months ago
- ☆3,039Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆832Updated 10 months ago
- AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data☆1,096Updated 10 months ago
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,854Updated last week
- See Through Your Models☆402Updated 4 months ago
- A scientific instrument for investigating latent spaces☆736Updated 2 weeks ago
- DOM to Semantic-Markdown for use with LLMs☆935Updated 6 months ago
- An LLM-powered advanced RAG pipeline built from scratch☆854Updated last year
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆1,081Updated 10 months ago
- ☆447Updated last year
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic f…☆875Updated this week
- Improved file parsing for LLMs☆3,137Updated last year
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,550Updated last week