recombee / beeformerLinks
Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems
☆106Updated 8 months ago
Alternatives and similar repositories for beeformer
Users that are interested in beeformer are comparing it to the libraries listed below
Sorting:
- Lightweight Nearest Neighbors with Flexible Backends☆322Updated 2 months ago
- High-Performance Implementation of OpenAI's TikToken.☆464Updated 5 months ago
- Build data processing and data analysis pipelines that leverage the power of LLMs 🧠☆243Updated 2 weeks ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 7 months ago
- Fast Diversification for Search & Retrieval☆432Updated 3 weeks ago
- ai for jq☆246Updated last year
- Dead Simple LLM Abliteration☆243Updated 9 months ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆286Updated 2 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆281Updated last month
- Parallel thinking for LLMs. Confidence‑gated, strategy‑driven, offline‑friendly☆274Updated 2 months ago
- A Python toolkit for chain-of-thought prompting 🐍☆179Updated 3 months ago
- ☆165Updated last year
- Lightweight Pandas monkey-patch that adds async support to map, apply, applymap, aggregate, and transform, enabling seamless handling of …☆131Updated 6 months ago
- Heirarchical Navigable Small Worlds☆101Updated 4 months ago
- OpenAI's Structured Outputs with Logprobs☆199Updated 6 months ago
- Docker-based inference engine for AMD GPUs☆230Updated last year
- Visualize text embeddings☆40Updated 2 years ago
- Run and explore Llama models locally with minimal dependencies on CPU☆190Updated last year
- Multimodal RAG to search and interact locally with technical documents of any kind☆279Updated last month
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year
- Fully neural approach for text chunking☆402Updated last month
- Examples and guides for using the VLM Run API☆300Updated last week
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆222Updated 11 months ago
- ☆280Updated 6 months ago
- Fast similarity search using DuckDB☆142Updated last year
- ☆199Updated 7 months ago
- See Through Your Models☆402Updated 5 months ago
- Proof of thought : LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL)☆360Updated last month
- A GPU Accelerated Binary Vector Store☆47Updated 9 months ago