ashvardanian / cpp-cuda-python-starter-kit
Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11
☆16Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for cpp-cuda-python-starter-kit
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- build your own vector database -- the littlest hnsw☆21Updated 11 months ago
- A list of awesome resources and blogs on topics related to Unum☆31Updated last month
- Efficient BM25 with DuckDB 🦆☆29Updated last month
- A high-performance image processing library designed to optimize and extend the Albumentations library with specialized functions for adv…☆12Updated 2 weeks ago
- Tree-based indexes for neural-search☆28Updated 8 months ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆16Updated 7 months ago
- Tiny Semantic Versioning (SemVer) library with LLMs and GitHub CI, that doesn't depend on 300K lines of JavaScript code and fits in a sin…☆17Updated this week
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated last month
- Cortex-compatible model server for Python and TensorFlow☆16Updated last year
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆12Updated 2 years ago
- 🌩️ The Deep Learning framework based on Lightning☆10Updated 6 months ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆17Updated 3 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆36Updated 7 months ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆16Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 9 months ago
- Rust Implementation of micrograd☆51Updated 4 months ago
- Rust bindings for CTranslate2☆13Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆16Updated last month
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…☆31Updated last year
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- scraping and querying documents for LLMs☆14Updated last week
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Updated 2 months ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆40Updated 10 months ago
- 🤝 Trade any tensors over the network☆30Updated last year
- A Python library for real-time PostgreSQL event-driven cache invalidation.☆18Updated 7 months ago
- 🦖 X—LLM: Simple & Cutting Edge LLM Finetuning☆11Updated last year
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Updated 10 months ago
- A visual tool to interpret and understand PyTorch machine learning models☆15Updated 9 months ago