ashvardanian / cpp-cuda-python-starter-kit

Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11

☆16

Related projects ⓘ

Alternatives and complementary repositories for cpp-cuda-python-starter-kit

ashvardanian / usearch-binary
Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread
☆19Updated 7 months ago
jbarrow / tinyhnsw
build your own vector database -- the littlest hnsw
☆21Updated 11 months ago
unum-cloud / awesome
A list of awesome resources and blogs on topics related to Unum
☆31Updated last month
lightonai / ducksearch
Efficient BM25 with DuckDB 🦆
☆29Updated last month
albumentations-team / albucore
A high-performance image processing library designed to optimize and extend the Albumentations library with specialized functions for adv…
☆12Updated 2 weeks ago
raphaelsty / neural-tree
Tree-based indexes for neural-search
☆28Updated 8 months ago
mixedbread-ai / binary-embeddings
Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…
☆16Updated 7 months ago
ashvardanian / tinysemver
Tiny Semantic Versioning (SemVer) library with LLMs and GitHub CI, that doesn't depend on 300K lines of JavaScript code and fits in a sin…
☆17Updated this week
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆59Updated 5 months ago
DeployQL / LintDB
Vector Database with support for late interaction and token level embeddings.
☆54Updated last month
cortexlabs / nucleus
Cortex-compatible model server for Python and TensorFlow
☆16Updated last year
modal-labs / cadre
🛠 Self-hosted, fast, and consistent remote configuration for apps.
☆12Updated 2 years ago
neuro-ml / thunder
🌩️ The Deep Learning framework based on Lightning
☆10Updated 6 months ago
withmartian / leaderboard-backend
Open sourced backend for Martian's LLM Inference Provider Leaderboard
☆17Updated 3 months ago
hamelsmu / ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.
☆36Updated 7 months ago
maxdotio / neural-solr
Neural Solr = Solr 9 + Mighty Inference + Node
☆16Updated 2 years ago
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆51Updated 9 months ago
shivance / picograd
Rust Implementation of micrograd
☆51Updated 4 months ago
jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆13Updated last year
dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…
☆16Updated last month
facebookresearch / PostText
PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…
☆31Updated last year
jackbandy / bookcorpus-datasheet
Documentation effort for the BookCorpus dataset
☆33Updated 3 years ago
zzstoatzz / raggy
scraping and querying documents for LLMs
☆14Updated last week
cfahlgren1 / hf-data-explorer
Chrome Extension for exploring Hugging Face datasets 🔎
☆48Updated 2 months ago
ashvardanian / usearch-images
Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…
☆40Updated 10 months ago
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30Updated last year
janbjorge / PGCacheWatch
A Python library for real-time PostgreSQL event-driven cache invalidation.
☆18Updated 7 months ago
KompleteAI / xllm
🦖 X—LLM: Simple & Cutting Edge LLM Finetuning
☆11Updated last year
BlackHC / llmtracer
Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp
☆14Updated 10 months ago
freedmand / interpogate
A visual tool to interpret and understand PyTorch machine learning models
☆15Updated 9 months ago