C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcome.
☆23Mar 4, 2024Updated 2 years ago
Alternatives and similar repositories for blitz-embed
Users that are interested in blitz-embed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analyze trends in articles published on arXiv☆19Apr 13, 2023Updated 3 years ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆86Oct 29, 2024Updated last year
- ☆30Mar 18, 2024Updated 2 years ago
- ☆12Mar 20, 2023Updated 3 years ago
- Headless, zero-runtime video editing using MCP and FFMPEG | Pure Bash - no Python/Node runtime needed☆21Jun 5, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆24Jan 30, 2025Updated last year
- The web API server that runs program codes in an isolated environment using Docker.☆18Jul 20, 2023Updated 2 years ago
- Index and search your personal data quickly and privately.☆28Nov 20, 2021Updated 4 years ago
- ☆15Apr 26, 2025Updated last year
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- ☆17Jun 3, 2024Updated last year
- text2sql with modern LLMs (duckdb-nsql, SQLCoder etc ...)☆18Apr 13, 2024Updated 2 years ago
- ☆160Apr 17, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Using OpenAI's Whisper via whisper.cpp with SFML☆14Dec 2, 2025Updated 5 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 3 weeks ago
- ☆13Jun 29, 2024Updated last year
- Fine Tune Multimodal LLM "Idefics 2" using QLoRA.☆11Apr 20, 2024Updated 2 years ago
- ☆73May 17, 2018Updated 7 years ago
- Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlib☆19Nov 19, 2021Updated 4 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- ☆18Apr 10, 2023Updated 3 years ago
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆67Jul 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆34Mar 7, 2025Updated last year
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- Visual Hash for matching copies of visually similar images.☆16Mar 17, 2025Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated 2 years ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆49Jan 12, 2024Updated 2 years ago
- Wikipedia Citations in Wikidata☆10May 6, 2021Updated 5 years ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆159Feb 9, 2024Updated 2 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆273Mar 30, 2026Updated last month
- An easy way to chunk spaCy docs.☆23Aug 14, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆24Mar 3, 2024Updated 2 years ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- ☆18Jul 11, 2021Updated 4 years ago
- 80x faster and 95% accurate language identification with Fasttext☆167Jan 23, 2024Updated 2 years ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year