Create embeddings with Infinity as a serverless endpoint
☆47Nov 21, 2025Updated 7 months ago
Alternatives and similar repositories for worker-infinity-embedding
Users that are interested in worker-infinity-embedding are comparing it to the libraries listed below.
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Jul 7, 2023Updated 2 years ago
- The Runpod worker template for serving our large language model endpoints. Powered by vLLM.☆456Updated this week
- ☆13Feb 22, 2024Updated 2 years ago
- ☆15Dec 21, 2025Updated 6 months ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆31Nov 21, 2025Updated 7 months ago
- Golang SDK for Truss☆40Apr 8, 2026Updated 2 months ago
- My Gen AI research☆11Jun 3, 2024Updated 2 years ago
- A simple website to manage your Hyper-V VMs and IIS sites☆12Jan 19, 2023Updated 3 years ago
- ☆14Sep 18, 2024Updated last year
- Node.js Logical reasoning machine (WIP)☆10Dec 18, 2014Updated 11 years ago
- Finetuning a codegen model with a Python instruction set using the QLoRA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- A card reader powered by WebUSB.☆14Mar 31, 2019Updated 7 years ago
- Resources for the Web MIDI API: Books, articles, software, demos etc...☆11Sep 14, 2017Updated 8 years ago
- Declarative AI Pipelines☆22Oct 2, 2024Updated last year
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- Machinery data, made easy. Easily download and prepare common industrial datasets.☆23Feb 13, 2024Updated 2 years ago
- ☆38Jun 8, 2026Updated 3 weeks ago
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor…☆15Feb 27, 2024Updated 2 years ago
- A minimalist component for displaying dynamic geojson features on a Mapbox GL or MapLibre GL map!☆14Jul 7, 2023Updated 2 years ago
- Let's play with canvas drawing and WebAudio API, see if something interesting might appear. A first attempt to "Code like no one's watchi…☆11Jul 30, 2023Updated 2 years ago
- [UNMAINTAINED] Tessel 1's getting started page☆32Oct 26, 2015Updated 10 years ago
- Python implementation of the img2net algorithm.☆10Jan 7, 2026Updated 5 months ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆23Dec 29, 2024Updated last year
- Personal collection of pipes and filters I use for open-webui☆27Apr 15, 2026Updated 2 months ago
- Web UI for Bark by Suno.ai built with next.js☆12Jun 15, 2023Updated 3 years ago
- Data and graphs for repos and events from We Build SG☆16Aug 29, 2018Updated 7 years ago
- Web VJing for everyone.☆11May 26, 2016Updated 10 years ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,857Mar 24, 2026Updated 3 months ago
- Gulp-based toolkit for WordPress theme development (Sass and ES6)☆11Apr 11, 2017Updated 9 years ago
- A modified version of searx (the privacy-respecting metasearch engine) to only search an allowlist of sites, to build functionality simil…☆19Sep 17, 2021Updated 4 years ago
- A quarantine-ready Wikipedia game, to be played by two people☆16Feb 6, 2025Updated last year
- ☆15Dec 3, 2024Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆47Jun 9, 2025Updated last year
- ☆21Jun 4, 2024Updated 2 years ago
- Convert your Raspberry Pi into a DMX512 controller☆11Apr 14, 2024Updated 2 years ago
- New abstractions for Tessel Neopixels☆16Jul 15, 2020Updated 5 years ago
- Tokun to can tokens☆18Jun 19, 2025Updated last year
- ☆35May 9, 2024Updated 2 years ago
- Get ready to have your mind blown by the magic of vw CSS units and take your CSS acrobatics to the next level.☆15Jul 7, 2022Updated 3 years ago