Create embeddings with infinity as serverless endpoint
☆43Nov 21, 2025Updated 5 months ago
Alternatives and similar repositories for worker-infinity-embedding
Users that are interested in worker-infinity-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial)☆11Jul 19, 2024Updated last year
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Jul 7, 2023Updated 2 years ago
- RunPod worker for Stable Diffusion XL☆43Nov 21, 2025Updated 5 months ago
- ☆12Feb 22, 2024Updated 2 years ago
- SGLang is fast serving framework for large language models and vision language models.☆34Nov 24, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆29Nov 21, 2025Updated 5 months ago
- Interface for interacting with Gradient AI in Python☆15Jun 28, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- Human I/O, published at CHI 2024, Honorable Mentions Award☆15Oct 22, 2025Updated 6 months ago
- Starting point to build your own custom serverless endpoint☆132May 9, 2025Updated 11 months ago
- MANtIS - a multi-domain information seeking dialogues dataset☆22May 12, 2021Updated 4 years ago
- A web interface for SleekDB written in PHP☆11Jan 22, 2022Updated 4 years ago
- Clarify your words with emojis☆12Aug 25, 2016Updated 9 years ago
- Golang SDK for Truss☆40Apr 8, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 7 months ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Nimble matchers for Fox☆14Jul 14, 2015Updated 10 years ago
- This is the ultimate web scraping tool for extracting the most relevant data points from products on Walmart.com! this powerful scraper i…☆20Mar 6, 2023Updated 3 years ago
- [NeurIPS 2025] GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer☆26Mar 20, 2026Updated last month
- A cross platform and file system Python module for linking files.☆14Aug 3, 2017Updated 8 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- A vllm proxy server to add security and multi model management for vllm servers☆12May 30, 2024Updated last year
- Declarative AI Pipelines☆22Oct 2, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Resources for the Web MIDI API: Books, articles, software, demos etc...☆11Sep 14, 2017Updated 8 years ago
- A dual-chatbot system for learning languages based on LangChain☆13Jun 25, 2023Updated 2 years ago
- HTM Learning Algorithm Implementation for learning and generating musical sequences☆10Apr 14, 2015Updated 11 years ago
- Enabling Live Migration for Computational Notebooks.☆13Mar 11, 2024Updated 2 years ago
- Let's play with canvas drawing and WebAudio API, see if something interesting might appear. A first attempt to "Code like no one's watchi…☆11Jul 30, 2023Updated 2 years ago
- [UNMAINTAINED] Tessel 1's getting started page☆32Oct 26, 2015Updated 10 years ago
- Universal connector to LLMs for Node.js & Bun☆30Updated this week
- Personnal collection of pipes and filters I use for open-webui☆27Apr 15, 2026Updated 2 weeks ago
- Data and graphs for repos and events from We Build SG☆16Aug 29, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Web VJing for everyone.☆11May 26, 2016Updated 9 years ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,773Mar 24, 2026Updated last month
- Gulp-based toolkit for WordPress theme development (Sass and ES6)☆11Apr 11, 2017Updated 9 years ago
- German Anki Decks based on KIT lectures☆15Feb 7, 2023Updated 3 years ago
- ☆15Dec 3, 2024Updated last year
- A quarantine-ready Wikipedia game, to be played by two people☆16Feb 6, 2025Updated last year
- Unofficial Mirror of https://github.com/aireveries/RarePlanes.git☆13May 20, 2022Updated 3 years ago