🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)
☆17 · Mar 20, 2024 · Updated 2 years ago
Alternatives and similar repositories for vertex-ai-huggingface-inference-toolkit
Users that are interested in vertex-ai-huggingface-inference-toolkit are comparing it to the libraries listed below.
- Trade any tensors over the network ☆31 · Sep 27, 2023 · Updated 2 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies ☆21 · Apr 27, 2026 · Updated last month
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ☆50 · May 8, 2023 · Updated 3 years ago
- ☆29 · May 26, 2026 · Updated 3 weeks ago
- A Python package template using pyproject.toml, hatch, pre-commit, black, ruff, and mkdocs. ☆59 · Sep 7, 2023 · Updated 2 years ago
- 🤗 Collection of examples on how to train, deploy and monitor HuggingFace models in Google Cloud Vertex AI ☆23 · Feb 26, 2024 · Updated 2 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆24 · Sep 24, 2023 · Updated 2 years ago
- ☆10 · Oct 2, 2024 · Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba ☆38 · Oct 16, 2025 · Updated 8 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆96 · May 28, 2026 · Updated 3 weeks ago
- ☆17 · Sep 9, 2022 · Updated 3 years ago
- ☆11 · Mar 31, 2023 · Updated 3 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ☆36 · Nov 21, 2025 · Updated 6 months ago
- Semantically Search Emojis From the Command Line! ☆13 · Nov 26, 2023 · Updated 2 years ago
- Slides and code examples for my talks ☆23 · May 29, 2026 · Updated 2 weeks ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆23 · Jun 30, 2025 · Updated 11 months ago
- [NeurIPS 2024] GlotCC Dataset and Pipeline ☆20 · Apr 6, 2025 · Updated last year
- Command Line Interface for Hugging Face Inference Endpoints ☆65 · Apr 10, 2024 · Updated 2 years ago
- Keyphrase Extraction Prototypes ☆15 · Nov 24, 2016 · Updated 9 years ago
- Rhythm analysis toolkit in Python ☆13 · Sep 29, 2023 · Updated 2 years ago
- SpanMarker for Named Entity Recognition ☆476 · Apr 10, 2026 · Updated 2 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆60 · Jun 3, 2024 · Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆168 · Jan 15, 2024 · Updated 2 years ago
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens" ☆13 · Jul 23, 2023 · Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 · Oct 26, 2022 · Updated 3 years ago
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub 🤗 ☆82 · Oct 30, 2024 · Updated last year
- Modular retrievers for zero-shot multilingual IR. ☆30 · Mar 6, 2024 · Updated 2 years ago
- ☆18 · Nov 13, 2024 · Updated last year
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23) ☆11 · Aug 24, 2024 · Updated last year
- Legate Hello World Pedagogical Library ☆10 · Apr 5, 2023 · Updated 3 years ago
- Code for the paper "Abstractive Summarization Guided by Latent Hierarchical Document Structure" ☆13 · May 20, 2023 · Updated 3 years ago
- ☆22 · May 27, 2026 · Updated 3 weeks ago
- Databases is a collection of information that is organized so that it can easily be accessed, managed, and updated. In one view, datab… ☆12 · Nov 8, 2016 · Updated 9 years ago
- ☆12 · Mar 25, 2024 · Updated 2 years ago
- Computable protocol wiki ☆11 · Mar 26, 2018 · Updated 8 years ago
- Database of open-source Bible texts and topical and cross references ☆16 · Mar 29, 2017 · Updated 9 years ago
- Code for the MTEB leaderboard ☆31 · Feb 4, 2025 · Updated last year
- This project showcases a comprehensive analysis of CO2 emissions in a fictitious cheese manufacturing supply chain using both graph datab… ☆11 · Sep 18, 2024 · Updated last year
- DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains ☆21 · Feb 7, 2024 · Updated 2 years ago