alvarobartt / vertex-ai-huggingfaceLinks
π€ Collection of examples on how to train, deploy and monitor HuggingFace models in Google Cloud Vertex AI
β21Updated last year
Alternatives and similar repositories for vertex-ai-huggingface
Users that are interested in vertex-ai-huggingface are comparing it to the libraries listed below
Sorting:
- β78Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.β39Updated last year
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated last year
- β80Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ119Updated 8 months ago
- β89Updated 2 years ago
- Drift detection module for machine learning pipelines.β23Updated 2 years ago
- Framework for building and maintaining self-updating prompts for LLMsβ65Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β115Updated 6 months ago
- An efficient, to-the-point, and easy-to-use checklist to following when deploying an ML model into production.β30Updated 2 years ago
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.β35Updated 7 months ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooksβ57Updated 3 months ago
- Command Line Interface for Hugging Face Inference Endpointsβ66Updated last year
- β15Updated 2 years ago
- Tuning the Finetuning: An exploration of achieving success with QLoRAβ45Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 3 months ago
- Generalist and Lightweight Model for Text Classificationβ167Updated 3 weeks ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β85Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- π€ Trade any tensors over the networkβ30Updated 2 years ago
- Learn how to monitor ML systems to identify and mitigate sources of drift before model performance decay.β92Updated 3 years ago
- Chunk your text using gpt4o-mini more accuratelyβ44Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β33Updated last year
- β217Updated last year
- Codes, scripts, and notebooks on various aspects of transformer models.β27Updated 2 years ago
- β18Updated last year
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.β40Updated 11 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging coβ¦β117Updated last year
- Gzip and nearest neighbors for text classificationβ57Updated 2 years ago