the-full-stack / gpu-deploymentsLinks
Testing methods for GPU deployment
☆20Updated 3 years ago
Alternatives and similar repositories for gpu-deployments
Users that are interested in gpu-deployments are comparing it to the libraries listed below
Sorting:
- ☆27Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆33Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- Source of the FSDL 2022 labs, which are at https://github.com/full-stack-deep-learning/fsdl-text-recognizer-2022-labs☆83Updated last year
- ☆68Updated 9 months ago
- ☆79Updated last year
- ☆80Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆120Updated 8 months ago
- manage histories of LLM applied applications☆91Updated 2 years ago
- ☆20Updated last year
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended☆33Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 4 months ago
- Large Language Model (LLM) Inference API and Chatbot☆128Updated last year
- A comprehensive deep dive into the world of tokens☆226Updated last year
- QR Codes that look nice☆63Updated 6 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated last month
- Repository for fine-tuning gemma models using unsloth for indic languages☆97Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- Framework for building and maintaining self-updating prompts for LLMs☆65Updated last year
- ☆125Updated last year
- Notes from the Latent Space paper club. Follow along or start your own!☆243Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- ☆127Updated 10 months ago
- ☆90Updated 2 years ago
- ☆45Updated 2 years ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.☆134Updated 2 weeks ago