the-full-stack / gpu-deployments
Testing methods for GPU deployment
☆20 · Updated last year
Related projects
Alternatives and complementary repositories for gpu-deployments
- ☆24 · Updated last year
- ☆25 · Updated 10 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆44 · Updated 11 months ago
- Using modal.com to process FineWeb-edu data ☆19 · Updated 2 months ago
- Retrieve the source code for any model made available on replicate.com! ☆33 · Updated 9 months ago
- Apps that run on modal.com ☆12 · Updated 5 months ago
- ☆48 · Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦 ☆61 · Updated last year
- ☆18 · Updated last month
- ☆19 · Updated 3 months ago
- ☆26 · Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 · Updated 4 months ago
- ☆67 · Updated last month
- ☆74 · Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs ☆114 · Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆97 · Updated 7 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 · Updated 2 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 · Updated last year
- Verbosity control for AI agents ☆58 · Updated 5 months ago
- Collection of recipes aiding Gen AI model development ☆83 · Updated this week
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆57 · Updated 6 months ago
- Framework for building and maintaining self-updating prompts for LLMs ☆58 · Updated 5 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php. ☆86 · Updated last week
- Writing Blog Posts with Generative Feedback Loops! ☆42 · Updated 7 months ago
- Simple examples using Argilla tools to build AI ☆38 · Updated last week
- ☆48 · Updated last year
- Comprehensive analysis of the difference in performance between QLoRA, LoRA, and full fine-tunes. ☆81 · Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU ☆30 · Updated last year
- QLoRA for Masked Language Modeling ☆20 · Updated last year