AlexBodner / How_Much_VRAM
☆98Updated 5 months ago
Alternatives and similar repositories for How_Much_VRAM:
Users who are interested in How_Much_VRAM are comparing it to the libraries listed below.
- Simple examples using Argilla tools to build AI☆53Updated 3 months ago
- ☆111Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 8 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆145Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆82Updated last month
- ☆53Updated 8 months ago
- ☆78Updated last month
- Self-hosted LLM chatbot arena, with yourself as the only judge☆36Updated last year
- Scripts to create your own MoE models using MLX☆86Updated 11 months ago
- An automated tool for discovering insights from research paper corpora☆136Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago
- Automatically quantize GGUF models☆154Updated this week
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 5 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated last month
- Easy-to-use, high-performance knowledge distillation for LLMs☆46Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆130Updated this week
- LLM reads a paper and produces a working prototype☆48Updated 2 weeks ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆59Updated 3 months ago
- All the world is a play, we are but actors in it.☆47Updated this week
- ☆172Updated 6 months ago
- A pipeline parallel training script for LLMs.☆122Updated 3 weeks ago
- Own your AI, search the web with it🌐😎☆79Updated last month
- A framework for evaluating function calls made by LLMs☆36Updated 6 months ago
- ☆65Updated 8 months ago
- Data preparation code for Amber 7B LLM☆85Updated 9 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆107Updated 2 weeks ago