AlexBodner / How_Much_VRAM
☆99Updated 6 months ago
Alternatives and similar repositories for How_Much_VRAM:
Users that are interested in How_Much_VRAM are comparing it to the libraries listed below
- ☆111Updated 2 months ago
- Simple examples using Argilla tools to build AI☆53Updated 3 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 9 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆63Updated 4 months ago
- ☆91Updated 2 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 6 months ago
- ☆53Updated 9 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆145Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆85Updated 2 months ago
- A pipeline parallel training script for LLMs.☆128Updated this week
- Scripts to create your own moe models using mlx☆89Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 4 months ago
- Self-host LLMs with vLLM and BentoML☆92Updated this week
- ☆65Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch☆65Updated 9 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 8 months ago
- CursorCore: Assist Programming through Aligning Anything☆116Updated last month
- Distributed Inference for mlx LLm☆84Updated 7 months ago
- Embed anything.☆29Updated 9 months ago
- ☆30Updated 8 months ago
- ☆172Updated 6 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆135Updated 3 weeks ago