UmerHA / quanting-notesLinks
I learn about and explain quantization
☆26Updated last year
Alternatives and similar repositories for quanting-notes
Users that are interested in quanting-notes are comparing it to the libraries listed below
Sorting:
- An introduction to LLM Sampling☆79Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- ☆23Updated 2 years ago
- ☆88Updated last year
- ☆86Updated 11 months ago
- ☆19Updated last year
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆66Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- ☆124Updated 10 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 11 months ago
- Collection of autoregressive model implementation☆86Updated 4 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆143Updated 3 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- Project code for training LLMs to write better unit tests + code☆21Updated 3 months ago
- ☆66Updated 3 months ago
- A comprehensive deep dive into the world of tokens☆226Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 9 months ago
- ☆80Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- ☆141Updated last week
- ☆49Updated 6 months ago
- Efficient vector database for hundred millions of embeddings.☆207Updated last year
- look how they massacred my boy☆64Updated 10 months ago
- Lego for GRPO☆29Updated 3 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated last year
- code for training & evaluating Contextual Document Embedding models☆197Updated 3 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 7 months ago
- lossily compress representation vectors using product quantization☆59Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆120Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 11 months ago