UmerHA / quanting-notes
I learn about and explain quantization
☆26Updated 9 months ago
Alternatives and similar repositories for quanting-notes:
Users that are interested in quanting-notes are comparing it to the libraries listed below
- An introduction to LLM Sampling☆75Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- ☆24Updated last year
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- ☆19Updated 5 months ago
- Using modal.com to process FineWeb-edu data☆19Updated last month
- Cerule - A Tiny Mighty Vision Model☆67Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 5 months ago
- NLP with Rust for Python 🦀🐍☆60Updated 7 months ago
- ☆76Updated 7 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- Collection of autoregressive model implementation☆77Updated 3 weeks ago
- ☆65Updated 8 months ago
- ☆37Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆52Updated 11 months ago
- Verbosity control for AI agents☆59Updated 8 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆119Updated last month
- Training code for Sparse Autoencoders on Embedding models☆35Updated 2 months ago
- ☆48Updated 2 months ago
- ☆48Updated last year
- utilities for loading and running text embeddings with onnx☆43Updated 5 months ago
- ☆27Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 10 months ago
- My personal site☆70Updated 5 months ago
- alternative way to calculating self attention☆18Updated 8 months ago
- ☆87Updated 11 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆119Updated 2 weeks ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 2 months ago