cosmo3769 / Quantized-LLMs
Quantization of LLMs and benchmarking.
☆10Updated 11 months ago
Alternatives and similar repositories for Quantized-LLMs:
Users that are interested in Quantized-LLMs are comparing it to the libraries listed below
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 7 months ago
- Collection of autoregressive model implementation☆81Updated 3 weeks ago
- ☆41Updated 10 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆89Updated 2 months ago
- Notebooks to demonstrate TimmWrapper☆15Updated last month
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.☆37Updated last year
- Contains materials for my talk "You don't know TensorFlow".☆9Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 9 months ago
- Build Agentic workflows with function calling using open LLMs☆26Updated this week
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 10 months ago
- Notebooks for fine tuning pali gemma☆96Updated 2 months ago
- Set of scripts to finetune LLMs☆36Updated 11 months ago
- ☆24Updated last year
- End-to-End LLM Guide☆103Updated 8 months ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆29Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 7 months ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆23Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 9 months ago
- Code for NeurIPS LLM Efficiency Challenge☆56Updated 11 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 2 months ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆32Updated last year
- Building GPT ...☆17Updated 3 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- ☆15Updated last year
- ☆16Updated 2 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 3 months ago
- Experimentation on google's gemma model☆16Updated last year