EXL2 quantization generalized to other models.
☆10Mar 17, 2024Updated 2 years ago
Alternatives and similar repositories for exl2-for-all
Users who are interested in exl2-for-all are comparing it to the libraries listed below.
- QuIP quantization☆64Mar 17, 2024Updated 2 years ago
- A tool for model sparsification based on torch.fx☆13Jun 3, 2024Updated last year
- ide-cap-chan is a utility for batch image captioning with natural language using various VL models☆14May 1, 2025Updated 11 months ago
- Adds a few extra samplers and schedulers to the dropdowns in recent A1111-derived web UIs for Stable Diffusion☆26Dec 5, 2025Updated 4 months ago
- ☆22Jan 15, 2025Updated last year
- ☆21May 24, 2023Updated 2 years ago
- Code for automating Korean translation with DeepL☆12Jul 27, 2023Updated 2 years ago
- QuantEase, a layer-wise quantization framework, frames the problem as discrete-structured non-convex optimization. Our work leverages Coo…☆19Feb 22, 2024Updated 2 years ago
- Website for holding generative-image-dynamics☆37Jun 23, 2024Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆108Apr 29, 2024Updated last year
- A web application demonstrating translations and summarization with Google Gemini Nano (on-device model)☆19Dec 4, 2024Updated last year
- Detects solar panels in satellite images☆14Apr 18, 2019Updated 6 years ago
- Stable Diffusion Studio☆25Aug 29, 2024Updated last year
- KoTAN: Korean Translation and Augmentation with fine-tuned NLLB☆23Jan 4, 2024Updated 2 years ago
- ☆55Oct 10, 2025Updated 5 months ago
- Scrapes Instagram across multiple profiles and creates a folder for each recognized person's face, so the images can be used for training models☆17May 16, 2023Updated 2 years ago
- A program that finds common errors in Korean message translation files (PO files)☆36Aug 27, 2024Updated last year
- Engineering demo for DSubs online subsim.☆12May 29, 2025Updated 10 months ago
- Chat with your RVC models. See website for demo:☆22Feb 15, 2024Updated 2 years ago
- 1Fichier Download Manager (KR)☆30Oct 14, 2025Updated 5 months ago
- 3rd place solution of ICDM 2022 Risk Commodities Detection on Large-Scale E-Commerce Graphs☆42Sep 15, 2022Updated 3 years ago
- MirrorMetrics: How to evaluate Stable Diffusion LoRAs. A visual diagnostic tool to detect overfitting, check dataset quality, and fix tra…☆50Feb 21, 2026Updated last month
- An enhanced SillyTavern Visual Novel Experience in an extension☆44Feb 8, 2026Updated 2 months ago
- Gives each individual character their own memory.☆30Updated this week
- Forwards standard Anthropic Claude requests to VertexAI using the official SDK.☆31Jul 31, 2024Updated last year
- Gemini-based translation API that integrates with the "Immersive Translate" plugin☆33Dec 21, 2023Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆42Feb 13, 2024Updated 2 years ago
- Unofficial Luna app for the Nvidia Shield Android TV. Play your Amazon Luna games in the cloud directly on your Nvidia Shield TV☆45May 9, 2022Updated 3 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- See which WorldInfo entries were active in the last generated message.☆37Sep 12, 2025Updated 6 months ago
- A method to rate chess engines using the STS test suite.☆19Nov 27, 2022Updated 3 years ago
- ☆19Sep 28, 2022Updated 3 years ago
- ☆94Mar 31, 2026Updated last week
- A general 2-8 bit quantization toolbox with GPTQ/AWQ/HQQ/VPTQ and easy export to onnx/onnx-runtime.☆187Mar 23, 2026Updated 2 weeks ago
- Benchmarking DeepSeek R1 API response speeds across different providers for performance comparison.☆10Feb 15, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Jun 25, 2024Updated last year
- An LLM prompt to give some semblance of referential recursive structure☆24Mar 5, 2026Updated last month
- Neuroengine is a service to share LLMs in the form of a webchat and API.☆45Oct 21, 2024Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year