Sumandora / remove-refusals-with-transformers
Implements harmful/harmless refusal removal using pure HF Transformers
☆783Updated 11 months ago
Alternatives and similar repositories for remove-refusals-with-transformers
Users that are interested in remove-refusals-with-transformers are comparing it to the libraries listed below
Sorting:
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆462Updated 11 months ago
- Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLa…☆537Updated this week
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆944Updated this week
- Make abliterated models with transformers, easy and fast☆68Updated 3 weeks ago
- Large-scale LLM inference engine☆1,413Updated this week
- llama.cpp fork with additional SOTA quants and improved performance☆439Updated this week
- Web UI for ExLlamaV2☆496Updated 3 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆341Updated this week
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆253Updated 2 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆420Updated this week
- Dolphin System Messages☆304Updated 2 months ago
- Model swapping for llama.cpp (or any local OpenAPI compatible server)☆745Updated this week
- LM inference server implementation based on *.cpp.☆185Updated this week
- ☆88Updated 2 months ago
- Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.☆1,436Updated 2 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆152Updated 11 months ago
- Interface for OuteTTS models.☆1,214Updated last week
- Evaling and unaligning Chinese LLM censorship☆61Updated last week
- An Open Large Reasoning Model for Real-World Solutions☆1,488Updated 2 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 7 months ago
- Docker compose to run vLLM on Windows☆78Updated last year
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆161Updated this week
- AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:☆2,155Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆612Updated last month
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo☆241Updated last week
- An Open Source Toolkit For LLM Distillation☆594Updated last week
- A open webui function for better R1 experience☆79Updated 2 months ago
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆2,867Updated last week
- A benchmark for emotional intelligence in large language models☆289Updated 9 months ago
- This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, moti…☆202Updated this week