Sumandora / remove-refusals-with-transformers
Implements harmful/harmless refusal removal using pure HF Transformers
☆903 Updated last year
Alternatives and similar repositories for remove-refusals-with-transformers
Users interested in remove-refusals-with-transformers are comparing it to the libraries listed below.
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆480 Updated last year
- Make abliterated models with transformers, easy and fast ☆74 Updated 2 months ago
- The official API server for Exllama. OAI compatible, lightweight, and fast. ☆990 Updated this week
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs ☆419 Updated last week
- Large-scale LLM inference engine ☆1,453 Updated last week
- llama.cpp fork with additional SOTA quants and improved performance ☆608 Updated this week
- ☆90 Updated 3 months ago
- This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, moti… ☆240 Updated 2 weeks ago
- Apple MLX engine for LM Studio ☆630 Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆635 Updated 3 months ago
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU) ☆634 Updated this week
- Automatically quantize GGUF models ☆184 Updated this week
- Efficient visual programming for AI language models ☆364 Updated last month
- Run DeepSeek-R1 GGUFs on KTransformers ☆236 Updated 3 months ago
- VS Code extension for LLM-assisted code/text completion ☆807 Updated last week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆154 Updated last year
- Using Groq or OpenAI or Ollama to create o1-like reasoning chains ☆295 Updated 9 months ago
- Model swapping for llama.cpp (or any local OpenAPI compatible server) ☆969 Updated this week
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆347 Updated 4 months ago
- Force DeepSeek r1 models to think for as long as you wish ☆368 Updated 4 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second. ☆177 Updated last week
- A benchmark for emotional intelligence in large language models ☆306 Updated 11 months ago
- Evaluating and unaligning Chinese LLM censorship ☆63 Updated last month
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,216 Updated 3 weeks ago
- Sync your thinking with AI reasoning models to achieve deeper cognitive alignment. Follow, learn, and iterate the thought within one turn ☆89 Updated 4 months ago
- Deeper Seeker is a simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT. It is an agentic research tool to reason, cr… ☆408 Updated last month
- LM inference server implementation based on *.cpp. ☆226 Updated this week
- Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLa… ☆633 Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,381 Updated this week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆2,957 Updated last week