☆95Jan 22, 2025Updated last year
Alternatives and similar repositories for LLMsKnow
Users that are interested in LLMsKnow are comparing it to the libraries listed below.
- ☆24Dec 8, 2024Updated last year
- ☆34Nov 7, 2024Updated last year
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated last year
- ☆25Apr 3, 2024Updated 2 years ago
- ☆34Oct 13, 2025Updated 8 months ago
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆70Apr 11, 2025Updated last year
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆30Dec 8, 2025Updated 6 months ago
- Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation, ACL 2024 (main)☆14Sep 23, 2024Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆81Jun 20, 2026Updated 2 weeks ago
- Open Source Replication of Anthropic's Alignment Faking Paper☆58Apr 4, 2025Updated last year
- Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling" (https://arxiv.org/abs/25…☆26Dec 10, 2025Updated 6 months ago
- ☆16Feb 8, 2024Updated 2 years ago
- Simple Calculator: I created a simple calculator to perform operations.☆13Jun 21, 2024Updated 2 years ago
- Have an LLM write your biography, probably incorrectly☆15Dec 26, 2024Updated last year
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆86Jun 2, 2026Updated last month
- ☆12Jul 4, 2024Updated 2 years ago
- ☆12Dec 4, 2024Updated last year
- Attribution-based Parameter Decomposition☆35Jun 11, 2025Updated last year
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆35Jan 31, 2025Updated last year
- Ruby Static Checker☆30Nov 7, 2011Updated 14 years ago
- ☆11Mar 6, 2022Updated 4 years ago
- DropKAN (Dropout Kolmogorov Arnold Networks)☆19Jun 23, 2025Updated last year
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"☆29Dec 21, 2025Updated 6 months ago
- ☆14Apr 22, 2024Updated 2 years ago
- The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on …☆20Apr 27, 2023Updated 3 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆21Jun 29, 2025Updated last year
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆144Mar 26, 2024Updated 2 years ago
- My GitHub profile page repository☆16Updated this week
- Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin".☆36Jul 10, 2025Updated 11 months ago
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF"☆20Oct 11, 2024Updated last year
- grpo to train long form QA and instructions with long-form reward model☆17Jul 17, 2025Updated 11 months ago
- Acrilic on Canvas tiled map editor☆11Feb 1, 2018Updated 8 years ago
- A Small collection of great quotes from famous people☆10Nov 6, 2024Updated last year
- Code and data from the paper 'Human Feedback is not Gold Standard'☆21May 5, 2026Updated last month
- ☆35Sep 13, 2023Updated 2 years ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆309Jan 22, 2026Updated 5 months ago
- ☆17Aug 2, 2023Updated 2 years ago
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…☆15Mar 16, 2024Updated 2 years ago