Code release for "Idiosyncrasies in Large Language Models"
☆56 · Jul 21, 2025 · Updated 10 months ago
Alternatives and similar repositories for llm-idiosyncrasies
Users who are interested in llm-idiosyncrasies are comparing it to the libraries listed below.
- ☆34 · Nov 16, 2025 · Updated 6 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation ☆30 · Aug 19, 2025 · Updated 9 months ago
- ☆34 · Jan 26, 2025 · Updated last year
- ☆13 · Aug 14, 2022 · Updated 3 years ago
- [Nature Communications] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C… ☆27 · Apr 21, 2026 · Updated last month
- A tool for calling (and calling out to) large language models. ☆16 · Aug 13, 2024 · Updated last year
- Learning RAG by using RAG ☆10 · Sep 6, 2024 · Updated last year
- ☆13 · Nov 10, 2024 · Updated last year
- Code and instructions accompanying ICCV'23 paper Prototype-based Dataset Comparison ☆18 · Dec 15, 2023 · Updated 2 years ago
- ☆16 · Jul 11, 2023 · Updated 2 years ago
- Common MPC Pitfalls ☆18 · Updated this week
- BookWorm: A Dataset for Character Description and Analysis [EMNLP Findings 2024] ☆14 · Feb 28, 2025 · Updated last year
- Code for Learning idiolectal style variation in online register ☆10 · May 18, 2023 · Updated 3 years ago
- [ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707 ☆24 · Jun 7, 2023 · Updated 3 years ago
- ☆14 · Dec 13, 2014 · Updated 11 years ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025) ☆27 · Feb 25, 2025 · Updated last year
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆36 · Apr 7, 2025 · Updated last year
- Codebase for Obfuscated Activations Bypass LLM Latent-Space Defenses ☆31 · Feb 11, 2025 · Updated last year
- Code for paper "Improving Generalizability of Graph Anomaly Detection Models via Data Augmentation" (TKDE 2023) ☆16 · Dec 4, 2025 · Updated 6 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based … ☆12 · Mar 18, 2023 · Updated 3 years ago
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering ☆25 · Oct 26, 2025 · Updated 7 months ago
- ☆21 · Feb 10, 2025 · Updated last year
- Code for the paper: Fast and Private Inference of Deep Neural Networks by Co-designing Activation Functions ☆12 · Mar 13, 2024 · Updated 2 years ago
- ☆11 · May 1, 2022 · Updated 4 years ago
- ☆14 · Feb 5, 2014 · Updated 12 years ago
- A PyTorch implementation of "Generating Sentences from a Continuous Space" ☆13 · Feb 22, 2018 · Updated 8 years ago
- ☆16 · Jul 20, 2023 · Updated 2 years ago
- A repo for LLM jailbreak ☆14 · Sep 5, 2023 · Updated 2 years ago
- [NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" ☆45 · Oct 3, 2025 · Updated 8 months ago
- Kubernetes Tutorial for the PS2 group meetings at UC Berkeley ☆16 · Mar 23, 2023 · Updated 3 years ago
- [ICML 2026] InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem ☆24 · Apr 7, 2026 · Updated 2 months ago
- Re-identification of campus bicycles in surveillance-quality footage, including ReID model recognition, vector database retrieval, and a UI display ☆11 · Feb 13, 2024 · Updated 2 years ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries ☆33 · Mar 2, 2025 · Updated last year
- A fully from-scratch Multi-Layer Perceptron built in CUDA C++ with support for both GPU and CPU training. Includes multiple activation an… ☆21 · Oct 16, 2025 · Updated 7 months ago
- Official Repository for the ICML 2024 paper "Gradual Divergence for Seamless Adaptation: A Novel Domain Incremental Learning Method" ☆15 · Sep 20, 2024 · Updated last year
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l… ☆58 · Mar 31, 2026 · Updated 2 months ago
- Multi-Layer Sparse Autoencoders (ICLR 2025) ☆30 · Feb 6, 2026 · Updated 4 months ago
- ☆12 · Nov 9, 2018 · Updated 7 years ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models ☆56 · Aug 17, 2024 · Updated last year