Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models
☆29Apr 17, 2025Updated 11 months ago
Alternatives and similar repositories for llm-hessian
Users that are interested in llm-hessian are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository containing the code for Implicit Manifold Gaussian Process.☆16Aug 8, 2024Updated last year
- Minimalistic port of NanoGUI claim works with SDL API w/o external dependencies.☆12Sep 4, 2019Updated 6 years ago
- An approach for Circuit Synthesis using Dataset Threshold queries.☆14May 28, 2023Updated 2 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- Blogs that I'm actively following.☆14Sep 17, 2023Updated 2 years ago
- Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs☆12Nov 7, 2024Updated last year
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking☆25Mar 18, 2026Updated last week
- ☆11Jun 2, 2022Updated 3 years ago
- A benchmark of real-world DL kernel problems☆145Updated this week
- ☆10Apr 28, 2020Updated 5 years ago
- Wenzhou-Kean University AI-LAB☆10Jun 6, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Implementation of wd1☆24Sep 25, 2025Updated 6 months ago
- ☆57Feb 24, 2026Updated last month
- [EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples☆11Apr 6, 2023Updated 2 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- PeRL: Parameter-Efficient Reinforcement Learning☆73Mar 10, 2026Updated 2 weeks ago
- A package containing utils for the PyTorch version of the Tapas algorithm.☆11Apr 29, 2021Updated 4 years ago
- Official Implementation of "GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution"☆20Apr 3, 2024Updated last year
- Efficient Conformal Prediction via Cascaded Inference with Expanded Admission☆20Sep 15, 2021Updated 4 years ago
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆52Aug 6, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Fork of Flame repo for training of some new stuff in development☆19Mar 17, 2026Updated last week
- Codebase for the Progressive Mixed-Precision Decoding paper.☆19Jul 15, 2025Updated 8 months ago
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely☆24Jun 26, 2024Updated last year
- Perform bayesian distribution regression☆13Mar 19, 2018Updated 8 years ago
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆123Jul 4, 2025Updated 8 months ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Apr 27, 2020Updated 5 years ago
- ☆24Apr 3, 2025Updated 11 months ago
- Code Implementation of Adversarial Prompt Evaluation paper☆14Sep 18, 2025Updated 6 months ago
- MicroVIM, a simple editor implementing some basic vim features.☆27Dec 12, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆30Jul 22, 2024Updated last year
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"☆21Oct 13, 2020Updated 5 years ago
- Neural theorem proving evaluation via the Lean REPL☆23Jul 12, 2025Updated 8 months ago
- nanoGPT-like codebase for LLM training☆116Nov 7, 2025Updated 4 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Official Implementation for Inference-time Scaling of Diffusion Models through Classical Search☆31Oct 8, 2025Updated 5 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year