Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"
☆29Dec 20, 2024Updated last year
Alternatives and similar repositories for sea-llm
Users that are interested in sea-llm are comparing it to the libraries listed below.
- ☆12Dec 13, 2022Updated 3 years ago
- Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12…☆12Feb 10, 2024Updated 2 years ago
- ☆12Jul 30, 2025Updated 8 months ago
- Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for sing…☆16Aug 9, 2023Updated 2 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Code for the paper "Abstractive Summarization Guided by Latent Hierarchical Document Structure"☆13May 20, 2023Updated 2 years ago
- Interpreting the latent space representations of attention head outputs for LLMs☆39Aug 13, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 10 months ago
- ☆23Feb 5, 2026Updated 2 months ago
- ☆24Jul 25, 2024Updated last year
- The official repo for the Dialz Python library - a toolkit for steering vector research.☆24Mar 26, 2026Updated 3 weeks ago
- [CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu☆27Aug 27, 2024Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆14Mar 11, 2025Updated last year
- Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."☆10Jan 26, 2020Updated 6 years ago
- ☆13Jun 25, 2025Updated 9 months ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 3 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- Official code for the paper "PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains".☆50Apr 17, 2022Updated 4 years ago
- The repository contains resources of our paper published at AAAI 2020 ``A Dataset for Low-Resource Stylized Sequence-to-Sequence Generati…☆33Jan 21, 2020Updated 6 years ago
- ☆43Jan 15, 2025Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated 2 years ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- ☆32Feb 13, 2024Updated 2 years ago
- ☆20Apr 10, 2025Updated last year
- The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im…☆24Sep 11, 2024Updated last year
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 3 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆25Sep 13, 2024Updated last year
- ☆31Nov 9, 2024Updated last year
- Learning MLPs to replace GNN☆10Jun 3, 2023Updated 2 years ago
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆12Jul 22, 2024Updated last year
- SAP Benchmark☆27Sep 18, 2024Updated last year
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆16Nov 1, 2021Updated 4 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- Implementation of Concept-level Debugging of Part-Prototype Networks☆12May 9, 2023Updated 2 years ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆25Sep 4, 2022Updated 3 years ago
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks"☆22Apr 13, 2022Updated 4 years ago