Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"
☆29Dec 20, 2024Updated last year
Alternatives and similar repositories for sea-llm
Users that are interested in sea-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Nov 16, 2023Updated 2 years ago
- ☆12Dec 13, 2022Updated 3 years ago
- Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12…☆12Feb 10, 2024Updated 2 years ago
- ☆12Jul 30, 2025Updated 7 months ago
- Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for sing…☆16Aug 9, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 9 months ago
- The code base for the article "Pivot Based Language Modeling for Improved Neural Domain Adaptation", NAACL 2018☆16Jul 14, 2019Updated 6 years ago
- ☆23Feb 5, 2026Updated last month
- ☆24Jul 25, 2024Updated last year
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- The official repo for the Dialz Python library - a toolkit for steering vector research.☆23Updated this week
- [CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu☆27Aug 27, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated last year
- Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."☆10Jan 26, 2020Updated 6 years ago
- ☆13Jun 25, 2025Updated 9 months ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- Official code for the paper "PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains".☆50Apr 17, 2022Updated 3 years ago
- (MICCAI-2025) MedDiff-FT: Data-Efficient Diffusion Model Fine-tuning with Structural Guidance for Controllable Medical Image Synthesis☆18Jul 11, 2025Updated 8 months ago
- The repository contains resources of our paper published at AAAI 2020 ``A Dataset for Low-Resource Stylized Sequence-to-Sequence Generati…☆33Jan 21, 2020Updated 6 years ago
- ☆39Jan 15, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Text file containing NSFW words aggregated from various sources.☆10Aug 23, 2020Updated 5 years ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated 2 years ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- ☆32Feb 13, 2024Updated 2 years ago
- ☆20Apr 10, 2025Updated 11 months ago
- [NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification☆14Jul 19, 2021Updated 4 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆25Sep 13, 2024Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆43Jan 18, 2026Updated 2 months ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Jun 19, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently (ICML2025 Oral)☆29Oct 22, 2025Updated 5 months ago
- ☆113Feb 11, 2025Updated last year
- ☆37May 28, 2023Updated 2 years ago
- ☆31Nov 9, 2024Updated last year
- Learning MLPs to replace GNN☆10Jun 3, 2023Updated 2 years ago
- ☆11Dec 18, 2024Updated last year
- This is the official repository for the paper titled "Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a M…☆16Apr 29, 2025Updated 11 months ago