Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".
☆18Mar 13, 2023Updated 3 years ago
Alternatives and similar repositories for Skill-Neuron
Users that are interested in Skill-Neuron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?☆13Aug 16, 2023Updated 2 years ago
- ☆45Feb 22, 2026Updated last month
- ☆28May 29, 2024Updated last year
- Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)☆21Nov 21, 2024Updated last year
- Paper list of compositional zero-shot learning☆11Jul 5, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- A Logisim schematic for a CPU that implements the Brainfuck language.☆10Mar 9, 2018Updated 8 years ago
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance☆14May 15, 2024Updated last year
- ☆12Nov 18, 2022Updated 3 years ago
- UFT: Unifying Supervised and Reinforcement Fine-Tuning☆28Jun 30, 2025Updated 9 months ago
- Official implementation of the FG23 paper: "Learning Continuous Mesh Representation with Spherical Implicit Surface"☆12Sep 12, 2022Updated 3 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- ☆15May 15, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]☆12Apr 5, 2017Updated 9 years ago
- Creative Instructions Project☆11Sep 4, 2023Updated 2 years ago
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multiling…☆12Jan 25, 2019Updated 7 years ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆32Apr 16, 2025Updated 11 months ago
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago
- ☆17Feb 4, 2025Updated last year
- Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'☆13May 24, 2018Updated 7 years ago
- AMG-RAG (Agentic Medical Graph-RAG) is a comprehensive framework that automates the construction and continuous updating of Medical Knowl…☆27Feb 5, 2026Updated 2 months ago
- Papers about the trend of Entity Linking in recent years.☆11Sep 5, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the paper: Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness☆12Oct 22, 2023Updated 2 years ago
- Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".☆13May 24, 2022Updated 3 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Aug 20, 2024Updated last year
- Flexibly construct and implement attacks on Learning Parity with Noise☆11Sep 26, 2023Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 8 months ago
- Maximum mean discrepancy comparisons for single cell profiling experiments☆20Feb 9, 2022Updated 4 years ago
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆85Mar 14, 2026Updated last month
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021☆11May 16, 2021Updated 4 years ago
- Partial code for "Skill Extraction from Job Postings using Weak Supervision" at RecSysHR 2022.☆13May 19, 2023Updated 2 years ago
- Efficient Activation Function Optimization through Surrogate Modeling☆11Oct 3, 2023Updated 2 years ago
- This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).☆16Dec 14, 2023Updated 2 years ago
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆25Jan 29, 2026Updated 2 months ago
- ☆23Oct 17, 2022Updated 3 years ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 6 months ago