☆32Apr 14, 2022Updated 3 years ago
Alternatives and similar repositories for ELLE
Users that are interested in ELLE are comparing it to the libraries listed below
- ☆13Apr 24, 2022Updated 3 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Apr 24, 2022Updated 3 years ago
- DEMix Layers for Modular Language Modeling☆54Aug 23, 2021Updated 4 years ago
- [ICML 2023] Parameter-Level Soft-Masking for Continual Learning☆19Jul 13, 2023Updated 2 years ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆83Dec 21, 2024Updated last year
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models☆92Oct 11, 2022Updated 3 years ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆26Oct 8, 2024Updated last year
- Zhijiang Cup: E-commerce Review Opinion Mining, rank 30☆15Nov 3, 2019Updated 6 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆23Aug 22, 2019Updated 6 years ago
- An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation☆23Mar 6, 2022Updated 3 years ago
- ☆68May 18, 2023Updated 2 years ago
- Diaformer: Automatic Diagnosis via Symptoms Sequence Generation☆26Nov 9, 2023Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- The source code of KESA☆31May 3, 2023Updated 2 years ago
- ☆12Mar 25, 2025Updated 11 months ago
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆38Oct 20, 2022Updated 3 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆39Apr 4, 2022Updated 3 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?" (published in TMLR)☆12Jul 10, 2022Updated 3 years ago
- Matlab/Octave toolbox for deep learning. Includes Deep Belief Nets, Stacked Autoencoders, Convolutional Neural Nets, Convolutional Autoen…☆21Jun 23, 2014Updated 11 years ago
- ☆12Jul 4, 2024Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Source code for "A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models"☆44Nov 27, 2022Updated 3 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- Online clustering of Chinese short texts☆11Nov 24, 2024Updated last year
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- ☆11Jan 10, 2020Updated 6 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 9 months ago
- ☆18Jun 23, 2025Updated 8 months ago
- ☆11Nov 11, 2022Updated 3 years ago
- ☆10Oct 14, 2020Updated 5 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year
- ☆12Apr 24, 2024Updated last year
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated last year
- ☆13Sep 8, 2024Updated last year
- ☆12Feb 7, 2021Updated 5 years ago
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 7 years ago
- Vue wrapper components for the pdfjs library☆10Dec 31, 2022Updated 3 years ago