yileijin / PayAttn
Official Implementation of "Pay Attention to What You Need"
☆44 Updated 10 months ago
Alternatives and similar repositories for PayAttn
Users who are interested in PayAttn are comparing it to the libraries listed below
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne… ☆28 Updated last year
- ☆44 Updated last year
- ACL 2024 ☆35 Updated 5 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving ☆21 Updated last year
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge… ☆41 Updated 10 months ago
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni… ☆39 Updated 7 months ago
- [ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization ☆17 Updated 2 years ago
- [COLING Demos 2025] An Easy-to-use Tool for Comprehensive Response Evaluation of LLMs ☆38 Updated 9 months ago
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward ☆42 Updated last month
- This repo contains my custom-styled, Python-based plots for NLP papers, including my reproductions of plots from my favourite papers ☆39 Updated last year
- 📕 DDmkTCCorpus: Diachronic Danmaku Text Comments Corpus (历时弹幕语料库) ☆15 Updated last year
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations? ☆32 Updated last year
- ☆52 Updated last year
- [EMNLP 2023 Findings] Corpus and Enhanced Pre-trained Models for Paper: "Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenario … ☆33 Updated 2 years ago
- Official dataset link for "Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension" ☆22 Updated 4 years ago
- ☆83 Updated 6 months ago
- This tool (enhance_long) aims to enhance Llama 2's long-context extrapolation capability at the lowest cost, preferably without … ☆45 Updated 2 years ago
- SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models ☆39 Updated 10 months ago
- TITAN: A Task-oriented Dialogue Dataset with Mixed-Initiative Interactions ☆33 Updated 2 years ago
- [AAAI'25] Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations ☆29 Updated 8 months ago
- ☆62 Updated last year
- High-performance rank executor for advertisement and recommendation systems, implemented in C/C++, which supports being ensembled into Java/Scala ho… ☆75 Updated last year
- ☆21 Updated 3 years ago
- ☆73 Updated last year
- ☆12 Updated 2 years ago
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs ☆73 Updated last month
- Official Code of Logits-Based-Finetuning ☆91 Updated 6 months ago
- ☆111 Updated last month
- ☆75 Updated last year
- This repository serves as the WebSocket server for Amahane Chat, facilitating real-time messaging and video calls. Built with Node.js, Ex… ☆23 Updated 2 years ago