MrYxJ / enhance_long
This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without training, and can be used directly in the LLM inference phase.
☆45Updated last year
Alternatives and similar repositories for enhance_long:
Users that are interested in enhance_long are comparing it to the libraries listed below
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…☆40Updated last year
- CCKS‘2021:《SGSum:一个面向体育赛事摘要的人工标注数据集》☆21Updated 3 years ago
- [COLING'22] Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER"☆44Updated 6 months ago
- [COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆38Updated last week
- Official Implementation of "Pay Attention to What You Need"☆41Updated 2 weeks ago
- Collecting personality-indicative data for role-playing agents.☆22Updated 3 weeks ago
- Support mixed-precsion inference with vllm☆80Updated 2 months ago
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆93Updated last year
- Mixed precision inference by Tensorrt-LLM☆76Updated 4 months ago
- Author: Xiangyu Dong (xdong2ps@gmail.com) and Wenhao Yu (wyu1@nd.edu). EMMLP 2021. News text generation.☆17Updated 3 years ago
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response☆40Updated 2 months ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆24Updated 11 months ago
- ☆47Updated 5 months ago
- Corpus and Enhanced Pre-trained Models for EMNLP 2023 Findings Long Paper: "Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenar…☆30Updated last year
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆60Updated 5 months ago
- [ACL2024 Findings] Towards Better Question Generation in QA-based Event Extraction☆44Updated last week
- ☆47Updated 8 months ago
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots☆39Updated last year
- Audio and Text Corpus for Machine Learning, published by MagicHub(An open source community of Magic Data Tech)☆38Updated 2 years ago
- An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.☆37Updated 3 months ago
- Official dataset link for ''Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension''☆24Updated 4 years ago
- ☆12Updated last year
- LLM Benchmark for Code☆31Updated 7 months ago
- ☆50Updated last year
- ☆30Updated 5 months ago
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…☆78Updated last year
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆26Updated 8 months ago
- ☆70Updated last year
- The implementation for ACL 2023 paper "Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularizatio…☆17Updated last year
- Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…☆15Updated last month