zwhe99 / X-SIR
[ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models
☆30Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for X-SIR
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆43Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆60Updated 8 months ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆48Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆34Updated last month
- Code and Data Repo for ACL'23 Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Updated 10 months ago
- ☆48Updated this week
- Feeling confused about super alignment? Here is a reading list☆43Updated 10 months ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆63Updated last year
- ☆68Updated 9 months ago
- The repository for paper <Evaluating Open-QA Evaluation>☆23Updated 7 months ago
- ☆23Updated 2 months ago
- Personality Alignment of Language Models☆18Updated 2 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆47Updated 4 months ago
- [EMNLP 2024 Findings] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models☆19Updated last week
- ☆15Updated 9 months ago
- [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2☆22Updated 3 weeks ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆58Updated 8 months ago
- Do Large Language Models Know What They Don’t Know?☆85Updated 2 weeks ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆70Updated 9 months ago
- Multilingual safety benchmark for Large Language Models☆24Updated 2 months ago
- ☆36Updated last year
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks☆81Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆54Updated 10 months ago
- ☆37Updated 10 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆42Updated 2 months ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆19Updated 4 months ago
- trending projects & awesome papers about data-centric llm studies.☆32Updated 2 weeks ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆77Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆30Updated 3 months ago