LightChen233 / Awesome-Multilingual-LLM
☆112 Updated 9 months ago
Alternatives and similar repositories for Awesome-Multilingual-LLM
Users who are interested in Awesome-Multilingual-LLM are comparing it to the repositories listed below.
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs). ☆56 Updated last year
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining. ☆135 Updated last year
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models ☆115 Updated 3 months ago
- A Survey on Data Selection for Language Models ☆247 Updated 4 months ago
- A curated list of awesome papers about information retrieval (IR) in the age of large language models (LLMs). These include retrieval augment… ☆75 Updated last year
- A Comprehensive Survey on Long Context Language Modeling ☆187 Updated 2 months ago
- ☆84 Updated 8 months ago
- This repo collects resources on role-playing abilities in LLMs, including datasets, papers, and applications. ☆129 Updated 11 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey` ☆186 Updated last month
- [ACM Computing Surveys 2025] This repository collects awesome surveys, resources, and papers for Lifelong Learning with Large Language Model… ☆146 Updated 3 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆64 Updated 4 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆175 Updated 2 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆167 Updated this week
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆59 Updated 11 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆119 Updated last year
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆152 Updated last year
- Counting-Stars (★) ☆83 Updated 3 months ago
- A curated list of the role of small models in the LLM era ☆104 Updated 11 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ☆81 Updated 8 months ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent ☆78 Updated 2 weeks ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji… ☆231 Updated last year
- ☆426 Updated last month
- Code implementation of synthetic continued pretraining ☆129 Updated 8 months ago
- EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural langu… ☆109 Updated last year
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era" ☆128 Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆262 Updated 2 months ago
- Personality Alignment of Language Models ☆43 Updated 2 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆135 Updated 11 months ago
- Fantastic Data Engineering for Large Language Models ☆91 Updated 8 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ☆139 Updated 10 months ago