☆78Dec 26, 2023Updated 2 years ago
Alternatives and similar repositories for detect-pretrain-code-contamination
Users that are interested in detect-pretrain-code-contamination are comparing it to the libraries listed below
Sorting:
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM☆14Dec 27, 2023Updated 2 years ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated 11 months ago
- ☆17Apr 11, 2024Updated last year
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆19Aug 28, 2023Updated 2 years ago
- Tools for merging pretrained large language models.☆6,826Updated this week
- ☆68May 26, 2024Updated last year
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…☆26Jun 14, 2024Updated last year
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Jul 10, 2025Updated 7 months ago
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated last month
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆316Dec 20, 2023Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆79Apr 10, 2024Updated last year
- ☆23Nov 26, 2024Updated last year
- ☆20Jul 24, 2024Updated last year
- Implementation for PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs☆24Jun 6, 2024Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆262Apr 23, 2024Updated last year
- Automatically evaluate your LLMs in Google Colab☆687May 7, 2024Updated last year
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆252Oct 30, 2024Updated last year
- Evaluating LLMs with Dynamic Data☆112Feb 11, 2026Updated 3 weeks ago
- Customizable implementation of the self-instruct paper.☆1,049Mar 7, 2024Updated last year
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆29Dec 20, 2024Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Feb 11, 2026Updated 3 weeks ago
- Codebase for Merging Language Models (ICML 2024)☆863May 5, 2024Updated last year
- A fast batching API to serve LLM models☆189Apr 26, 2024Updated last year
- Reward Model을 이 용하여 언어모델의 답변을 평가하기☆29Feb 23, 2024Updated 2 years ago
- My Langchain Code archive maybe☆24Dec 25, 2023Updated 2 years ago
- ☆129Jan 22, 2024Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- ☆64Apr 9, 2024Updated last year
- Efficient fine-tuning for ko-llm models☆185Mar 18, 2024Updated last year
- ☆210Feb 3, 2024Updated 2 years ago
- [ICLR 2024 Oral] Improving Convergence and Generalization Using Parameter Symmetries☆31May 29, 2024Updated last year
- ☆142Aug 20, 2025Updated 6 months ago
- 자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가☆31May 31, 2024Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Mar 7, 2025Updated 11 months ago
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆124Apr 28, 2024Updated last year