martin-wey / peft-llm-code
Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning Techniques for Code Generation with Large Language Models".
☆19 Updated 9 months ago
Alternatives and similar repositories for peft-llm-code
Users who are interested in peft-llm-code are comparing it to the repositories listed below.
- Synthesizing realistic and diverse text-datasets from augmented LLMs ☆13 Updated 3 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes ☆18 Updated last month
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression." ☆14 Updated 7 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ☆19 Updated last month
- Code for paper: Long cOntext aliGnment via efficient preference Optimization ☆14 Updated 5 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆16 Updated 2 weeks ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation" ☆10 Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆20 Updated last week
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling' ☆23 Updated 2 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆53 Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination" ☆24 Updated 7 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆46 Updated 4 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆18 Updated 8 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆58 Updated last year
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆97 Updated last week
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver… ☆19 Updated 8 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts ☆33 Updated last year
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25] ☆88 Updated 3 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆27 Updated 7 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆74 Updated 4 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆32 Updated last year
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning ☆12 Updated last week
- (ACL 2025 main) SCOPE: Optimizing KV Cache Compression in Long-context Generation ☆29 Updated last month
- [ACL 2025] Knowledge Unlearning for Large Language Models ☆39 Updated 2 months ago