huybery / Awesome-Code-LLM
👨‍💻 An awesome and curated list of best code-LLM for research.
☆1,170 Updated 3 months ago
Alternatives and similar repositories for Awesome-Code-LLM:
Users who are interested in Awesome-Code-LLM are comparing it to the libraries listed below.
- [TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets. ☆2,288 Updated last week
- A framework for the evaluation of autoregressive code generation language models. ☆912 Updated 4 months ago
- Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024 ☆1,418 Updated 2 months ago
- Run evaluation on LLMs using human-eval benchmark ☆400 Updated last year
- 🐙 OctoPack: Instruction Tuning Code Large Language Models ☆459 Updated last month
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI ☆321 Updated this week
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,451 Updated 11 months ago
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models? ☆1,001 Updated last month
- A library for advanced large language model reasoning ☆2,065 Updated last month
- Efficient Retrieval Augmentation and Generation Framework ☆1,495 Updated 2 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,336 Updated this week
- LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step ☆509 Updated 6 months ago
- A collection of prompt and instruction datasets (awesome-prompt-datasets, awesome-instruction-dataset) for training ChatLLM models such as ChatGPT. ☆632 Updated 11 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,695 Updated 3 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,969 Updated 2 weeks ago
- ☆640 Updated 4 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,578 Updated this week
- Agentless🐱: an agentless approach to automatically solve software development problems ☆1,584 Updated 3 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,475 Updated 3 weeks ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆646 Updated 9 months ago
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca) ☆1,113 Updated last year
- Optimizing inference proxy for LLMs ☆2,112 Updated last week
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆613 Updated this week
- Tools for merging pretrained large language models. ☆5,478 Updated this week
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey. ☆743 Updated 10 months ago
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization ☆1,271 Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆1,767 Updated 7 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code" ☆395 Updated last month
- 📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥 ☆1,358 Updated last week
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆544 Updated last year