huybery / Awesome-Code-LLMLinks
๐จโ๐ป An awesome and curated list of best code-LLM for research.
โ1,255Updated last year
Alternatives and similar repositories for Awesome-Code-LLM
Users that are interested in Awesome-Code-LLM are comparing it to the libraries listed below
Sorting:
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024โ1,640Updated 2 months ago
- [TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.โ3,110Updated 2 weeks ago
- A framework for the evaluation of autoregressive code generation language models.โ1,006Updated 4 months ago
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.โ787Updated last year
- A curated list of awesome LLM agents frameworks.โ1,194Updated this week
- โ672Updated last year
- A library for advanced large language model reasoningโ2,313Updated 6 months ago
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...โ2,168Updated 7 months ago
- Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.โ513Updated 8 months ago
- A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt ๆถๅฝๅ็งๅๆ ท็ๆไปคๆฐๆฎ้, ็จไบ่ฎญ็ป ChatLLM ๆจกๅใโ713Updated last year
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGIโ453Updated last month
- ๐ OctoPack: Instruction Tuning Code Large Language Modelsโ475Updated 10 months ago
- Run evaluation on LLMs using human-eval benchmarkโ425Updated 2 years ago
- [ACL 2023] Reasoning with Language Model Prompting: A Surveyโ990Updated 6 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.โ1,920Updated 4 months ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 ๐โ3,459Updated 7 months ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought promptingโ2,761Updated last year
- A collection of benchmarks and datasets for evaluating LLM.โ531Updated last year
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)โ1,135Updated last year
- LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)โ567Updated last year
- List of language agents based on paper "Cognitive Architectures for Language Agents"โ1,083Updated 10 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instructโ2,070Updated last year
- ๐ฐ Must-read papers and blogs on LLM based Long Context Modeling ๐ฅโ1,844Updated this week
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".โ1,584Updated 6 months ago
- Repo-Level Code generation papersโ224Updated 4 months ago
- โ481Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diveโฆโ971Updated last year
- Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Modelsโ1,290Updated 9 months ago
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMsโฆโ582Updated 2 weeks ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Modelsโ1,640Updated last year