wanghanbinpanda / Large-Language-Models-for-Code
Large Language Models (LLMs) of Code
☆17Updated 2 years ago
Alternatives and similar repositories for Large-Language-Models-for-Code:
Users who are interested in Large-Language-Models-for-Code are comparing it to the libraries listed below
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆42Updated 10 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆190Updated last week
- ☆140Updated last year
- Official repository for the paper "COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis".☆11Updated 2 months ago
- The awesome agents in the era of large language models☆62Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆124Updated 10 months ago
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆72Updated 9 months ago
- The related works and background techniques about OpenAI o1☆221Updated 3 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆361Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆112Updated 7 months ago
- ☆81Updated last year
- ☆97Updated last year
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domain☆22Updated 3 months ago
- A survey of long-context LLMs from four perspectives: architecture, infrastructure, training, and evaluation☆46Updated 3 weeks ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆83Updated 9 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- ☆113Updated 7 months ago
- Collection of papers for scalable automated alignment.☆88Updated 6 months ago
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆35Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆57Updated 8 months ago
- Awesome papers for role-playing with language models☆186Updated 5 months ago
- ☆326Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆49Updated 3 weeks ago
- ☆45Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆203Updated 11 months ago
- ☆312Updated 11 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆256Updated 7 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆435Updated 6 months ago
- ☆143Updated 9 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆162Updated last year