codefuse-ai / Awesome-Code-LLM
[TMLR] A curated list of language modeling researches for code and related datasets.
β1,386Updated this week
Related projects: β
- π¨βπ» An awesome and curated list of best code-LLM for research.β871Updated 2 months ago
- β2,490Updated 4 months ago
- Must-read Papers on LLM Agents.β1,664Updated last week
- Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-β¦β1,114Updated 3 weeks ago
- β1,704Updated 4 months ago
- A repo lists papers related to LLM based agentβ974Updated last month
- Summarize existing representative LLMs text datasets.β824Updated 2 weeks ago
- Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 πβ1,493Updated this week
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...β1,368Updated this week
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".β1,382Updated 3 months ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.β1,772Updated this week
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.β681Updated 4 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)β2,026Updated this week
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β2,120Updated 3 weeks ago
- Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Modelsβ871Updated last month
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,β¦β1,747Updated 3 months ago
- πA curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batcβ¦β2,475Updated this week
- Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)β1,433Updated 6 months ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought promptingβ2,513Updated last month
- A quick guide (especially) for trending instruction finetuning datasetsβ2,443Updated 9 months ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, β¦β3,741Updated this week
- AgentTuning: Enabling Generalized Agent Abilities for LLMsβ1,329Updated 10 months ago
- LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabiliβ¦β2,300Updated this week
- π° Must-read papers and blogs on LLM based Long Context Modeling π₯β816Updated this week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)β3,730Updated 3 weeks ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.β1,436Updated this week
- β‘FlashRAG: A Python Toolkit for Efficient RAG Researchβ1,118Updated this week
- A generalized information-seeking agent system with Large Language Models (LLMs).β1,074Updated 3 months ago
- β836Updated 2 months ago
- A unified evaluation framework for large language modelsβ2,375Updated last week