aiwaves-cn / Dive-into-LLMsLinks

The official github repo for the open online courses: "Dive into LLMs".

☆10

Alternatives and similar repositories for Dive-into-LLMs

Users that are interested in Dive-into-LLMs are comparing it to the libraries listed below

Sorting:

YeFD / RRAG
The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…
☆33Updated 7 months ago
siyuyuan / evoagent
Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"
☆115Updated 8 months ago
test-time-interaction / TTI
☆48Updated last month
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
GAIR-NLP / OlympicArena
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆102Updated 4 months ago
aorwall / moatless-testbeds
Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…
☆13Updated 3 months ago
THUDM / Self-Contrast
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment
☆21Updated last year
du-nlp-lab / MLR-Copilot
☆66Updated 3 months ago
NanshineLoong / Self-Evolving-Benchmark
A framework for evolving and testing question-answering datasets with various models.
☆16Updated last year
KbsdJames / omni-math-rule
The rule-based evaluation subset and code implementation of Omni-MATH
☆22Updated 6 months ago
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆102Updated 7 months ago
hsaest / Agent-Planning-Analysis
[NAACL'25] "Revealing the Barriers of Language Agents in Planning"
☆12Updated 3 weeks ago
AkariAsai / OpenScholar_ExpertEval
This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.
☆25Updated 7 months ago
kyegomez / MultiModal-ToT
Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement
☆16Updated 8 months ago
bradhilton / o1-chain-of-thought
o1 Chain of Thought Examples
☆33Updated 9 months ago
chengyou-jia / AgentStore
[ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
☆38Updated 6 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year
dxhou / CoAct
☆27Updated last year
QwenLM / Self-Lengthen
☆88Updated 8 months ago
18907305772 / FuseAI
FuseAI Project
☆87Updated 5 months ago
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆20Updated last year
GAIR-NLP / AIME-Preview
☆71Updated 4 months ago
open-compass / CriticEval
[NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs
☆39Updated 7 months ago
GAIR-NLP / PC-Agent-E
Efficient Agent Training for Computer Use
☆114Updated last month
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆35Updated last year
MoonshotAI / Kimi-Researcher
☆63Updated 3 weeks ago
Tencent / digitalhuman
☆53Updated last week
LAMDASZ-ML / Self-Backtracking
☆47Updated 5 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆65Updated 3 months ago
thu-coai / SPaR
☆47Updated last month