hengjiUSTC/learn-llm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hengjiUSTC/learn-llm)

hengjiUSTC / learn-llm

☆114

Alternatives and similar repositories for learn-llm

Users that are interested in learn-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Chengsong-Huang / Self-Calibration
View on GitHub
codes for Efficient Test-Time Scaling via Self-Calibration
☆20Sep 13, 2025Updated 9 months ago
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
fxmeng / mixtral_spliter
View on GitHub
Converting Mixtral-8x7B to Mixtral-[1~7]x7B
☆22Mar 4, 2024Updated 2 years ago
RUCKBReasoning / CoT-based-Synthesizer
View on GitHub
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆32May 19, 2025Updated last year
THU-KEG / LRM-FactEval
View on GitHub
☆16Jun 25, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
nirgreshler / bayesian-online-planning
View on GitHub
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13Jun 17, 2024Updated 2 years ago
qhjqhj00 / MetaAgent
View on GitHub
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆47Sep 3, 2025Updated 9 months ago
Sphere-AI-Lab / poet
View on GitHub
Implementation for POET and POET-X for LLM pretraining
☆37Jun 9, 2026Updated 2 weeks ago
miniHuiHui / SimpleRL-reason-GRPO
View on GitHub
☆12Feb 27, 2025Updated last year
HarderThenHarder / transformers_tasks
View on GitHub
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…
☆2,421Sep 29, 2023Updated 2 years ago
maohangyu / PET-SQL
View on GitHub
PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency
☆20Mar 29, 2024Updated 2 years ago
jiahe7ay / MINI_LLM
View on GitHub
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
☆505May 1, 2025Updated last year
TIGER-AI-Lab / VisualWebInstruct
View on GitHub
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]
☆40Feb 1, 2026Updated 4 months ago
ShaneSpace / LocalMeanDecomposition
View on GitHub
The MATLAB code of the local mean decomposition using empirical optimal envelope
☆13Jan 6, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lqiang67 / generative-models-on-toys
View on GitHub
generative models on toys
☆12Sep 10, 2024Updated last year
zhanshijinwat / Steel-LLM
View on GitHub
Train a 1B LLM with 1T tokens from scratch by personal
☆807Apr 27, 2025Updated last year
zjunlp / WorfBench
View on GitHub
[ICLR 2025] Benchmarking Agentic Workflow Generation
☆153Feb 19, 2025Updated last year
DLLXW / baby-llama2-chinese
View on GitHub
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
☆2,926May 21, 2024Updated 2 years ago
aninair1905 / DynaBARN
View on GitHub
☆14Aug 27, 2022Updated 3 years ago
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,673Jun 17, 2026Updated last week
huggingface / xlnet
View on GitHub
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆26Jun 27, 2019Updated 7 years ago
lamda-bbo / mcts-transfer
View on GitHub
Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".
☆13Nov 28, 2024Updated last year
HIT-SCIR / Chinese-Mixtral-8x7B
View on GitHub
中文Mixtral-8x7B（Chinese-Mixtral-8x7B）
☆651Aug 17, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Glanvery / LLM-Travel
View on GitHub
欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
☆381Jul 21, 2024Updated last year
WangRongsheng / Aurora
View on GitHub
The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
☆263May 9, 2024Updated 2 years ago
charent / Phi2-mini-Chinese
View on GitHub
Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.
☆592Jul 11, 2024Updated last year
mmrezaee / VRTM
View on GitHub
"A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020)
☆11Apr 26, 2021Updated 5 years ago
anyscale / long-context-fine-tuning-blogpost
View on GitHub
☆17Feb 19, 2024Updated 2 years ago
forwchen / LLaVA-MoLE
View on GitHub
☆10Mar 4, 2024Updated 2 years ago
isl-org / 0shot-object-insertion
View on GitHub
Simulation and robot code for contact-rich household object insertion (ICRA 2023).
☆24Dec 18, 2024Updated last year
yifeiwang77 / Self-Correction
View on GitHub
☆20Nov 3, 2024Updated last year
upiterbarg / diff_history
View on GitHub
[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
☆20Aug 20, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
AI-Study-Han / Zero-Qwen-VL
View on GitHub
训练一个对中文支持更好的LLaVA模型，并开源训练代码和数据。
☆82Sep 6, 2024Updated last year
maseval / MASEval
View on GitHub
Multi-Agent LLM Evaluation Docs: https://maseval.readthedocs.io/
☆35May 31, 2026Updated 3 weeks ago
HKUST-KnowComp / WDDC
View on GitHub
Source code for NAACL 2022 paper Weakly Supervised Text Classification using Supervision Signals from a Language Mode
☆10Jun 13, 2022Updated 4 years ago
OrderAndCh4oS / phonetics-transliterator
View on GitHub
Convert bodies of text to IPA translations
☆12May 2, 2023Updated 3 years ago
yangjianxin1 / Firefly
View on GitHub
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…
☆6,643Oct 24, 2024Updated last year
infly-ai / INF-LLM
View on GitHub
The official repo of INF-34B models trained by INF Technology.
☆34Jul 25, 2024Updated last year
Zayne-sprague / To-CoT-or-not-to-CoT
View on GitHub
☆25Apr 10, 2025Updated last year