☆115Nov 10, 2024Updated last year
Alternatives and similar repositories for learn-llm
Users that are interested in learn-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- codes for Efficient Test-Time Scaling via Self-Calibration☆19Sep 13, 2025Updated 6 months ago
- ☆19Dec 3, 2021Updated 4 years ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- ☆19Oct 28, 2025Updated 5 months ago
- macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor☆15Nov 30, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆22Mar 4, 2024Updated 2 years ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆12Sep 27, 2024Updated last year
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆33May 19, 2025Updated 10 months ago
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning☆44Sep 3, 2025Updated 6 months ago
- ☆16Jun 25, 2025Updated 9 months ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,413Sep 29, 2023Updated 2 years ago
- PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency☆19Mar 29, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 使用强化学习算法Q-learning,对3D打印的路径进行规划,减少打印喷头转弯、启停,提高打印效率。☆13Jun 30, 2021Updated 4 years ago
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆498May 1, 2025Updated 10 months ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆18Feb 28, 2025Updated last year
- Train a 1B LLM with 1T tokens from scratch by personal☆792Apr 27, 2025Updated 11 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆147Feb 19, 2025Updated last year
- 用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一 个具备简单中文问答能力的chat-llama2.☆2,903May 21, 2024Updated last year
- ☆14Aug 27, 2022Updated 3 years ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆9,273Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Implementation of AdaCQR(COLING 2025)☆14Dec 30, 2024Updated last year
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆653Aug 17, 2024Updated last year
- 项目的issue会存放我的所有blog☆19Sep 12, 2025Updated 6 months ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆377Jul 21, 2024Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 4 months ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263May 9, 2024Updated last year
- Reproducible Language Agent Research☆34Jun 25, 2025Updated 9 months ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆587Jul 11, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multiling…☆12Jan 25, 2019Updated 7 years ago
- ☆17Feb 19, 2024Updated 2 years ago
- ☆10Mar 4, 2024Updated 2 years ago
- A static website for a Chatbot with Azure OpenAI, Azure Text to Speech Services and Live2D☆13Sep 4, 2024Updated last year
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆81Sep 6, 2024Updated last year
- Source code for NAACL 2022 paper Weakly Supervised Text Classification using Supervision Signals from a Language Mode☆10Jun 13, 2022Updated 3 years ago