☆131Aug 8, 2024Updated last year
Alternatives and similar repositories for LLM
Users that are interested in LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆143Sep 29, 2024Updated last year
- 通义千问的DPO训练☆64Sep 21, 2024Updated last year
- ☆98Nov 5, 2024Updated last year
- ☆10Jun 1, 2024Updated last year
- Build CUDA Neural Network From Scratch☆22Aug 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- 手搓Llama,个人学习用☆16May 21, 2024Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- ☆12Nov 5, 2024Updated last year
- A fork of HumanEval-Java from the paper "Impact of Code Language Models on Automated Program Repair"☆13Dec 11, 2024Updated last year
- Debug print operator for cudagraph debugging☆14Aug 2, 2024Updated last year
- llm & rl☆280Oct 24, 2025Updated 5 months ago
- 基于Qt的简易内部电子邮件系统☆13Jun 6, 2020Updated 5 years ago
- ☆12Mar 20, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Sep 7, 2021Updated 4 years ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated 11 months ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- ☆13Feb 1, 2024Updated 2 years ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Updated this week
- ☆15Aug 4, 2025Updated 7 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Semi-automated modelling and Model-Based Testing for CosmWasm contracts☆17Jun 28, 2024Updated last year
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated last year
- 基于Pytorch热门深度学习框架 从零开发NLP聊天机器人☆14Sep 13, 2020Updated 5 years ago
- ☆15Nov 10, 2023Updated 2 years ago
- 南开大学 大数据计算及应用; NKU Big Data☆11Sep 8, 2023Updated 2 years ago
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…☆25Jan 10, 2026Updated 2 months ago
- ☆12May 31, 2024Updated last year
- This is the source code for: Context-aware Entity Typing in Knowledge Graphs.☆16May 10, 2022Updated 3 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- A SCL Unit Testing library☆11Nov 13, 2018Updated 7 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- MCP server for creating UI flowcharts☆12Jan 5, 2025Updated last year
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends [NeurIPS 2023]☆10Jan 28, 2024Updated 2 years ago
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond☆22Apr 29, 2024Updated last year
- This repo provides the implemetation of the paper How to train your agent to read and write?☆10Dec 29, 2020Updated 5 years ago
- ☆20Jun 9, 2023Updated 2 years ago
- A Repository of Real, Recent Java Bugs☆22Jan 6, 2026Updated 2 months ago