☆135Aug 8, 2024Updated last year
Alternatives and similar repositories for LLM
Users that are interested in LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆66Aug 23, 2024Updated last year
- ☆143Sep 29, 2024Updated last year
- 通义千问的DPO训练☆64Sep 21, 2024Updated last year
- Train a tiny LLaMA model from scratch to repeat your words using Reinforcement Learning from Human Feedback (RLHF)☆18May 23, 2024Updated last year
- llm & rl☆283Oct 24, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆26Mar 25, 2025Updated last year
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- 手搓Llama,个人学习用☆16May 21, 2024Updated last year
- [ICLR 2024 Oral] Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness.☆17Jan 19, 2024Updated 2 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Simple MRCP based voice agent with support for BOTs☆16May 7, 2019Updated 6 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 7 months ago
- An experimental Media Resource Control Protocol server☆11Jul 22, 2025Updated 8 months ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Jun 27, 2025Updated 9 months ago
- ☆24Apr 8, 2019Updated 7 years ago
- [ICCV 2025] Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models☆35Mar 20, 2026Updated 3 weeks ago
- ☆11Aug 1, 2024Updated last year
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- User-specified ICP.☆11Sep 14, 2021Updated 4 years ago
- ☆13Jan 14, 2026Updated 3 months ago
- ⚙️ Program slicer based on the Mozilla Lithium Tool for Java (also dubbed as Tandem-FL).☆11Oct 21, 2024Updated last year
- Map binary data into a beautiful chart☆15Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for L4DC 2022 paper: Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning.☆15Jul 31, 2023Updated 2 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- [IROS 2025] AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones ROS node for controlling DJI Tello drones with comprehe…☆22Jul 14, 2025Updated 9 months ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 8 months ago
- ☆20Jun 9, 2023Updated 2 years ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆27Jan 4, 2026Updated 3 months ago
- ☆52Feb 9, 2026Updated 2 months ago
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆26Nov 11, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆27Sep 15, 2025Updated 7 months ago
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…☆25Jan 10, 2026Updated 3 months ago
- [CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning☆14Sep 23, 2024Updated last year
- Pytorch Implementation of LoG 22 [Oral] -- Transductive Linear Probing: A Novel Framework for Few-Shot Node Classification☆17May 31, 2023Updated 2 years ago
- A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models☆21Feb 16, 2025Updated last year
- ☆31Dec 31, 2025Updated 3 months ago
- foundational data structures and algorithms for time- oriented data in Visual Analytics☆26May 13, 2019Updated 6 years ago