unsloth-5090-multiple
☆63May 21, 2025Updated last year
Alternatives and similar repositories for unsloth-5090-multiple
Users that are interested in unsloth-5090-multiple are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆245Sep 30, 2025Updated 8 months ago
- ☆15Jan 11, 2025Updated last year
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆33Mar 26, 2026Updated 2 months ago
- ☆21Dec 9, 2025Updated 6 months ago
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- ☆56Feb 10, 2025Updated last year
- Scripts for text classification with llama and bert☆35Jul 23, 2025Updated 10 months ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆39Apr 7, 2026Updated 2 months ago
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 4 months ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated 2 months ago
- This project demonstrates a production-grade MLOps pipeline that deploys a YOLOv11-based face detection service on Google Kubernetes Engi…☆39Updated this week
- pure go for rwkv☆18Dec 31, 2023Updated 2 years ago
- ☆19Aug 23, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Train your own SOTA deductive reasoning model☆112Mar 6, 2025Updated last year
- ☆41May 26, 2026Updated 2 weeks ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆36Oct 13, 2025Updated 7 months ago
- 基于Funasr的[实时]AI语音助手☆24Dec 18, 2025Updated 5 months ago
- Fused Qwen3 MoE layer for faster training, compatible with Transformers, LoRA, bnb 4-bit quant, Unsloth. Also possible to train LoRA over…☆258Feb 19, 2026Updated 3 months ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- MAFAND-MT☆62Jul 9, 2024Updated last year
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon…☆28Feb 19, 2026Updated 3 months ago
- ☆114Jun 19, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official implementation of the paper "MLP Memory: A Retriever-Pretrained Memory for Large Language Models". (ICLR 2026)☆66Jan 28, 2026Updated 4 months ago
- ROSA-Tuning☆74Feb 4, 2026Updated 4 months ago
- Authenticated independently verifiable agent delegation.☆33Dec 17, 2025Updated 5 months ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆38Feb 21, 2026Updated 3 months ago
- Verify that any MCP server is running the intended and untampered code via hardware attestation.☆18May 20, 2026Updated 2 weeks ago
- Implementation of the BitLinear layer from: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits☆14Sep 11, 2024Updated last year
- k8s CSI driver for FastCFS☆13Mar 17, 2024Updated 2 years ago
- Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)☆10Apr 29, 2024Updated 2 years ago
- 📦 Repopack is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your …☆16Oct 15, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A lightweight, self-hosted infrastructure layer for deploying and managing LLM agents as resilient microservices. Features automatic r…☆18Aug 4, 2025Updated 10 months ago
- 大模型意图识别☆11Aug 14, 2024Updated last year
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆43Apr 22, 2025Updated last year
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆128Feb 15, 2026Updated 3 months ago
- Interact with various LLMs in your browser (LangChain.js, Angular)☆17May 7, 2026Updated last month
- Pages saved with SingleFile☆13Mar 16, 2024Updated 2 years ago
- Apple OpenSource download tool☆13Apr 17, 2020Updated 6 years ago