☆19 · May 17, 2025 · Updated 11 months ago
Alternatives and similar repositories for aws-sft-grpo-budget-llm-finetune
Users that are interested in aws-sft-grpo-budget-llm-finetune are comparing it to the libraries listed below.
- ☆17 · Apr 9, 2025 · Updated last year
- Synthetic Data Quality Assurance · ☆66 · Jan 8, 2026 · Updated 3 months ago
- ☆100 · Jun 23, 2025 · Updated 10 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding" · ☆22 · Jun 26, 2025 · Updated 10 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le… · ☆13 · Jan 16, 2025 · Updated last year
- ☆145 · May 6, 2025 · Updated 11 months ago
- ☆14 · Apr 14, 2025 · Updated last year
- XmodelLM · ☆38 · Nov 19, 2024 · Updated last year
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models · ☆41 · Jun 14, 2025 · Updated 10 months ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning · ☆52 · Oct 17, 2025 · Updated 6 months ago
- The official implementation of Preference Data Reward-Augmentation. · ☆18 · May 1, 2025 · Updated 11 months ago
- Touti Cracker is a cross-platform ethical hacking toolkit for educational purposes, featuring password cracking, WiFi auditing, and rever… · ☆50 · Apr 6, 2026 · Updated 3 weeks ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" · ☆20 · Apr 9, 2025 · Updated last year
- A tiny autograd engine with a Jax-like API · ☆75 · Jul 6, 2025 · Updated 9 months ago
- Heimdall is a data orchestration and job execution platform · ☆65 · Updated this week
- ☆42 · May 15, 2025 · Updated 11 months ago
- Rivet plugin to access E2B goodies · ☆10 · Feb 6, 2025 · Updated last year
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, … · ☆84 · Mar 9, 2026 · Updated last month
- A framework for evaluating RAG pipelines, specifically adapted for the legal domain. · ☆76 · Jul 28, 2025 · Updated 9 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas. · ☆25 · Mar 1, 2026 · Updated last month
- WPAUDIT: Advanced WordPress security auditing suite & vulnerability scanner. Automates pentesting with Nmap, WPScan, Nuclei, SQLMap. Comp… · ☆37 · May 27, 2025 · Updated 11 months ago
- Inverse Scaling in Test-Time Compute · ☆25 · Dec 3, 2025 · Updated 4 months ago
- UV kernel for Jupyter · ☆459 · May 21, 2025 · Updated 11 months ago
- ☆39 · May 20, 2025 · Updated 11 months ago
- Built with Nuxt 3 + Tailwind CSS + Supabase · ☆10 · Jul 20, 2023 · Updated 2 years ago
- The official baseline implementations for Chronocept · ☆10 · Mar 31, 2026 · Updated 3 weeks ago
- ☆14 · Mar 23, 2026 · Updated last month
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state. · ☆72 · May 22, 2025 · Updated 11 months ago
- Analysis and visualize massive real-time updated data. · ☆17 · Oct 31, 2022 · Updated 3 years ago
- An OpenAI API Compatible Honeypot Gateway · ☆17 · Mar 17, 2025 · Updated last year
- lumiere client · ☆39 · Mar 2, 2026 · Updated last month
- Test server code for Phi-2 model. support OpenAI API spec · ☆18 · Dec 15, 2023 · Updated 2 years ago
- Langchain + Docker + Neo4j · ☆10 · Oct 29, 2024 · Updated last year
- ☆35 · Feb 23, 2026 · Updated 2 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools. · ☆17 · Jul 11, 2023 · Updated 2 years ago
- Upload a document image or PDF, or provide a URL, to convert it into a structured format using SmolDocling. · ☆16 · Mar 31, 2025 · Updated last year
- Find random, interesting content from Reddit and Hacker News with just one click. · ☆13 · Sep 8, 2024 · Updated last year
- This is an AI model using SAM and Grounding DINO to segment objects in a floor plan and effectively remove them in order to get a clean a… · ☆15 · Mar 30, 2024 · Updated 2 years ago
- Think of it as an open-source alternative to expensive solutions like the MouthPad, eye-trackers, or even complex systems like Neuralink.… · ☆37 · Apr 20, 2026 · Updated last week