EsmaeilNarimissa/aws-sft-grpo-budget-llm-finetune

☆19

Alternatives and similar repositories for aws-sft-grpo-budget-llm-finetune

Users who are interested in aws-sft-grpo-budget-llm-finetune are comparing it to the repositories listed below.

shuzhangzhong / HybriMoE-Preview
☆17 · Apr 9, 2025 · Updated last year
ckane / pycti-mcp
MCP (Model Context Protocol) Server for pycti
☆15 · Jul 11, 2025 · Updated last year
XavierGrool / FGFusion
☆25 · Sep 19, 2023 · Updated 2 years ago
asg017 / sqlite-dist
☆17 · Mar 21, 2026 · Updated 3 months ago
mostly-ai / mostlyai-qa
Synthetic Data Quality Assurance 🔎
☆65 · May 8, 2026 · Updated 2 months ago
facebookresearch / zero
PyTorch Implementation of Zero-Shot Vision Encoder Grafting via LLM Surrogates [ICCV'25]
☆54 · Jul 10, 2025 · Updated last year
TencentARC / SEED-Bench-R1
☆100 · Jun 23, 2025 · Updated last year
F2-Song / Weak-to-Strong-Decoding
The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
☆22 · Jun 26, 2025 · Updated last year
b3rito / b3acon
b3acon - a mail-based C2 that communicates via an in-memory C# IMAP client dynamically compiled in memory using PowerShell.
☆45 · Apr 21, 2025 · Updated last year
UKPLab / arxiv2025-inherent-limits-plms
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…
☆14 · Jan 16, 2025 · Updated last year
jiangjiechen / auction-arena
Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…
☆49 · Jan 28, 2024 · Updated 2 years ago
amazon-science / expert-upcycling
☆15 · Apr 15, 2026 · Updated 3 months ago
sail-sg / FlowReasoner
☆145 · May 6, 2025 · Updated last year
XiaoduoAILab / XmodelLM
XmodelLM
☆38 · Nov 19, 2024 · Updated last year
thedatagata / boring-ducklake-semantic-fishing-demo
☆20 · Nov 5, 2025 · Updated 8 months ago
THU-KEG / PairJudgeRM
☆15 · Apr 14, 2025 · Updated last year
HaroldChen19 / VistaDPO
[ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
☆41 · Jun 14, 2025 · Updated last year
XBEAST1 / NextPGP
NextPGP is an elegant, powerful, and modern online PGP tool built with Next.js. It can generate keys, manage keyrings, encrypt and decrypt …
☆27 · Updated this week
tianyi-lab / C3PO
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21 · Apr 9, 2025 · Updated last year
shaochenze / EAR
☆42 · May 15, 2025 · Updated last year
agno-agi / ai-app
☆12 · May 23, 2024 · Updated 2 years ago
e2b-dev / rivet-plugin-e2b
Rivet plugin to access E2B goodies
☆10 · Feb 6, 2025 · Updated last year
hazcod / shade
PoC shadow SaaS and insecure credential detection system using a browser extension.
☆45 · Jul 10, 2026 · Updated last week
Amshaker / Mobile-VideoGPT
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model
☆142 · Aug 6, 2025 · Updated 11 months ago
ricardojoserf / DoubleTeam
Listener that spawns a new tmux window for each incoming reverse shell and supports listening on many ports
☆59 · Jul 13, 2025 · Updated last year
LiamAshdown / built-with-gpt
Built with Nuxt 3 + Tailwind CSS + Supabase
☆10 · Jul 20, 2023 · Updated 3 years ago
zjunlp / DynamicKnowledgeCircuits
[ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
☆50 · Jul 18, 2025 · Updated last year
wnma3mz / tLLM
☆14 · Mar 23, 2026 · Updated 3 months ago
gabrielPav / aws-preflight
Check your AWS CLI commands for security risks before you run them.
☆33 · Apr 1, 2026 · Updated 3 months ago
kevinliddel / api-memgpt
FastAPI service for MemGPT
☆11 · Nov 28, 2023 · Updated 2 years ago
SJTU-DENG-Lab / UniCMs
☆39 · May 20, 2025 · Updated last year
tone-row / future-proof
Write data migration logic in code so you can change the shape of your data confidently as your app evolves
☆15 · Sep 29, 2023 · Updated 2 years ago
Mj23978 / sam-assistant-ui
🤖 Sam-assistant is a personal assistant that is designed to understand your documents, search the internet, and in future versions, crea…
☆15 · May 12, 2023 · Updated 3 years ago
ziozzang / Mac_mlx_phi-2_server
Test server code for the Phi-2 model. Supports the OpenAI API spec.
☆18 · Dec 15, 2023 · Updated 2 years ago
datadance-fun / DataDance
Analyze and visualize massive, real-time updated data.
☆17 · Oct 31, 2022 · Updated 3 years ago
CubiCasa / cubicasa-ios-sdk-example-project
Example project to demonstrate the use of the CubiCasa SDK for iOS
☆14 · Jun 3, 2026 · Updated last month
joelburget / microjax
A tiny autograd engine with a Jax-like API
☆75 · Jul 6, 2025 · Updated last year
RobustNLP / DeRTa
A novel approach to improve the safety of large language models, enabling them to transition effectively from an unsafe to a safe state.
☆72 · May 22, 2025 · Updated last year
uq-project / UQ
UQ: Assessing Language Models on Unsolved Questions
☆30 · Aug 26, 2025 · Updated 10 months ago