EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
☆19May 17, 2025Updated 9 months ago
Alternatives and similar repositories for aws-sft-grpo-budget-llm-finetune
Users that are interested in aws-sft-grpo-budget-llm-finetune are comparing it to the libraries listed below
- ☆17Apr 9, 2025Updated 10 months ago
- This JavaScript CLI "undeletes" packages that have been removed from the NPM registry☆29Dec 18, 2025Updated 2 months ago
- ☆14Apr 14, 2025Updated 10 months ago
- ☆25Sep 19, 2023Updated 2 years ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year
- Synthetic Data Quality Assurance 🔎☆65Jan 8, 2026Updated last month
- ☆97Jun 23, 2025Updated 7 months ago
- [Preprint arXiv:2506.18810] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆21Oct 1, 2025Updated 4 months ago
- XmodelLM☆38Nov 19, 2024Updated last year
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆39Jun 14, 2025Updated 8 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 9 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 7 months ago
- ☆144May 6, 2025Updated 9 months ago
- b3acon - a mail-based C2 that communicates via an in-memory C# IMAP client dynamically compiled in memory using PowerShell.☆45Apr 21, 2025Updated 9 months ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Oct 17, 2025Updated 4 months ago
- Claude skills I'm experimenting with. Please review carefully before use.☆93Updated this week
- ☆41May 15, 2025Updated 9 months ago
- WPAUDIT: Advanced WordPress security auditing suite & vulnerability scanner. Automates pentesting with Nmap, WPScan, Nuclei, SQLMap. Comp…☆34May 27, 2025Updated 8 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 7 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 8, 2026Updated last week
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model☆135Aug 6, 2025Updated 6 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆71Jan 23, 2026Updated 3 weeks ago
- [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs☆59Aug 25, 2025Updated 5 months ago
- Scaling Zero-Shot Reference-to-Video Generation☆62Dec 11, 2025Updated 2 months ago
- ☆39May 20, 2025Updated 8 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆45Jun 12, 2025Updated 8 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- A tiny autograd engine with a Jax-like API☆74Jul 6, 2025Updated 7 months ago
- Touti Cracker is a cross-platform ethical hacking toolkit for educational purposes, featuring password cracking, WiFi auditing, and rever…☆49Jan 9, 2026Updated last month
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆41Sep 30, 2024Updated last year
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.☆71May 22, 2025Updated 8 months ago
- (ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators☆279Sep 25, 2025Updated 4 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated 2 weeks ago
- ☆87Oct 28, 2024Updated last year
- ESG Insights AI simplifies ESG data analysis with advanced AI models, ensuring compliance with GRI standards. It helps asset managers ass…☆13Oct 31, 2024Updated last year
- Listener that spawns a new tmux window for each incoming reverse shell + Supports listening on many ports☆59Jul 13, 2025Updated 7 months ago
- Kate is a Multimodal Live Assistant that ignites your browsing experience☆11Feb 15, 2025Updated last year