☆19May 17, 2025Updated 10 months ago
Alternatives and similar repositories for aws-sft-grpo-budget-llm-finetune
Users that are interested in aws-sft-grpo-budget-llm-finetune are comparing it to the libraries listed below
Sorting:
- ☆17Apr 9, 2025Updated 11 months ago
- [Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆21Oct 1, 2025Updated 5 months ago
- ☆25Sep 19, 2023Updated 2 years ago
- ☆99Jun 23, 2025Updated 8 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 8 months ago
- b3acon - a mail-based C2 that communicates via an in-memory C# IMAP client dynamically compiled in memory using PowerShell.☆45Apr 21, 2025Updated 10 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year
- ☆14Apr 14, 2025Updated 11 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Jan 28, 2024Updated 2 years ago
- ☆145May 6, 2025Updated 10 months ago
- ☆15May 23, 2025Updated 9 months ago
- Graph Neural Network-Based Anomaly Detection☆33Mar 16, 2024Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆39Jun 14, 2025Updated 9 months ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Oct 17, 2025Updated 5 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 10 months ago
- ☆12May 23, 2024Updated last year
- ☆42May 15, 2025Updated 10 months ago
- Rivet plugin to access E2B goodies☆10Feb 6, 2025Updated last year
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆79Mar 9, 2026Updated last week
- A framework for evaluating RAG pipelines, specifically adapted for the legal domain.☆73Jul 28, 2025Updated 7 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated 2 weeks ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆46Jun 12, 2025Updated 9 months ago
- Scaling Zero-Shot Reference-to-Video Generation☆64Dec 11, 2025Updated 3 months ago
- ForecastGrapher: Redefining Multivariate Time Series Forecasting with Graph Neural Networks☆32Jun 14, 2024Updated last year
- ☆39May 20, 2025Updated 10 months ago
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆227Jun 23, 2025Updated 8 months ago
- The official baseline implementations for Chronocept☆10Dec 21, 2025Updated 2 months ago
- Built with Nuxt 3 + Tailwind CSS + Supabase☆10Jul 20, 2023Updated 2 years ago
- Write data migration logic in code so you can change the shape of your data confidently as your app evolves☆15Sep 29, 2023Updated 2 years ago
- fast api for memgpt☆11Nov 28, 2023Updated 2 years ago
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.☆72May 22, 2025Updated 9 months ago
- Analysis and visualize massive real-time updated data.☆17Oct 31, 2022Updated 3 years ago
- [ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models☆94Jul 31, 2025Updated 7 months ago
- Example project to demonstrate the use of the CubiCasa SDK for iOS☆13Sep 10, 2025Updated 6 months ago
- lumiere client☆34Mar 2, 2026Updated 2 weeks ago
- ☆35Feb 23, 2026Updated 3 weeks ago
- Langchain + Docker + Neo4j☆10Oct 29, 2024Updated last year
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Jul 11, 2023Updated 2 years ago