EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetuneLinks
☆19Updated 2 months ago
Alternatives and similar repositories for aws-sft-grpo-budget-llm-finetune
Users that are interested in aws-sft-grpo-budget-llm-finetune are comparing it to the libraries listed below
Sorting:
- ☆16Updated 3 months ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆36Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Updated 6 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆24Updated last month
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning.☆16Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆71Updated 4 months ago
- XmodelLM☆39Updated 7 months ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆22Updated 7 months ago
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 2 months ago
- The official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"☆29Updated last month
- ☆23Updated last month
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆22Updated last week
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Updated 4 months ago
- ☆2Updated last month
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 9 months ago
- ☆101Updated last month
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆41Updated 2 weeks ago
- ☆126Updated 2 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆59Updated last year
- ☆19Updated 4 months ago
- Generate Python Package with Simple Prompts☆72Updated 7 months ago
- ☆78Updated 8 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆115Updated 5 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆31Updated 8 months ago
- An OpenSource Deep Research library with reasoning☆148Updated last month
- ☆36Updated last month
- The original Shared Recurrent Memory Transformer implementation☆27Updated last week
- ☆20Updated last month
- Wonderful Matrices to Build Small Language Models☆44Updated 5 months ago