ChuangtaoChen-TUM / LiveMind
☆13 Updated 7 months ago
Alternatives and similar repositories for LiveMind
Users who are interested in LiveMind are comparing it to the repositories listed below.
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆17 Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆33 Updated 3 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆18 Updated last month
- A repository for research on medium sized language models. ☆76 Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆36 Updated last year
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM ☆59 Updated last year
- This repo is based on https://github.com/jiaweizzhao/GaLore ☆28 Updated 9 months ago
- Verifiers for LLM Reinforcement Learning ☆60 Updated 2 months ago
- Lego for GRPO ☆28 Updated last month
- ☆51 Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆112 Updated last month
- ☆24 Updated 9 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,… ☆47 Updated 2 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ☆36 Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- ☆20 Updated last week
- ☆19 Updated 3 months ago
- This is the official repository for Inheritune. ☆111 Updated 4 months ago
- ☆29 Updated 2 months ago
- ☆21 Updated 6 months ago
- Exploring Model Kinship for Merging Large Language Models ☆24 Updated 2 months ago
- The original Shared Recurrent Memory Transformer implementation ☆27 Updated 2 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 4 months ago
- ☆35 Updated last year
- accompanying material for sleep-time compute paper ☆95 Updated last month
- ☆32 Updated last month
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆55 Updated this week
- Pre-training code for CrystalCoder 7B LLM ☆54 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆78 Updated 9 months ago