ChuangtaoChen-TUM / LiveMind
☆13Updated 5 months ago
Alternatives and similar repositories for LiveMind:
Users that are interested in LiveMind are comparing it to the libraries listed below
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 10 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated last month
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆29Updated last month
- ☆24Updated 7 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated 11 months ago
- ☆24Updated last week
- ☆37Updated 6 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆22Updated last week
- A repository for research on medium sized language models.☆76Updated 11 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆50Updated 4 months ago
- Train, tune, and infer Bamba model☆88Updated this week
- ☆24Updated last month
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆86Updated last month
- ☆62Updated 3 weeks ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆40Updated 2 months ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- ☆48Updated 5 months ago
- Lego for GRPO☆27Updated 3 weeks ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆35Updated 6 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆120Updated 8 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 5 months ago
- Train your own SOTA deductive reasoning model☆88Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- The first dense retrieval model that can be prompted like an LM☆71Updated 7 months ago
- ☆16Updated 2 weeks ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆63Updated last month
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆32Updated last month