XiaoduoAILab / XmodelLM
XmodelLM
☆39Updated 5 months ago
Alternatives and similar repositories for XmodelLM:
Users that are interested in XmodelLM are comparing it to the libraries listed below
- ☆16Updated 2 months ago
- ☆13Updated 4 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆45Updated 7 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 5 months ago
- ☆41Updated 4 months ago
- ☆20Updated 11 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 11 months ago
- ☆63Updated last month
- ☆26Updated last month
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- ☆61Updated 9 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆40Updated 2 months ago
- Official Repository for Task-Circuit Quantization☆19Updated this week
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- ☆24Updated 7 months ago
- The first dense retrieval model that can be prompted like an LM☆71Updated 7 months ago
- ☆19Updated 3 weeks ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 4 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated last month
- The official implementation of Cross-Task Experience Sharing (COPS)☆22Updated 6 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆28Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 8 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆30Updated 8 months ago