XiaoduoAILab / XmodelLM
XmodelLM
☆39Updated 2 months ago
Alternatives and similar repositories for XmodelLM:
Users that are interested in XmodelLM are comparing it to the libraries listed below
- The first dense retrieval model that can be prompted like an LM☆64Updated 4 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated last year
- ☆59Updated this week
- ☆45Updated 4 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 2 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆35Updated this week
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆60Updated 10 months ago
- ☆36Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆18Updated 4 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆41Updated 5 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 8 months ago
- ☆20Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 2 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆88Updated 6 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 11 months ago
- ☆68Updated 7 months ago
- ☆19Updated 2 weeks ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆46Updated 2 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆57Updated 7 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆63Updated last month
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆24Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- The official implementation of Cross-Task Experience Sharing (COPS)☆18Updated 3 months ago
- World's Smallest Vision-Language Model☆24Updated 10 months ago