XiaoduoAILab / XmodelLM
XmodelLM
☆39Updated 6 months ago
Alternatives and similar repositories for XmodelLM
Users who are interested in XmodelLM are comparing it to the libraries listed below.
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- ☆45Updated 8 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- ☆16Updated 2 months ago
- Testing PaliGemma 2 fine-tuning on a reasoning dataset☆18Updated 5 months ago
- The first dense retrieval model that can be prompted like an LM☆73Updated 3 weeks ago
- ☆64Updated 2 months ago
- ☆13Updated 5 months ago
- Using multiple LLMs for ensemble forecasting☆16Updated last year
- Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning.☆61Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆19Updated 7 months ago
- Official Repository for Task-Circuit Quantization☆20Updated this week
- ☆20Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 7 months ago
- ☆37Updated 2 years ago
- Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated last year
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆27Updated this week
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Modified beam search with periodic restarts☆12Updated 8 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆44Updated 3 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆15Updated 2 weeks ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆31Updated 9 months ago
- ☆49Updated 6 months ago
- Code for the paper "Harnessing Webpage UIs for Text-Rich Visual Understanding"☆51Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- ☆41Updated 5 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 3 months ago
- Enhancement in Multimodal Representation Learning.☆40Updated last year