XiaoduoAILab / XmodelLMLinks

XmodelLM

☆39

Alternatives and similar repositories for XmodelLM

Users that are interested in XmodelLM are comparing it to the libraries listed below

Sorting:

huggingface / screensuite
ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!
☆99Updated last week
padas-lab-de / ir-rag-sigir24-persona-rag
☆47Updated 10 months ago
giangdip2410 / HyperRouter
Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33Updated last year
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆45Updated 5 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 8 months ago
LLM360 / k2-data-prep
☆20Updated last year
du-nlp-lab / MLR-Copilot
☆66Updated 4 months ago
rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆89Updated last year
ZihanWang314 / coeCheck
☆19Updated 5 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆27Updated 7 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
XiaoduoAILab / XmodelVLM
☆69Updated last year
Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆59Updated last year
uclaml / COPS
The official implementation of Cross-Task Experience Sharing (COPS)
☆24Updated 9 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆117Updated last week
The-Inscrutable-X / TACQ
Official Repository for Task-Circuit Quantization
☆21Updated 2 months ago
mwatkins1970 / SAE_Feature_Interpretability_Tool
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…
☆19Updated 9 months ago
a-antoniades / swe-search
☆11Updated 8 months ago
vis-nlp / ChartGemma
☆66Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
huggingface / huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆83Updated last week
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆81Updated 2 months ago
cyzus / thoughtsculpt
☆13Updated 7 months ago
EternityYW / Gemini-Commonsense-Evaluation
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"
☆36Updated last year
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆68Updated 3 months ago
Zyphra / transformers_zamba2
☆48Updated 5 months ago
annahedstroem / sanity-checks-revisited
[NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"
☆25Updated last year
ulab-uiuc / Router-R1
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
☆33Updated last month
miralab-ai / autoreason
☆40Updated 7 months ago
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆63Updated 11 months ago