LLM360/TxT360

☆25

Alternatives and similar repositories for TxT360

Users that are interested in TxT360 are comparing it to the libraries listed below.

jinlanfu / Polyglot_Prompt
Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.
☆18 · Dec 7, 2022 · Updated 3 years ago
LLM360 / MegaMath
[COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.
☆110 · Apr 4, 2025 · Updated last year
pppa2019 / swie_overmiss_llm4mt
Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"
☆12 · Aug 26, 2023 · Updated 2 years ago
Timothyxxx / LMsMBTI
An MBTI test on large language models like GPT-3.
☆28 · May 2, 2022 · Updated 4 years ago
koalazf99 / nanoverl
Collections of RLxLM experiments using minimal code
☆14 · Feb 17, 2025 · Updated last year
all-the-noises / eval-arena
☆34 · Mar 21, 2026 · Updated 4 months ago
aflah02 / TokenSmith
A comprehensive toolkit for streamlining data editing, search, and inspection for large-scale language model training and interpretabilit…
☆21 · Oct 30, 2025 · Updated 8 months ago
sail-sg / regmix
[ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)
☆194 · Feb 17, 2025 · Updated last year
yuwfan / FILTER
☆21 · Oct 13, 2021 · Updated 4 years ago
zjunlp / BiasEdit
[TrustNLP@NAACL 2025] BiasEdit: Debiasing Stereotyped Language Models via Model Editing
☆18 · Sep 30, 2025 · Updated 9 months ago
formll / resolving-scaling-law-discrepancies
☆19 · Nov 4, 2025 · Updated 8 months ago
chenllliang / MLS
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022
☆18 · May 19, 2022 · Updated 4 years ago
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆112 · Sep 26, 2024 · Updated last year
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆50 · Sep 23, 2025 · Updated 9 months ago
LLM360 / k2-train
☆58 · Jun 6, 2024 · Updated 2 years ago
PlusLabNLP / Active-IT
Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"
☆26 · Nov 16, 2023 · Updated 2 years ago
jlko / active-surrogate-estimators
☆13 · Feb 14, 2022 · Updated 4 years ago
feifeibear / DPSKV3MFU
Estimate MFU for DeepSeekV3
☆26 · Jan 5, 2025 · Updated last year
slaclab / PyEmittance
☆10 · Jul 4, 2026 · Updated 2 weeks ago
sustcsonglin / linear-attention-and-beyond-slides
☆119 · Feb 25, 2025 · Updated last year
nick11roberts / XD
☆12 · Jul 6, 2022 · Updated 4 years ago
NingMiao / InstaAug
☆15 · Dec 28, 2022 · Updated 3 years ago
ChunhuaLiu596 / WAX
The repository describing a novel dataset for word association explanations
☆13 · Sep 21, 2023 · Updated 2 years ago
gto76 / wfdl
Watch Face Description Language
☆19 · Dec 25, 2019 · Updated 6 years ago
deep-spin / sparse_continuous_distributions
This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.
☆15 · May 10, 2023 · Updated 3 years ago
hitz-zentroa / lm-contamination
The LM Contamination Index is a manually created database of contamination evidence for LMs.
☆81 · Apr 11, 2024 · Updated 2 years ago
glorgao / SelectiveDPO
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
☆47 · Jul 16, 2025 · Updated last year
YoungseogChung / calibrated-quantile-uq
Repository for Beyond Pinball Loss: Quantile Methods for Calibrated Uncertainty Quantification (NeurIPS 2024)
☆44 · Nov 25, 2024 · Updated last year
chili-lab / LT2
Official Codebase: LT2: Linear-Time Looped Transformers.
☆49 · May 27, 2026 · Updated last month
ali-vilab / matrix
☆34 · Apr 8, 2025 · Updated last year
HazyResearch / ludwig-benchmarking-toolkit
Ludwig benchmark
☆20 · May 11, 2026 · Updated 2 months ago
sustainlab-group / IS-Count
Code for reproducing IS-Count: Large-scale Object Counting with Importance Sampling (AAAI 2022)
☆26 · Nov 3, 2022 · Updated 3 years ago
JinjieNi / OpenMoE2
The official repo for "OpenMoE 2: Sparse Diffusion Language Models".
☆58 · Dec 28, 2025 · Updated 6 months ago
GAIR-NLP / ReasonEval
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆80 · Oct 9, 2025 · Updated 9 months ago
LLM360 / Reasoning360
A repo for open research on building large reasoning models
☆151 · Jul 3, 2026 · Updated 2 weeks ago
armenjeddi / loopformer
LoopFormer is an elastic-depth looped Transformer trained on variable-length trajectories, using time/step-size conditioning and a shortc…
☆28 · Mar 28, 2026 · Updated 3 months ago
inspire-group / robustness-via-transport
☆12 · Sep 26, 2019 · Updated 6 years ago
RunxinXu / ContrastivePruning
Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》
☆25 · Dec 15, 2021 · Updated 4 years ago
pkunlp-icler / MLS
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13 · Apr 13, 2022 · Updated 4 years ago