☆23Dec 18, 2024Updated last year
Alternatives and similar repositories for TxT360
Users who are interested in TxT360 are comparing it to the repositories listed below
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.☆109Apr 4, 2025Updated 10 months ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- Collection of RLxLM experiments using minimal code☆14Feb 17, 2025Updated last year
- ☆21Oct 13, 2021Updated 4 years ago
- ☆20Nov 4, 2025Updated 3 months ago
- ☆52Jun 6, 2024Updated last year
- ☆33Feb 15, 2026Updated 2 weeks ago
- Estimate MFU for DeepSeekV3☆26Jan 5, 2025Updated last year
- Source code for our AAAI'22 paper "From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression"☆25Dec 15, 2021Updated 4 years ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 6 months ago
- The LM Contamination Index is a manually created database of contamination evidence for LMs.☆82Apr 11, 2024Updated last year
- ☆12Oct 18, 2022Updated 3 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- OpenTelemetry layer for HTTP/gRPC services☆10Feb 23, 2026Updated last week
- ☆10Jul 16, 2023Updated 2 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- An (incomplete) overview of information extraction☆43Apr 28, 2022Updated 3 years ago
- Fully open reproduction of DeepSeek-R1☆12Mar 24, 2025Updated 11 months ago
- ☆11Dec 31, 2021Updated 4 years ago
- Collection of programming style guides used in Serokell☆12Jul 4, 2022Updated 3 years ago
- ☆11Nov 13, 2024Updated last year
- Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Feb 13, 2024Updated 2 years ago
- todo2 (a.k.a. todo or die) - A better todo! macro inspired by searls/todo_or_die☆11Feb 24, 2026Updated last week
- Version lock, cache, and run binaries from any GitHub Release assets. Pull in external tools and keep the versions in sync across your te…☆15Jan 3, 2024Updated 2 years ago
- ☆16Oct 2, 2022Updated 3 years ago
- ☆11Feb 23, 2024Updated 2 years ago
- PyTorch implementation of the ICML 2020 paper "On Learning Sets of Symmetric Elements" by Haggai Maron, Or Litany, Gal Chechik, Ethan Fet…☆10Apr 22, 2021Updated 4 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- "China Optics Valley · Huawei Cup" 19th China Post-Graduate Mathematical Contest in Modeling (2022)☆10Jul 9, 2023Updated 2 years ago
- ☆11Nov 21, 2022Updated 3 years ago
- MiniLM (BERT) embeddings from scratch☆18Aug 14, 2025Updated 6 months ago
- OTP generation & validation library for Rust☆14Dec 4, 2025Updated 2 months ago
- Code for our paper "Towards Principled Graph Transformers"☆13Oct 30, 2024Updated last year
- ☆10Mar 6, 2022Updated 3 years ago
- Repo containing a few notebooks on fine-tuning language models☆13Apr 29, 2024Updated last year
- The implementation of FedMix☆11Aug 18, 2022Updated 3 years ago
- A distributed execution framework built upon lunatic.☆16Jan 19, 2024Updated 2 years ago
- Install fonts on your system☆29Feb 4, 2026Updated 3 weeks ago