Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆50Jun 30, 2025Updated 8 months ago
Alternatives and similar repositories for MeCo
Users that are interested in MeCo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆44Oct 13, 2023Updated 2 years ago
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 5 months ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆38Jan 20, 2026Updated 2 months ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- ☆229Oct 27, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code and data from the paper 'Human Feedback is not Gold Standard'☆20Mar 6, 2026Updated 3 weeks ago
- Long Context Research☆31Jan 26, 2026Updated 2 months ago
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 4 months ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆62Mar 4, 2025Updated last year
- Explanation Optimization☆13Oct 16, 2020Updated 5 years ago
- a fast implementation of BM25☆10Sep 15, 2022Updated 3 years ago
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022☆10Jan 6, 2023Updated 3 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The SAIL-VL2 series model developed by the BytedanceDouyinContent Group☆76Sep 18, 2025Updated 6 months ago
- ☆24Jul 24, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- ☆124Feb 21, 2025Updated last year
- Machine learning project using federated learning for text generation☆11May 5, 2024Updated last year
- ☆13Nov 26, 2021Updated 4 years ago
- ☆18May 15, 2021Updated 4 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆23Mar 31, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆19Jun 10, 2024Updated last year
- some mixture of experts architecture implementations☆26Mar 22, 2024Updated 2 years ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆223Jul 25, 2025Updated 8 months ago
- SuperCLUE高考作文机器自动阅卷系统☆18Jun 8, 2023Updated 2 years ago
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 4 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- ☆48Jun 8, 2020Updated 5 years ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 9 months ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval☆16Mar 1, 2022Updated 4 years ago
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- ☆27Jul 9, 2024Updated last year
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (https://huggingface.co/papers…☆90Nov 23, 2025Updated 4 months ago
- 智慧科研辅助平台☆28Nov 4, 2024Updated last year
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Dec 23, 2021Updated 4 years ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year