☆62Apr 17, 2026Updated this week
Alternatives and similar repositories for llm_trainer
Users who are interested in llm_trainer are comparing it to the libraries listed below.
- LLM implementation in PyTorch with support for MoE and RoPE☆65Updated this week
- Official implementation for the paper "Sample-Then-Optimize Batch Neural Thompson Sampling", published at NeurIPS 2022.☆10Oct 13, 2022Updated 3 years ago
- Official implementation for the paper "Quantum Bayesian Optimization" accepted to NeurIPS 2023.☆12Jan 7, 2024Updated 2 years ago
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆24May 29, 2025Updated 10 months ago
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Code for the NeurIPS 2020 paper: "Federated Bayesian Optimization via Thompson Sampling"☆26Nov 6, 2020Updated 5 years ago
- Natural Language to Overpass Query Language☆30Mar 14, 2024Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Yanjie — An English Speaking Learning Assistant Based on InternLM, aims to break the traditional interaction boundaries and eliminate th…☆16Oct 9, 2024Updated last year
- ☆27Dec 11, 2025Updated 4 months ago
- MeloTTS demo on Axera☆12Nov 18, 2025Updated 5 months ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆22Oct 14, 2025Updated 6 months ago
- Experiments with reasoning models, training techniques, papers☆28Apr 12, 2026Updated last week
- ☆51Dec 10, 2025Updated 4 months ago
- Java implementation of the BERT tokenizer; supports outputting ONNX tensors for ONNX model inference☆13Sep 4, 2023Updated 2 years ago
- [COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"☆55Oct 6, 2025Updated 6 months ago
- ☆15Jun 22, 2025Updated 9 months ago
- ☆25Mar 8, 2026Updated last month
- The implementation of Text Classification with Negative Supervision (ACL, 2020)☆10Oct 8, 2020Updated 5 years ago
- ☆10Jan 12, 2024Updated 2 years ago
- MetaSearch: an implementation of LLM deep research (deepsearch) functionality☆34Aug 21, 2025Updated 7 months ago
- Tokenize Chinese corpora with SentencePiece☆13Nov 30, 2023Updated 2 years ago
- Run AI models locally and offline on your machine☆11May 8, 2025Updated 11 months ago
- Experimental syslog template mining module☆11Aug 29, 2016Updated 9 years ago
- The official repo of "Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives"☆48Aug 7, 2025Updated 8 months ago
- ☆17Jan 31, 2025Updated last year
- Methods and experiments for assumed density SDE approximations☆12Jan 26, 2022Updated 4 years ago
- Taylor moment expansion in Python (JaX and SymPy) and Matlab☆11Nov 26, 2024Updated last year
- [ICLR 2022] Denoising Likelihood Score Matching for Conditional Score-based Data Generation☆11Jan 2, 2025Updated last year
- Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024)☆16Jul 4, 2024Updated last year
- Overlapping Reads COmpression with Minimizers☆16May 19, 2022Updated 3 years ago
- STRODE: Stochastic Boundary Ordinary Differential Equation☆13Jul 20, 2021Updated 4 years ago
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Apr 27, 2020Updated 5 years ago
- The official repository for AdaMuon☆37Aug 27, 2025Updated 7 months ago
- A lightweight Transformer training & inference framework☆47Apr 10, 2026Updated last week
- An AI assistant for "Black Myth: Wukong" based on InterLM; learn more about the story behind the game (video update in progress)☆36Jan 4, 2025Updated last year
- ☆16May 12, 2023Updated 2 years ago
- IPLoM (Iterative Partitioning Log Mining) - Java☆15Mar 13, 2016Updated 10 years ago