An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales
☆16Jun 6, 2024Updated last year
Alternatives and similar repositories for nanoLM
Users that are interested in nanoLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- A family of efficient edge language models in 100M~1B sizes.☆18Feb 14, 2025Updated last year
- An implementation of the hammer2 filesystem for Plan 9☆19Nov 25, 2018Updated 7 years ago
- An algorithm that intelligently executes a crypto order over time via Coinbase☆13Oct 26, 2021Updated 4 years ago
- ☆18Mar 23, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 10 months ago
- Simple Application Sandboxing☆23Aug 9, 2024Updated last year
- ☆20Aug 21, 2020Updated 5 years ago
- I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform…☆12Apr 26, 2020Updated 5 years ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆21Apr 1, 2026Updated 2 weeks ago
- ☆16Apr 3, 2024Updated 2 years ago
- readthedocs.org documentation for Inkplate boards☆10Aug 25, 2025Updated 7 months ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Causal Inference for Time Series Data (with CausalML Demo)☆14Jun 11, 2023Updated 2 years ago
- Write your code as tree-like expressions, then transform it☆21Jan 9, 2024Updated 2 years ago
- ☆12Dec 13, 2023Updated 2 years ago
- 基于AutoDL快速部署开源大模型,更适合中国宝宝的部署教程☆18May 12, 2024Updated last year
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆60Oct 2, 2024Updated last year
- ☆22Nov 11, 2024Updated last year
- ☆11Aug 26, 2021Updated 4 years ago
- 2023 ABCI Llama-2 継続学習プロジェクト☆14Jan 22, 2024Updated 2 years ago
- scrape, clean and model IPO data with supervised ML☆10Aug 20, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A PyTorch wrapper of parallel exclusive scan in CUDA☆12May 25, 2023Updated 2 years ago
- A C-based checksec without readelf or grep dependance.☆11Apr 20, 2021Updated 4 years ago
- The non-user-of-rawdraw-facing side of rawdraw.☆12Jan 12, 2021Updated 5 years ago
- 一步步理解基于pytorch实现yolo-v3过程☆12Aug 10, 2018Updated 7 years ago
- ☆23Aug 7, 2023Updated 2 years ago
- ☆12Nov 6, 2023Updated 2 years ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Training a BERT model from scratch.☆11Oct 15, 2023Updated 2 years ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib☆17Mar 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Minimal Kréta client written in Python.☆11Oct 7, 2023Updated 2 years ago
- ☆31Mar 23, 2024Updated 2 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- this project is developing to crawl stock A finance and trade data from website, process finance and trade data to get factors, and then …☆17Jan 12, 2023Updated 3 years ago
- ☆10Sep 30, 2020Updated 5 years ago
- Acme style editing plugin for micro editor☆26Jun 27, 2024Updated last year
- [KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".☆30Jan 10, 2025Updated last year