An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales
☆16Jun 6, 2024Updated 2 years ago
Alternatives and similar repositories for nanoLM
Users that are interested in nanoLM are comparing it to the libraries listed below.
- ☆18Sep 5, 2024Updated last year
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated 2 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate…☆33Mar 30, 2026Updated 2 months ago
- A family of efficient edge language models in 100M~1B sizes.☆19Feb 14, 2025Updated last year
- An implementation of the hammer2 filesystem for Plan 9☆19Nov 25, 2018Updated 7 years ago
- An algorithm that intelligently executes a crypto order over time via Coinbase☆13Oct 26, 2021Updated 4 years ago
- We systematically studied the factors that influence LLM-generated benchmarks. By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated last year
- Simple Application Sandboxing☆23Aug 9, 2024Updated last year
- ☆126May 26, 2026Updated 3 weeks ago
- readthedocs.org documentation for Inkplate boards☆10Aug 25, 2025Updated 9 months ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Causal Inference for Time Series Data (with CausalML Demo)☆14Jun 11, 2023Updated 3 years ago
- Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the …☆57Mar 9, 2024Updated 2 years ago
- Quickly deploy open-source large models on AutoDL: a deployment tutorial better suited to users in China☆20May 12, 2024Updated 2 years ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆61Oct 2, 2024Updated last year
- ☆24Nov 11, 2024Updated last year
- ☆11Aug 26, 2021Updated 4 years ago
- scrape, clean and model IPO data with supervised ML☆10Aug 20, 2020Updated 5 years ago
- A PyTorch wrapper of parallel exclusive scan in CUDA☆12May 25, 2023Updated 3 years ago
- ☆11Aug 10, 2024Updated last year
- The non-user-of-rawdraw-facing side of rawdraw.☆12Jan 12, 2021Updated 5 years ago
- A step-by-step walkthrough of implementing yolo-v3 in pytorch☆12Aug 10, 2018Updated 7 years ago
- Network Etiquette (Netiquette) -- Written with 2020 technology in mind☆10Nov 19, 2021Updated 4 years ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Training a BERT model from scratch.☆11Oct 15, 2023Updated 2 years ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib☆16Mar 13, 2025Updated last year
- This project combines logistic regression, gradient boosting, and LSTMs to predict next-month returns.☆13Sep 25, 2019Updated 6 years ago
- Minimalist RSS/Atom aggregator 📰☆23Oct 11, 2023Updated 2 years ago
- ☆10Sep 30, 2020Updated 5 years ago
- Acme style editing plugin for micro editor☆26Jun 27, 2024Updated last year
- Two-way sync between Valtio proxies and Yjs CRDTs☆22May 18, 2026Updated 3 weeks ago
- JAX implementations of RWKV☆18Sep 26, 2023Updated 2 years ago
- Go port to plan9/arm64☆18Mar 11, 2025Updated last year
- WebAssembly port of Plan9 (fourth edition) libraries, device drivers, file systems and Inferno kernel☆20Jan 30, 2023Updated 3 years ago
- Train delay statistics☆21Updated this week
- ☆17Apr 10, 2024Updated 2 years ago
- ☆28Nov 18, 2017Updated 8 years ago