cofe-ai/nanoLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cofe-ai/nanoLM)

cofe-ai / nanoLM

An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales

☆16

Alternatives and similar repositories for nanoLM

Users that are interested in nanoLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cofe-ai / MSG
View on GitHub
Masked Structural Growth for 2x Faster Language Model Pre-training
☆25Apr 28, 2024Updated 2 years ago
catid / lllm
View on GitHub
Latent Large Language Models
☆19Aug 24, 2024Updated last year
davisyoshida / jax-gptq
View on GitHub
JAX implementation of GPTQ quantization algorithm
☆10Jul 19, 2023Updated 2 years ago
Xingrun-Xing2 / EfficientLLM
View on GitHub
A family of efficient edge language models in 100M~1B sizes.
☆19Feb 14, 2025Updated last year
NikhilSehgal123 / coinbase-execution-algorithm
View on GitHub
An algorithm that intelligently executes a crypto order over time via Coinbase
☆13Oct 26, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Zyh716 / WSDM2022-C2CRS
View on GitHub
☆18Mar 23, 2022Updated 4 years ago
listentm / CROWDSELECT
View on GitHub
We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…
☆20May 20, 2025Updated last year
Mnehmos / mnehmos.rpg.mcp
View on GitHub
☆33Apr 20, 2026Updated last month
duane1024 / l123
View on GitHub
☆123May 20, 2026Updated last week
SolderedElectronics / Inkplate-documentation
View on GitHub
readthedocs.org documentation for Inkplate boards
☆10Aug 25, 2025Updated 9 months ago
leokhoa / Open-DocLLM
View on GitHub
☆16Apr 3, 2024Updated 2 years ago
proger / nanokitchen
View on GitHub
Parallel Associative Scan for Language Models
☆18Jan 8, 2024Updated 2 years ago
recursal / minmodmon
View on GitHub
Mini Model Daemon
☆13Nov 9, 2024Updated last year
michaelnny / InstructLLaMA
View on GitHub
Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the …
☆57Mar 9, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
LeeeeoLiu / LLM-CRS
View on GitHub
☆12Dec 13, 2023Updated 2 years ago
shiqinghuayi19 / LLMforEvent
View on GitHub
This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"
☆10Feb 16, 2024Updated 2 years ago
ryoungj / ObsScaling
View on GitHub
[NeurIPS'24 Spotlight] Observational Scaling Laws
☆61Oct 2, 2024Updated last year
rioyokotalab / Megatron-Llama2
View on GitHub
2023 ABCI Llama-2 継続学習プロジェクト
☆14Jan 22, 2024Updated 2 years ago
lxxue / prefix_sum
View on GitHub
A PyTorch wrapper of parallel exclusive scan in CUDA
☆12May 25, 2023Updated 3 years ago
GURPREETKAURJETHRA / LLM-based-Finance-Agent
View on GitHub
An intelligent agent utilizing Large Language Models (LLMs) for automated financial news retrieval and stock price prediction.
☆22Sep 9, 2024Updated last year
lockedbyte / protcheck
View on GitHub
A C-based checksec without readelf or grep dependance.
☆11Apr 20, 2021Updated 5 years ago
SnoopX-AI / Awesome-Weak-to-Strong-Generalization
View on GitHub
☆11Aug 10, 2024Updated last year
ml-gde / jaxgarden
View on GitHub
A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax
☆24Jun 8, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
alasdairforsythe / capcode
View on GitHub
Lossless normalization of uppercase characters
☆11Jul 3, 2023Updated 2 years ago
tongzhou21 / Oasis
View on GitHub
☆23Aug 7, 2023Updated 2 years ago
npk48 / rwkv_cuda
View on GitHub
☆11Jul 23, 2023Updated 2 years ago
BryanLunduke / Netiquette2020
View on GitHub
Network Etiquette (Netiquette) -- Written with 2020 technology in mind
☆10Nov 19, 2021Updated 4 years ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
View on GitHub
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11Mar 1, 2024Updated 2 years ago
antonio-f / BERT_from_scratch
View on GitHub
Training a BERT model from scratch.
☆11Oct 15, 2023Updated 2 years ago
ArthurLeoM / peft-givens
View on GitHub
source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib
☆17Mar 13, 2025Updated last year
hatemr / quantitative-trading-project
View on GitHub
This project combines logistic regression, gradient boosting, and LSTMs to predict next-month returns.
☆13Sep 25, 2019Updated 6 years ago
RUCAIBox / ChainLM
View on GitHub
☆31Mar 23, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lampts / chatgpt-mle-interview
View on GitHub
ChatGPT solutions for the MLE interview
☆14Dec 9, 2022Updated 3 years ago
ww9 / news
View on GitHub
Minimalist RSS/Atom aggregator 📰
☆23Oct 11, 2023Updated 2 years ago
Sid-darthvader / DoWhy-The-Causal-Story-Behind-Hotel-Booking-Cancellations
View on GitHub
☆10Sep 30, 2020Updated 5 years ago
xxuejie / micro-acme
View on GitHub
Acme style editing plugin for micro editor
☆26Jun 27, 2024Updated last year
psilva261 / go-arm64.plan9
View on GitHub
Go port to plan9/arm64
☆18Mar 11, 2025Updated last year
omarWafaay / MathFormApp
View on GitHub
Application for Math formula detection in image/pdf and then recognition
☆12Jan 14, 2025Updated last year
xphung / plan9_webasm
View on GitHub
WebAssembly port of Plan9 (fourth edition) libraries, device drivers, file systems and Inferno kernel
☆20Jan 30, 2023Updated 3 years ago