francoisfleuret / dlcLinks

☆41

Alternatives and similar repositories for dlc

Users that are interested in dlc are comparing it to the libraries listed below

Sorting:

kmohan321 / Research_Papers
☆46Updated 3 months ago
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆67Updated 5 months ago
kvfrans / lmpo
☆83Updated last week
srush / Tensor-Puzzles-Penzai
☆20Updated last year
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68Updated 2 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 4 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆65Updated 3 weeks ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 5 months ago
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆33Updated 8 months ago
yash-srivastava19 / arrakis
Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.
☆31Updated 2 months ago
keyonvafa / world-model-evaluation
☆59Updated 8 months ago
kanpuriyanawab / minbpe.c
a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.
☆21Updated last year
eemlcommunity / PracticalSessions2023
☆65Updated 2 years ago
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆87Updated 2 weeks ago
Quentin-Anthony / torch-profiling-tutorial
☆323Updated this week
Amplify-Partners / annotation-reading-list
A reading list of relevant papers and projects on foundation model annotation
☆27Updated 4 months ago
LeonGuertler / UnstableBaselines
☆90Updated this week
AI-Hypercomputer / RecML
☆186Updated this week
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆63Updated last month
brantondemoss / GrokkingComplexity
Code for
☆27Updated 7 months ago
vandijklab / Intelligence_at_the_edge_of_chaos
☆54Updated 4 months ago
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆140Updated last month
AniruddhaChattopadhyay / Books
☆161Updated 3 weeks ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138Updated last year
okarthikb / state-space-models
☆27Updated last year
damek / STAT-4830
Official Github for Wharton STAT 4830
☆40Updated last month
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆91Updated 4 months ago
google-deepmind / transformer_ngrams
☆23Updated 8 months ago
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28Updated 2 weeks ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 7 months ago