jn2clark / articlesLinks
☆13Updated 2 years ago
Alternatives and similar repositories for articles
Users that are interested in articles are comparing it to the libraries listed below
Sorting:
- ☆213Updated last week
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- Highly commented implementations of Transformers in PyTorch☆139Updated 2 years ago
- ☆94Updated 2 years ago
- ☆160Updated last year
- ☆210Updated 5 months ago
- ☆144Updated 2 years ago
- Inference code for Persimmon-8B☆412Updated 2 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- ☆112Updated 2 years ago
- Convert all of libgen to high quality markdown☆254Updated 2 years ago
- Helpers and such for working with Lambda Cloud☆51Updated 2 years ago
- ☆157Updated 2 years ago
- run paligemma in real time☆133Updated last year
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL.☆40Updated this week
- This repository contain the simple llama3 implementation in pure jax.☆70Updated 10 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆182Updated last month
- Full finetuning of large language models without large memory requirements☆94Updated 2 months ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆158Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen☆104Updated 2 years ago
- a small code base for training large models☆315Updated 7 months ago
- A miniature version of Modal☆21Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆219Updated last year
- ☆22Updated 2 years ago
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆296Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆63Updated this week
- Small finetuned LLMs for a diverse set of useful tasks☆127Updated 2 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆355Updated last year
- ☆31Updated last year
- ☆198Updated last year