RobertRiachi / nanoPALM
☆143Updated last year
Alternatives and similar repositories for nanoPALM:
Users who are interested in nanoPALM are comparing it to the libraries listed below
- An interactive exploration of Transformer programming.☆259Updated last year
- Helpers and such for working with Lambda Cloud☆52Updated last year
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆153Updated last year
- git extension for {collaborative, communal, continual} model development☆208Updated 3 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last year
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆165Updated last week
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆201Updated 3 months ago
- Simple Transformer in Jax☆136Updated 8 months ago
- Comprehensive analysis of the difference in performance between QLoRA, LoRA, and full fine-tunes.☆82Updated last year
- Use context-free grammars with an LLM☆168Updated 11 months ago
- A puzzle to learn about prompting☆124Updated last year
- ☆153Updated 2 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆343Updated 7 months ago
- ☆92Updated last year
- AI sends pull requests for features you request in natural language☆113Updated last year
- Resources from the EleutherAI Math Reading Group☆53Updated last week
- ☆212Updated 7 months ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆306Updated 2 years ago
- ☆75Updated 8 months ago
- Drive a browser with Cohere☆72Updated last year
- Automatic gradient descent☆207Updated last year
- ☆22Updated last year
- ☆165Updated 2 years ago
- Prompt programming with FMs.☆440Updated 7 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆186Updated 10 months ago
- ☆60Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆215Updated 11 months ago
- Train very large language models in Jax.☆203Updated last year