dominiquegarmier / grok-pytorch
pytorch implementation of grok
☆11Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for grok-pytorch
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 months ago
- LLM reads a paper and produce a working prototype☆36Updated 2 weeks ago
- ☆22Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆27Updated last year
- Collection of autoregressive model implementation☆67Updated this week
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- ☆24Updated last year
- Low-Rank Adaptation of Large Language Models clean implementation☆9Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆18Updated last year
- Finetune any model on HF in less than 30 seconds☆56Updated 2 weeks ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆40Updated this week
- ☆41Updated 2 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆85Updated 2 months ago
- ☆32Updated 10 months ago
- ☆48Updated last year
- Build Agentic workflows with function calling☆20Updated last week
- ☆12Updated last month
- ☆57Updated 11 months ago
- RWKV-7: Surpassing GPT☆50Updated last week
- ☆22Updated 6 months ago
- ☆104Updated 8 months ago
- Github repo for Peifeng's internship project☆12Updated last year
- ☆62Updated 2 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated 11 months ago
- The Next Generation Multi-Modality Superintelligence☆70Updated 2 months ago
- Hugging Face Deep RL Class notes☆10Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆37Updated 7 months ago