google-deepmind / calm
☆32Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for calm
- ☆24Updated 5 months ago
- ☆22Updated last month
- ☆26Updated 8 months ago
- RWKV-7: Surpassing GPT☆45Updated this week
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆36Updated 3 weeks ago
- Modified Beam Search with periodical restart☆12Updated 2 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆30Updated last month
- ☆31Updated 10 months ago
- Collection of autoregressive model implementation☆67Updated this week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆113Updated 3 weeks ago
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging☆58Updated last month
- ☆18Updated 10 months ago
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆19Updated 2 months ago
- ☆27Updated last year
- ☆41Updated 2 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 weeks ago
- ☆27Updated 3 months ago
- ☆30Updated 11 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- ☆32Updated last year
- ☆62Updated last month
- ☆56Updated 6 months ago
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 5 months ago
- ☆21Updated 5 months ago
- Generate Stunning Images and Craft Visual Stories for your Brand☆12Updated 3 weeks ago
- ☆34Updated 6 months ago
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'☆41Updated 3 months ago