SkunkworksAI / CodeFusion
☆15 Updated last year
Related projects
Alternatives and complementary repositories for CodeFusion
- ☆20 Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 Updated 7 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 Updated last year
- ☆22 Updated last year
- ☆38 Updated this week
- ☆72 Updated last year
- GPT-2 small trained on phi-like data ☆65 Updated 8 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 2 months ago
- ☆27 Updated last year
- Using multiple LLMs for ensemble Forecasting ☆16 Updated 9 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated 10 months ago
- ☆35 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- ☆64 Updated 5 months ago
- ☆62 Updated last month
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆42 Updated 7 months ago
- ☆28 Updated 2 weeks ago
- entropix style sampling + GUI ☆25 Updated last week
- Video+code lecture on building nanoGPT from scratch ☆64 Updated 4 months ago
- Modified Beam Search with periodical restart ☆12 Updated last month
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 Updated 10 months ago
- Code for Paper: Harnessing Webpage UIs For Text Rich Visual Understanding ☆37 Updated 3 weeks ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated 10 months ago
- look how they massacred my boy ☆53 Updated 3 weeks ago
- ☆48 Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆42 Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 Updated last year
- Experimental sampler to make LLMs more creative ☆30 Updated last year