dingo-actual / om
An LLM architecture utilizing a recurrent structure and multi-layer memory
☆12Updated last month
Alternatives and similar repositories for om:
Users that are interested in om are comparing it to the libraries listed below
- ☆100Updated 2 months ago
- ☆38Updated 7 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆165Updated this week
- ☆49Updated 11 months ago
- ☆97Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 3 months ago
- ☆42Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 4 months ago
- An introduction to LLM Sampling☆75Updated 2 months ago
- Long context evaluation for large language models☆200Updated last week
- DeMo: Decoupled Momentum Optimization☆181Updated 3 months ago
- Collection of autoregressive model implementation☆81Updated 2 weeks ago
- ☆51Updated 6 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 4 months ago
- ☆122Updated 2 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆131Updated last week
- Retro styled terminal shell☆26Updated 9 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last year
- ☆21Updated 2 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- look how they massacred my boy☆63Updated 4 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 5 months ago
- ☆53Updated last year
- Measuring the situational awareness of language models☆34Updated last year
- This repo is based on https://github.com/jiaweizzhao/GaLore☆25Updated 5 months ago
- ☆65Updated 9 months ago