BlinkDL / LM-Trick-Questions
Here we collect trick questions and failed tasks for open-source LLMs, to help improve them.
☆32 Updated last year
Alternatives and similar repositories for LM-Trick-Questions:
Users who are interested in LM-Trick-Questions are comparing it to the repositories listed below.
- Here we will test various linear attention designs. ☆58 Updated 9 months ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /… ☆40 Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆36 Updated last year
- This repo is based on https://github.com/jiaweizzhao/GaLore ☆24 Updated 5 months ago
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers. ☆45 Updated last year
- Implementation of the Mamba SSM with hf_integration. ☆56 Updated 5 months ago
- Utilities for Training Very Large Models ☆57 Updated 4 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆48 Updated 3 years ago
- Structural Pruning for LLaMA ☆54 Updated last year
- Un-*** 50 billions multimodality dataset ☆24 Updated 2 years ago
- HGRN2: Gated Linear RNNs with State Expansion ☆52 Updated 6 months ago
- RWKV model implementation ☆37 Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆55 Updated 9 months ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024) ☆24 Updated 8 months ago
- Tools for content datamining and NLP at scale ☆42 Updated 8 months ago
- GoldFinch and other hybrid transformer components ☆43 Updated 7 months ago
- Griffin MQA + Hawk Linear RNN Hybrid ☆85 Updated 9 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆33 Updated last year
- ☆18 Updated 8 months ago
- imagetokenizer is a Python package that helps you encode visuals and generate visual token ids from a codebook; it supports both image and video… ☆30 Updated 8 months ago
- Implementation of a Light Recurrent Unit in Pytorch ☆48 Updated 4 months ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention" ☆96 Updated 4 months ago
- RWKV-7: Surpassing GPT ☆79 Updated 3 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆44 Updated last week
- ☆99 Updated 11 months ago