yuxdux / kinda-llama
An open-source replication and extension of the Meta AI's LLAMA dataset
☆24Updated last year
Related projects ⓘ
Alternatives and complementary repositories for kinda-llama
- Reimplementation of the task generation part from the Alpaca paper☆118Updated last year
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆63Updated last year
- Experiments with generating opensource language model assistants☆97Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆105Updated last week
- Experimental sampler to make LLMs more creative☆30Updated last year
- ☆49Updated 7 months ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆70Updated last year
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- ☆40Updated last year
- LLM sampling method for enforcing syntax adherence in generated output☆21Updated last year
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆105Updated last year
- Code repository for the c-BTM paper☆105Updated last year
- Framework agnostic python runtime for RWKV models☆145Updated last year
- ☆34Updated last year
- ☆22Updated last year
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆61Updated last year
- One stop shop for all things carp☆58Updated 2 years ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated 10 months ago
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- GPT-2 small trained on phi-like data☆65Updated 8 months ago
- ☆72Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆102Updated last year