MLGroupJLU / RWKV-Survey
The official GitHub page for the survey paper "A Survey of RWKV".
☆12Updated 2 weeks ago
Alternatives and similar repositories for RWKV-Survey:
Users that are interested in RWKV-Survey are comparing it to the libraries listed below
- this is for fun, ain't it grand!☆12Updated 8 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 3 months ago
- ☆15Updated 5 months ago
- Efficient Scaling laws and collaborative pretraining.☆13Updated 2 months ago
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆24Updated 9 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 10 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated 6 months ago
- Minimum Description Length probing for neural network representations☆18Updated this week
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- ☆16Updated 6 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆27Updated 6 months ago
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- Repository for Skill Set Optimization☆12Updated 5 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆56Updated last month
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- The repository contains code for Adaptive Data Optimization☆21Updated last month
- Official repo of Progressive Data Expansion: data, code and evaluation☆27Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆64Updated 5 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆31Updated 11 months ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆20Updated 5 months ago
- Lottery Ticket Adaptation☆37Updated 2 months ago
- ☆15Updated 2 months ago
- ☆9Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆37Updated last month
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆20Updated 11 months ago
- ☆25Updated 9 months ago
- ☆16Updated 2 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆19Updated 10 months ago
- PyTorch implementation for MRL☆18Updated 10 months ago
- Self-Supervised Alignment with Mutual Information☆16Updated 7 months ago