Reverse Instructions to generate instruction tuning data with corpus examples
☆215 · Mar 5, 2024 · Updated 2 years ago
Alternatives and similar repositories for LongForm
Users that are interested in LongForm are comparing it to the libraries listed below.
- Search the biomedical literature for protein interactions and protein associations ☆11 · Nov 24, 2023 · Updated 2 years ago
- Alpaca dataset from Stanford, cleaned and curated ☆1,585 · Mar 7, 2026 · Updated last month
- Reimplementation of the task generation part from the Alpaca paper ☆119 · Apr 4, 2023 · Updated 3 years ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- Instruction Tuning with GPT-4 ☆4,335 · Jun 11, 2023 · Updated 2 years ago
- ☆1,561 · Updated this week
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning ☆30 · Jan 25, 2023 · Updated 3 years ago
- Codes for NAACL 2021 paper 'Noisy Self-Knowledge Distillation for Text Summarization' ☆24 · Jul 27, 2021 · Updated 4 years ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch ☆231 · Sep 6, 2024 · Updated last year
- Trying to deconstruct RWKV in understandable terms ☆14 · May 6, 2023 · Updated 2 years ago
- ☆21 · Dec 5, 2022 · Updated 3 years ago
- Node.js implementation binding for the RWKV.cpp module ☆21 · Aug 2, 2023 · Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 · Oct 1, 2025 · Updated 6 months ago
- [COLING 2024] SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity ☆13 · May 8, 2024 · Updated last year
- Easily deploy your rwkv model ☆19 · May 5, 2023 · Updated 2 years ago
- Scaling Data-Constrained Language Models ☆343 · Jun 28, 2025 · Updated 9 months ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆822 · May 6, 2023 · Updated 2 years ago
- Aligning pretrained language models with instruction data generated by themselves. ☆4,586 · Mar 27, 2023 · Updated 3 years ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆552 · Mar 10, 2024 · Updated 2 years ago
- Knowledge Infused Decoding ☆70 · Dec 31, 2023 · Updated 2 years ago
- This repository provides a framework to serve LLM (Large Language Model) based applications such as chatbots. ☆18 · Apr 20, 2023 · Updated 2 years ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆117 · Jun 28, 2025 · Updated 9 months ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 · Nov 19, 2023 · Updated 2 years ago
- ☆315 · Jun 9, 2024 · Updated last year
- Data cleaning and curation for unstructured text ☆329 · Aug 6, 2024 · Updated last year
- Long-context pretrained encoder-decoder models ☆96 · Oct 28, 2022 · Updated 3 years ago
- ☆181 · Feb 23, 2023 · Updated 3 years ago
- An official codebase for the paper "CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos" (ICCV 23) ☆52 · Aug 13, 2023 · Updated 2 years ago
- ☆10 · Feb 6, 2025 · Updated last year
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers. ☆14 · Jun 3, 2023 · Updated 2 years ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021). ☆29 · Dec 31, 2021 · Updated 4 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Mar 12, 2024 · Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆98 · Apr 26, 2023 · Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆34 · Jul 25, 2023 · Updated 2 years ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆497 · Mar 19, 2024 · Updated 2 years ago
- RWKV godot interface module ☆61 · Jun 13, 2024 · Updated last year
- Calculating Expected Time for training LLM. ☆39 · Apr 17, 2023 · Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆116 · Mar 22, 2023 · Updated 3 years ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo… ☆12 · Dec 1, 2021 · Updated 4 years ago