[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models
☆212Feb 11, 2024Updated 2 years ago
Alternatives and similar repositories for Lion
Users that are interested in Lion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- This repo contains my reimplementation and improvement of DeepLOB model.☆31Apr 22, 2021Updated 4 years ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆37Aug 19, 2024Updated last year
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆118Jun 12, 2025Updated 10 months ago
- 使用MovieLens数据集实现了基于Auto Encoder(AE), Variational Auto Encoder(VAE), BERT的深度学习电影推荐系统☆77Dec 18, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆587Sep 7, 2023Updated 2 years ago
- ☆18Mar 3, 2025Updated last year
- Reinforcement learning (RL) is an effective method to find reasoning pathways in incomplete knowledge graphs (KGs). To overcome the chall…☆26Oct 13, 2024Updated last year
- Use BiLSTM_attention, BERT, ALBERT, RoBERTa, XLNet model to classify the SST-2 data set based on pytorch☆109Dec 9, 2020Updated 5 years ago
- Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCN…☆78May 6, 2022Updated 3 years ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆132Jun 18, 2023Updated 2 years ago
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning☆136Nov 17, 2023Updated 2 years ago
- Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)☆40Aug 28, 2023Updated 2 years ago
- ☆10Jul 24, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NIPS2023] RRHF & Wombat☆808Sep 22, 2023Updated 2 years ago
- ☆11Feb 3, 2025Updated last year
- ☆45Aug 23, 2023Updated 2 years ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,798Dec 12, 2023Updated 2 years ago
- ☆22Feb 4, 2026Updated 2 months ago
- ☆11May 11, 2022Updated 3 years ago
- Repo - Paper "Capturing Semantics for Imputation with Pre-trained Language Models." [ICDE 2021]☆10Mar 13, 2022Updated 4 years ago
- Code for the paper Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation (CVPR 2023).☆34May 26, 2023Updated 2 years ago
- Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation☆73Sep 18, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MSTI☆16Mar 6, 2024Updated 2 years ago
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 9 months ago
- Aligning pretrained language models with instruction data generated by themselves.☆4,586Mar 27, 2023Updated 3 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,285Oct 16, 2024Updated last year
- Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization☆10Mar 7, 2022Updated 4 years ago
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework☆12May 7, 2025Updated 11 months ago
- ☆314Jun 9, 2024Updated last year
- Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca☆4,128Apr 18, 2025Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆842Jul 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,671Jul 18, 2024Updated last year
- ☆14Nov 29, 2023Updated 2 years ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆302May 31, 2023Updated 2 years ago
- ☆295Dec 20, 2023Updated 2 years ago
- Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集☆3,051Apr 14, 2024Updated 2 years ago
- ☆28Jul 11, 2024Updated last year
- Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method ; GKD: A General Knowledge Distillation…☆33Aug 4, 2023Updated 2 years ago