[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
☆12Jan 26, 2025Updated last year
Alternatives and similar repositories for BMC
Users that are interested in BMC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains my reimplementation and improvement of DeepLOB model.☆32Apr 22, 2021Updated 4 years ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆37Aug 19, 2024Updated last year
- 使用MovieLens数据集实现了基于Auto Encoder(AE), Variational Auto Encoder(VAE), BERT的深度学习电影推荐系统☆77Dec 18, 2020Updated 5 years ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆118Jun 12, 2025Updated 9 months ago
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning☆136Nov 17, 2023Updated 2 years ago
- Use BiLSTM_attention, BERT, ALBERT, RoBERTa, XLNet model to classify the SST-2 data set based on pytorch☆109Dec 9, 2020Updated 5 years ago
- Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCN…☆78May 6, 2022Updated 3 years ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆212Feb 11, 2024Updated 2 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 10 months ago
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- ☆11May 28, 2024Updated last year
- ☆20Dec 14, 2024Updated last year
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆53Jan 5, 2026Updated 2 months ago
- ☆15Jan 12, 2026Updated 2 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- ☆10May 28, 2024Updated last year
- Code for Cached Long Short-Term Memory Neural Networks for Document-Level Sentiment Classification☆12Nov 22, 2017Updated 8 years ago
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities☆35Dec 14, 2022Updated 3 years ago
- ☆35May 16, 2025Updated 10 months ago
- Code for the paper Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation (CVPR 2023).☆34May 26, 2023Updated 2 years ago
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings☆11Mar 14, 2022Updated 4 years ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- Control LLM☆22Apr 6, 2025Updated 11 months ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Jun 23, 2018Updated 7 years ago
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"☆25Sep 11, 2024Updated last year
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 3 weeks ago
- ☆15Dec 23, 2024Updated last year
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Mar 1, 2024Updated 2 years ago
- [OS X] Set the views of the stars above you as your dynamic desktop wallpaper.☆14May 19, 2020Updated 5 years ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Nov 7, 2025Updated 4 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 11 months ago
- Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).☆16Oct 14, 2021Updated 4 years ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 9 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆56Jun 16, 2024Updated last year
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆58Nov 5, 2025Updated 4 months ago