Thinklab-SJTU / BiLAF
Official implementation of our NeurIPS 2024 paper "Boundary Matters: A Bi-Level Active Finetuning Method"
☆11 Updated 3 weeks ago
Alternatives and similar repositories for BiLAF:
Users who are interested in BiLAF are comparing it to the repositories listed below
- [NeurIPS 2024 Datasets and Benchmarks Track] Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization Regime ☆18 Updated this week
- [SIGKDD 2023] HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline ☆20 Updated last year
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity" ☆17 Updated last year
- NeurIPS'22 Oral: EquiVSet - Learning Neural Set Functions Under the Optimal Subset Oracle ☆18 Updated 2 years ago
- [IJCAI 2023] Black-box Prompt Tuning for Vision-Language Model as a Service ☆15 Updated last year
- Awesome papers in machine learning theory ☆11 Updated 3 years ago
- ☆11 Updated 11 months ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models ☆18 Updated 9 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆46 Updated 3 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆12 Updated 8 months ago
- Exploring whether LLMs perform case-based or rule-based reasoning ☆28 Updated last year
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆49 Updated 4 months ago
- Summarizing Mean Review Score for All Submissions for a Conference hosted on Openreview ☆22 Updated last year
- [NeurIPS 2022 Spotlight] Improving Generative Adversarial Networks via Adversarial Learning in Latent Space ☆17 Updated 2 years ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆18 Updated last year
- This is the project for IRM methods ☆12 Updated 3 years ago
- Official code for ICLR 2023 paper "ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond" ☆34 Updated last year
- Code for GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training ☆24 Updated last month
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal… ☆48 Updated 2 years ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆34 Updated last month
- ☆18 Updated 4 months ago
- Some examples of drawing illustration plots for papers using the seaborn package ☆14 Updated 5 years ago
- ☆11 Updated 6 months ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better ☆14 Updated 2 weeks ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆30 Updated 4 months ago
- Codes for Merging Large Language Models ☆29 Updated 6 months ago