Gunale0926 / SORSA
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models
☆40 Updated last month
Alternatives and similar repositories for SORSA:
Users interested in SORSA are comparing it to the libraries listed below:
- [AAAI 2025] Code for paper: Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation ☆2 Updated 3 months ago
- ☆31 Updated 6 months ago
- SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation ☆52 Updated 4 months ago
- Collecting personality-indicative data for role-playing agents. ☆21 Updated last month
- An official project related to the paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene … ☆21 Updated last year
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge… ☆39 Updated 2 months ago
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response ☆40 Updated 3 months ago
- [COLING Demos 2025] An easy-to-use tool for comprehensive response evaluation of LLMs ☆37 Updated last month
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache ☆42 Updated 8 months ago
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data ☆39 Updated 8 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving ☆21 Updated 4 months ago
- [ICLR 2025] Improving Data Efficiency via Curating LLM-Driven Rating Systems ☆48 Updated 3 weeks ago
- An easy-to-use vector database. ☆36 Updated 2 weeks ago
- ☆47 Updated 6 months ago
- This tool (enhance_long) aims to enhance the LlaMa2 long-context extrapolation capability with the lowest-cost approach, preferably without … ☆45 Updated last year
- ☆36 Updated last year
- [ICLR 2023] Official TensorFlow implementation of "Distributionally Robust Post-hoc Classifiers under Prior Shifts" ☆34 Updated last year
- Code for Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne… ☆24 Updated last year
- [NeurIPS 2023] On Sparse Modern Hopfield Model ☆50 Updated last year
- Official implementation of the paper SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training ☆35 Updated 4 months ago
- Single-thread, end-to-end C++ implementation of the Bitnet (1.58-bit weight) model ☆12 Updated 5 months ago
- ☆12 Updated last year
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs. ☆34 Updated 11 months ago
- Official Implementation of "Pay Attention to What You Need" ☆42 Updated last month
- ☆35 Updated 3 months ago
- [AISTATS 2021] Official implementation of "Sample Elicitation" ☆29 Updated 4 years ago
- Implementation of RSGC-BD (Blur Detection) ☆47 Updated 7 months ago
- This repo contains my custom-styled Python-based plots for NLP papers, and includes my reproductions of my favourite papers' plots ☆39 Updated last year
- A collection of papers related to knowledge fusion ☆54 Updated 6 months ago
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni… ☆40 Updated last year