luo-junyu / SemiEvol
SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation
☆51Updated 2 months ago
Alternatives and similar repositories for SemiEvol:
Users that are interested in SemiEvol are comparing it to the libraries listed below
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response☆40Updated 2 months ago
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆30Updated last month
- ☆30Updated 5 months ago
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache☆42Updated 6 months ago
- official implementation of paper SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training☆30Updated 2 months ago
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆35Updated 9 months ago
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆55Updated 2 months ago
- ☆63Updated 3 months ago
- Collecting personality-indicative data for role-playing agents.☆22Updated this week
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 6 months ago
- GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts (NeurIPS 2024)☆21Updated 4 months ago
- ☆36Updated last year
- [ICLR 2023] Official Tensorflow implementation of "Distributionally Robust Post-hoc Classifiers under Prior Shifts"☆34Updated last year
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆45Updated last year
- [COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆38Updated 2 months ago
- [ACL2024 Findings] Towards Better Question Generation in QA-Based Event Extraction☆43Updated last month
- (NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment☆73Updated this week
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆24Updated 11 months ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Updated last year
- This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tas…☆55Updated 5 months ago
- ☆35Updated last month
- [ICPR'24 Oral] Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery☆43Updated 7 months ago
- ☆43Updated 4 months ago
- Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning, IEEE Transactions on Multimedia (T-MM), 2022☆29Updated last year
- ☆12Updated last year
- Domain-Controlled Prompt Learning (AAAI2024)☆88Updated 3 months ago
- [AISTATS2021] Official implementation of "Sample Elicitation"☆29Updated 3 years ago
- Mixed precision inference by Tensorrt-LLM☆76Updated 3 months ago
- Single-thread, end-to-end C++ implementation of the Bitnet (1.58-bit weight) model☆12Updated 3 months ago