Ledzy / StreamBPLinks
Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".
☆42Updated last week
Alternatives and similar repositories for StreamBP
Users that are interested in StreamBP are comparing it to the libraries listed below
Sorting:
- Hybrid Latent Reasoning via Reinforcement Learning☆120Updated 3 weeks ago
- Collecting personality-indicative data for role-playing agents.☆22Updated 4 months ago
- [COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆36Updated 3 months ago
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response☆41Updated 6 months ago
- Official Code of Logits-Based-Finetuning☆85Updated last week
- SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation☆55Updated last month
- An easy-to-use vector database.☆38Updated 2 months ago
- ☆44Updated 2 months ago
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆39Updated 4 months ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆31Updated this week
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 5 months ago
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆34Updated last month
- SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models☆40Updated 3 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆22Updated 6 months ago
- A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]☆117Updated last month
- ☆63Updated 2 weeks ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆33Updated 3 months ago
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration"☆28Updated last week
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…☆78Updated last year
- [AISTATS2021] Official implementation of "Sample Elicitation"☆29Updated 4 years ago
- ☆84Updated this week
- Implementation of RSGC-BD (Blur Detection)☆47Updated 9 months ago
- Our imbalance-aware ViT model achieves 0.91035 accuracy on the public leaderboard and 0.87750 on the private leaderboard of the ML2022Spr…☆19Updated 2 weeks ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Updated last year
- ACL 2024☆32Updated 9 months ago
- ☆29Updated last week
- [ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization☆17Updated last year
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆184Updated last month
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 10 months ago
- A unified system resource management platform designed for administrators, serving as the foundational module for the Angus application s…☆87Updated this week