mlpod / OpenSFT
☆45 Updated 2 months ago
Alternatives and similar repositories for OpenSFT
Users who are interested in OpenSFT are comparing it to the libraries listed below
- ☆101 Updated 3 weeks ago
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning ☆50 Updated 2 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization ☆59 Updated 3 months ago
- ☆318 Updated 3 months ago
- R-KV: Redundancy-aware KV Cache Compression for Reasoning Models ☆530 Updated last week
- [ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment ☆102 Updated this week
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL ☆184 Updated last month
- Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory. ☆66 Updated 10 months ago
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM ☆57 Updated 3 months ago
- ☆204 Updated 2 months ago
- The code for COLING2022 paper: "TSAM: A Two-Stream Attention Model for Causal Emotion Entailment" ☆26 Updated last year
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering" ☆20 Updated 2 weeks ago
- ☆62 Updated 3 months ago
- Deploy Qwen1.5 with the vLLM framework and stream its output ☆89 Updated last year
- ☆14 Updated 2 years ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions ☆173 Updated 7 months ago
- Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning" ☆28 Updated 2 weeks ago
- ☆208 Updated 3 weeks ago
- A clean and extensible agentic RAG system with modular implementation. ☆103 Updated last month
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c… ☆94 Updated 3 months ago
- [ACL 2024] Knowledge Fusion by Evolving Weights of Language Models ☆37 Updated 9 months ago
- ☆422 Updated 9 months ago
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs ☆157 Updated 3 months ago
- PyTorch implementation of <Neural-based Mixture Probabilistic Query Embedding for Answering FOL queries on Knowledge Graphs> ☆33 Updated 2 years ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging ☆136 Updated 3 months ago
- Completed this competition in collaboration with Jiang Yan (https://github.com/jy1993) and Guan Shuicheng (https://github.com/guanshuicheng… ☆367 Updated 7 months ago
- OpenAI API Cost Tracker ☆20 Updated last year
- ☆38 Updated 2 months ago
- ☆145 Updated last year
- MTLA: Multi-head Temporal Latent Attention ☆230 Updated this week