mlpod / OpenSFT
☆45Updated last month
Alternatives and similar repositories for OpenSFT:
Users who are interested in OpenSFT compare it to the libraries listed below.
- SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing☆139Updated last month
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆48Updated 3 weeks ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆57Updated 2 months ago
- ☆421Updated 8 months ago
- ☆135Updated last month
- ☆57Updated last month
- [ACL 2024] Knowledge Fusion by Evolving Weights of Language Models☆37Updated 7 months ago
- ☆318Updated last month
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆134Updated last month
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM☆57Updated 2 months ago
- A clean and extensible agentic RAG system with modular implementation.☆95Updated last week
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation☆47Updated 2 weeks ago
- The code for COLING2022 paper: 《TSAM: A Two-Stream Attention Model for Causal Emotion Entailment》☆26Updated last year
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆173Updated 6 months ago
- PyTorch implementation of <Neural-based Mixture Probabilistic Query Embedding for Answering FOL queries on Knowledge Graphs>☆33Updated last year
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs☆156Updated last month
- Completed this competition in collaboration with Jiang Yan(https://github.com/jy1993) and Guan Shuicheng(https://github.com/guanshuicheng…☆368Updated 6 months ago
- ☆207Updated last month
- Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆17Updated 2 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆48Updated 9 months ago
- ☆145Updated last year
- Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate (NeurIPS 2024)☆31Updated last year
- JittorGeometric is a Jittor-based graph machine learning library.☆154Updated last week
- ☆14Updated 2 years ago
- OpenAI API Cost Tracker☆20Updated last year
- BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?☆568Updated last week
- Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory.☆66Updated 9 months ago
- MATEval is the first multi-agent framework simulating human collaborative discussion for open-ended text evaluation.☆27Updated 2 months ago
- Joint entity and relation extraction☆184Updated last year
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆308Updated 3 months ago