ishmael233 / LLM4OPTLinks
A collection of LLMs for optimization, including modeling and solving
☆16Updated last week
Alternatives and similar repositories for LLM4OPT
Users that are interested in LLM4OPT are comparing it to the libraries listed below
Sorting:
- OptiBench and ReSocratic Synthesis Method☆26Updated 6 months ago
- ☆26Updated 10 months ago
- ☆58Updated 9 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆10Updated last year
- A Survey of Personalization: From RAG to Agent☆70Updated last month
- PyTorch Implementation of Prompt-augmented Temporal Point Process for Streaming Event Sequence, NeurIPS 2023☆14Updated last year
- ☆68Updated 9 months ago
- Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization☆19Updated 9 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆48Updated 11 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆16Updated 8 months ago
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆79Updated 5 months ago
- WWW 2024: New Frontiers of Knowledge Graph Reasoning: Recent Advances and Future Trends☆18Updated last year
- Official implementation of the paper "Chain-of-Experts: When LLMs Meet Complex Operation Research Problems"☆103Updated 7 months ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆34Updated last year
- ☆28Updated last year
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration☆140Updated 3 weeks ago
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆24Updated last year
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆93Updated last year
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting☆21Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Updated 2 months ago
- Source code for the paper "CAT: Interpretable Concept-based Taylor Additive Models".☆19Updated last year
- FedJudge: Federated Legal Large Language Model☆35Updated last year
- Open-source code for ''Individual Fairness for Graph Neural Networks: A Ranking based Approach''.☆12Updated 3 years ago
- ☆40Updated 5 months ago
- Anupam Datta, Matt Fredrikson, Klas Leino, Kaiji Lu, Shayak Sen, Zifan Wang☆18Updated 4 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 10 months ago
- Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization (NeurIPS 21')☆23Updated 3 years ago
- Enable Comprehensive LLM Evaluation on Graph Reasoning☆73Updated 3 months ago
- ☆23Updated last year
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year