shan23chen / MedBrowseComp
☆20Updated 2 weeks ago
Alternatives and similar repositories for MedBrowseComp
Users who are interested in MedBrowseComp are comparing it to the repositories listed below
- Using multiple LLMs for ensemble forecasting☆16Updated last year
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆31Updated 3 months ago
- ☆53Updated 2 years ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 7 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 3 months ago
- ☆24Updated 8 months ago
- ☆46Updated 8 months ago
- ☆43Updated 8 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆27Updated 4 months ago
- ☆21Updated 3 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated last year
- ☆21Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- ☆34Updated last week
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 4 months ago
- A method for steering LLMs to better follow instructions☆45Updated this week
- ☆16Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆56Updated last month
- PyTorch implementation for MRL☆18Updated last year
- Tools for merging pretrained large language models.☆19Updated 11 months ago
- ☆36Updated this week
- ☆48Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆60Updated last week
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance"☆19Updated 3 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆47Updated last year
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year