apple / ml-toadLinks
☆14Updated 9 months ago
Alternatives and similar repositories for ml-toad
Users that are interested in ml-toad are comparing it to the libraries listed below
Sorting:
- Tasks for describing differences between text distributions.☆16Updated 10 months ago
- ☆14Updated last month
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- ☆15Updated last year
- This repo explores how AMR to address tasks difficult for LLMs☆13Updated last year
- ☆15Updated 2 months ago
- Repository for Skill Set Optimization☆13Updated 11 months ago
- BERT-based nominal Semantic Role Labeling (SRL), both using the Nombank dataset and the Ontonotes dataset.☆18Updated 2 years ago
- Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021☆9Updated 3 years ago
- Supporting code for ReCEval paper☆28Updated 9 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆15Updated 2 weeks ago
- ☆22Updated 2 years ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆36Updated last year
- Data and code for ACL 2023 paper XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations☆10Updated 2 years ago
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆21Updated 11 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 3 weeks ago
- ☆25Updated 2 years ago
- ☆22Updated 6 months ago
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements☆23Updated 9 months ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆13Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated 2 years ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆29Updated 10 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Updated last year
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆58Updated 3 months ago
- Repository containing the website for the EMNLP 2023 conference☆16Updated 4 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆27Updated 2 months ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆18Updated 2 months ago
- ☆20Updated last year