facebookresearch/llm-cross-capabilities

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/llm-cross-capabilities)

facebookresearch / llm-cross-capabilities

Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"

☆43

Alternatives and similar repositories for llm-cross-capabilities

Users that are interested in llm-cross-capabilities are comparing it to the libraries listed below

Sorting:

ThomAS122102RAY / PanNuke-cell-core-region-identification-with-DINO
View on GitHub
coded with and corrected by Google Anti-Gravity
☆13Nov 23, 2025Updated 3 months ago
ContextualAI / CLAIR_and_APO
View on GitHub
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆61Aug 30, 2024Updated last year
bryanchrist / MathNeuro
View on GitHub
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆21Jun 15, 2025Updated 8 months ago
ellenmellon / INSCIT
View on GitHub
INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
☆16Jan 21, 2025Updated last year
zaydzuhri / flame
View on GitHub
Fork of Flame repo for training of some new stuff in development
☆19Feb 27, 2026Updated last week
yzjiao / RolePred
View on GitHub
Source code for EMNLP findings paper "Open-Vocabulary Argument Role Prediction for Event Extraction"
☆19Nov 5, 2022Updated 3 years ago
allenai / fluid-benchmarking
View on GitHub
Fluid Language Model Benchmarking
☆26Sep 16, 2025Updated 5 months ago
ozyyshr / StructChem
View on GitHub
Structured Chemistry Reasoning with Large Language Models
☆40May 4, 2024Updated last year
orionw / FollowIR
View on GitHub
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆52Jul 3, 2024Updated last year
maszhongming / ReactionMiner
View on GitHub
Repository for the EMNLP 2023 Demo Paper "Reaction Miner: An Integrated System for Chemical Reaction Extraction from Textual Data"
☆19Jan 27, 2025Updated last year
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
weizeming / momentum-attack-llm
View on GitHub
☆23Jan 17, 2025Updated last year
facebookresearch / DIG-In
View on GitHub
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20Jun 3, 2024Updated last year
microsoft / x-reasoner
View on GitHub
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆50Feb 4, 2026Updated last month
RulinShao / massive-serve
View on GitHub
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆25Jun 6, 2025Updated 9 months ago
janphilippfranken / sami
View on GitHub
Self-Supervised Alignment with Mutual Information
☆20May 24, 2024Updated last year
Jackymn25 / utm-department-analysis
View on GitHub
rmp data ranking
☆13Nov 4, 2025Updated 4 months ago
codingfisch / flashrl
View on GitHub
Fast reinforcement learning 💨
☆28Jul 15, 2025Updated 7 months ago
architsharma97 / dpo-rlaif
View on GitHub
☆98Jun 27, 2024Updated last year
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated last year
johnmackintosh / cusumcharter
View on GitHub
Easier CUSUM control charts. Returns simple CUSUM statistics, CUSUMs with control limit calculations, and function to generate faceted …
☆26Nov 22, 2024Updated last year
ucl-dark / llm_debate
View on GitHub
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
☆127Mar 22, 2024Updated last year
amazon-science / PAE
View on GitHub
☆68Mar 6, 2025Updated last year
trapoom555 / Language-Model-STS-CFT
View on GitHub
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64Aug 2, 2024Updated last year
LeeSureman / Sequence-Labeling-Early-Exit
View on GitHub
Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit
☆28Aug 19, 2022Updated 3 years ago
kyegomez / Reka-Torch
View on GitHub
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆28Feb 9, 2026Updated last month
ChenxinAn-fdu / CGSum
View on GitHub
[AAAI'21] Code and dataset for our paper: Enhancing Scientific Papers Summarization with Citation Graph
☆25Oct 16, 2022Updated 3 years ago
WadeYin9712 / Dynosaur
View on GitHub
Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)
☆64Nov 30, 2023Updated 2 years ago
InflectionAI / Inflection-Benchmarks
View on GitHub
Public Inflection Benchmarks
☆68Mar 6, 2024Updated 2 years ago
JHU-CLSP / RATIONALYST
View on GitHub
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆35Oct 3, 2024Updated last year
BeastyZ / LLM-Verified-Retrieval
View on GitHub
Repo for Llatrieval
☆31Aug 21, 2024Updated last year
facebookresearch / RAM
View on GitHub
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆344Dec 16, 2025Updated 2 months ago
Shark-NLP / CAB
View on GitHub
☆31Jul 2, 2023Updated 2 years ago
balmasi / g2_reviews_llm_topic_modeling
View on GitHub
An experiment to see if we can process G2 reviews to extract topics from reviews
☆10Feb 5, 2024Updated 2 years ago
LuLuLuyi / LongHeads
View on GitHub
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆30Apr 8, 2024Updated last year
theopensystemslab / planx-new
View on GitHub
Plan✕ is a platform for creating and publishing digital planning services
☆17Updated this week
maszhongming / ParaKnowTransfer
View on GitHub
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆33May 9, 2024Updated last year
Asap7772 / understanding-rlhf
View on GitHub
Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…
☆32Apr 20, 2024Updated last year
dwzhu-pku / LongEmbed
View on GitHub
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆148Nov 9, 2024Updated last year