clinicalml / realhumanevalLinks

☆21

Alternatives and similar repositories for realhumaneval

Users that are interested in realhumaneval are comparing it to the libraries listed below

Sorting:

kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆39Updated 7 months ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 5 months ago
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24Updated 2 weeks ago
facebookresearch / NeuralMemory
A Data Source for Reasoning Embodied Agents
☆19Updated last year
ctlllll / understanding_llm_benchmarks
Understanding the correlation between different LLM benchmarks
☆29Updated last year
itl-ed / llm-dp
LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task
☆44Updated 6 months ago
gautierdag / plancraft
Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs
☆15Updated last week
amzn / extremely-efficient-query-encoder
efficient query encoding for dense retrieval
☆11Updated 11 months ago
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 5 months ago
swarnaHub / System-1.x
PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models
☆24Updated 11 months ago
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]
☆19Updated last month
facebookresearch / coocmap
code for paper "Accessing higher dimensions for unsupervised word translation"
☆21Updated 2 years ago
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆16Updated last year
katzurik / Knowledge_Navigator
☆20Updated 4 months ago
facebookresearch / adaptive_scheduling
Experimental scripts for researching data adaptive learning rate scheduling.
☆23Updated last year
ncsulsj / Robust_Summarization
☆9Updated last year
IBM / ColPret
Efficient Scaling laws and collaborative pretraining.
☆16Updated 5 months ago
prateeky2806 / ComPEFT
☆26Updated last year
Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Updated 2 years ago
EleutherAI / training-jacobian
☆23Updated 7 months ago
facebookresearch / dmae_st
Directed masked autoencoders
☆14Updated 2 years ago
benediktstroebl / agent-evals
☆22Updated last month
duykhuongnguyen / LASeR-MAB
Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"
☆13Updated 9 months ago
luohongyin / EntST
Entailment self-training
☆25Updated 2 years ago
allenai / sso
Repository for Skill Set Optimization
☆14Updated 11 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆35Updated last year
kumar-shridhar / Screws
SCREWS: A Modular Framework for Reasoning with Revisions
☆27Updated last year
arnab-api / romba
Applies ROME and MEMIT on Mamba-S4 models
☆14Updated last year
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13Updated 8 months ago