ZihanWang314 / coeCheckLinks
☆16Updated 2 months ago
Alternatives and similar repositories for coeCheck
Users that are interested in coeCheck are comparing it to the libraries listed below
Sorting:
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated 2 months ago
- ☆19Updated 3 weeks ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆15Updated 2 weeks ago
- ☆64Updated 2 months ago
- ☆24Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 5 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆67Updated 2 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆31Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆55Updated last month
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 months ago
- Official Code Release for "Training a Generally Curious Agent"☆21Updated 2 weeks ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆34Updated last month
- ☆13Updated 5 months ago
- Official Repository for Task-Circuit Quantization☆20Updated this week
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- ☆79Updated 9 months ago
- Using FlexAttention to compute attention with different masking patterns☆43Updated 8 months ago
- ☆45Updated 3 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- ☆50Updated this week
- Lego for GRPO☆28Updated this week
- ☆38Updated 5 months ago
- ☆9Updated last month
- ☆21Updated 5 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 3 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆32Updated 2 months ago
- Official repo of paper LM2☆40Updated 3 months ago