ZihanWang314 / coeCheckLinks
☆19Updated 6 months ago
Alternatives and similar repositories for coeCheck
Users that are interested in coeCheck are comparing it to the libraries listed below
Sorting:
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆94Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- ☆26Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 5 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆30Updated last week
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆31Updated last week
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆60Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆105Updated 3 months ago
- ☆67Updated 5 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 9 months ago
- XmodelLM☆39Updated 9 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆88Updated 3 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- A repository for research on medium sized language models.☆77Updated last year
- Esoteric Language Models☆96Updated last month
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆116Updated last month
- Resa: Transparent Reasoning Models via SAEs☆41Updated last month
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆56Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 9 months ago
- ☆54Updated 10 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last month
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 6 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆39Updated last month
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Updated 6 months ago
- Official Repository for Task-Circuit Quantization☆23Updated 3 months ago
- Lego for GRPO☆29Updated 3 months ago
- ☆56Updated 2 months ago
- Official repo of paper LM2☆42Updated 6 months ago
- ☆21Updated last month