CL-bench: A Benchmark for Context Learning
☆495Feb 8, 2026Updated 2 months ago
Alternatives and similar repositories for CL-bench
Users that are interested in CL-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆88Mar 18, 2026Updated 3 weeks ago
- ☆14Oct 28, 2023Updated 2 years ago
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆27May 21, 2025Updated 10 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆221Oct 12, 2025Updated 6 months ago
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 3 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆358Jul 29, 2025Updated 8 months ago
- ☆13Feb 11, 2019Updated 7 years ago
- ☆19Mar 10, 2025Updated last year
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆57Dec 25, 2025Updated 3 months ago
- A new benchmark of 118 ICPC problems for evaluating LLM reasoning in competitive coding, featuring realistic ICPC competition scenario, r…☆16May 18, 2025Updated 10 months ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- ☆126Jan 21, 2026Updated 2 months ago
- ☆33May 27, 2025Updated 10 months ago
- Orienting Latent Actions for Video World Modeling☆84Feb 11, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆28Feb 10, 2025Updated last year
- [ICLR 2026] The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint"☆51Nov 20, 2025Updated 4 months ago
- Learning on the Job: An Experience-Driven, Self-Evolving Agent for Long-Horizon Tasks☆86Oct 16, 2025Updated 5 months ago
- ☆40Dec 26, 2025Updated 3 months ago
- 中文大语言模型评测2024高考数学专题☆19Jun 14, 2024Updated last year
- ☆16Dec 9, 2023Updated 2 years ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks.☆14Oct 3, 2022Updated 3 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 5 months ago
- ML4CO-Bench-101: Benchmark Machine Learning for Classic Combinatorial Problems on Graphs.☆44Nov 17, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PeRL: Parameter-Efficient Reinforcement Learning☆74Updated this week
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆24Sep 23, 2025Updated 6 months ago
- ☆11Mar 22, 2024Updated 2 years ago
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆34Mar 5, 2024Updated 2 years ago
- ☆23Nov 8, 2023Updated 2 years ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated last year
- Source code for PECRS (EACL 2024)☆12Feb 3, 2024Updated 2 years ago
- [EMNLP 2023] Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation☆31Oct 18, 2025Updated 5 months ago
- Source code for the NAACL 2021 paper: "Distantly Supervised Relation Extraction with Sentence Reconstruction and Knowledge Base Priors"☆12Jul 15, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆14Nov 27, 2024Updated last year
- 中文大语言模型评测第二期☆71Oct 23, 2023Updated 2 years ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆55Apr 6, 2025Updated last year
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆47Feb 13, 2025Updated last year
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆20Mar 5, 2025Updated last year
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆46Oct 15, 2025Updated 5 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 6 months ago