[TMLR'25] The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
☆54Apr 14, 2025Updated 10 months ago
Alternatives and similar repositories for CoT-ICL-Eval
Users that are interested in CoT-ICL-Eval are comparing it to the libraries listed below
- [EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery☆303Nov 5, 2025Updated 4 months ago
- ☆32Oct 13, 2025Updated 4 months ago
- PCF8563 full-featured driver library for general-purpose MCU and Linux.☆30Oct 26, 2025Updated 4 months ago
- Official implementation repository of Holistic Data Schedule☆199Jan 2, 2026Updated 2 months ago
- A lightweight, no_std multi-chain HD wallet derivation library in Rust.☆156Updated this week
- A comprehensive React Native starter template built with Expo. It includes reusable UI components, Poppins font setup, NativeWind, Fireba…☆23Updated this week
- A simple lightweight Model Context Protocol (MCP) server integration framework☆17Jan 23, 2026Updated last month
- The next generation deep reinforcement learning toolkit☆3,462Jun 16, 2023Updated 2 years ago
- An NER tool for ancient place names based on Pleiades and Spacy.☆24Sep 15, 2020Updated 5 years ago
- EM4100 full-featured driver library for general-purpose MCU and Linux.☆19Oct 25, 2025Updated 4 months ago
- ☆40Jul 19, 2025Updated 7 months ago
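- Digital Base (数字底座) is a unified, secure management support platform for the digital transformation of large government bodies and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, security controls, and related functions. Based on the three-administrator management model, it offers microservices, multi-tenancy, containerization, and support for domestic (Chinese) technology stacks, lets users quickly build their own business applications with a code generator, and can also link to various…☆2,576Mar 3, 2026Updated last week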
- [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?☆37Jun 5, 2025Updated 9 months ago
- ☆241Dec 13, 2025Updated 2 months ago
- A Systematic Evaluation Framework for Large Language Models in Multi-omics Analysis☆168Nov 15, 2025Updated 3 months ago
- AI-powered StartUp Accelerator Engine built with Next.js, LangChain, PostgreSQL + pgvector. Upload, organize, and chat with documents. In…☆787Updated this week
- A studio for designing and shipping shadcn-style components in Expo/React Native with Storybook-backed visual regression.☆176Dec 12, 2025Updated 2 months ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆16Nov 11, 2025Updated 3 months ago
- AuraMatrix is a personality analysis web app that uses an LLM for evaluation. I made this for Gyanotsav-2025 to show different ways to ut…☆11Dec 22, 2025Updated 2 months ago
- 💰The only official version💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy mining pool fee skimming, mining pool proxy, mining pool relay, mining pool…☆3,882Updated this week
- Align Anything: Training All-modality Model with Feedback☆4,635Nov 27, 2025Updated 3 months ago
- A high-performance IM server.☆4,246Updated this week
- The first open autoregressive foundational video AI model.☆2,891Oct 14, 2024Updated last year
- An in-depth study of GraphRAG☆1,513Jul 1, 2025Updated 8 months ago
- 🔥minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, mining pool fee skimming, mining pool relay, for mining farm operations and maintenance☆3,296Feb 28, 2026Updated last week
- Foundations of Medical Large Language Model Learning☆89Updated this week
- Glitch Gremlin AI☆15Apr 5, 2025Updated 11 months ago
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces☆10Mar 24, 2025Updated 11 months ago
- Some resources (books, papers, videos, and online courses) about ML, DL, DM☆12Mar 14, 2021Updated 4 years ago
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you.☆23Nov 20, 2025Updated 3 months ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar…☆29May 17, 2025Updated 9 months ago
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy☆192Feb 26, 2026Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- A new approach to data insights☆1,005Jun 25, 2025Updated 8 months ago
- AI-powered tool for efficient abstract and PDF screening in systematic reviews.☆1,304Feb 27, 2026Updated last week
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆2,381Feb 13, 2026Updated 3 weeks ago
- ☆246Jan 12, 2025Updated last year
- ☆167Oct 1, 2025Updated 5 months ago
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆11Aug 12, 2021Updated 4 years ago