A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.
☆19 · Feb 6, 2025 · Updated last year
Alternatives and similar repositories for cot-eval
Users who are interested in cot-eval are comparing it to the libraries listed below.
- Using modal.com to process FineWeb-edu data ☆20 · Apr 5, 2025 · Updated 10 months ago
- The evaluation framework for the InfiCoder-Eval benchmark. ☆21 · Jul 22, 2024 · Updated last year
- OpenAI GPT-3/3.5/4 API client written in Go ☆20 · Apr 13, 2023 · Updated 2 years ago
- Code for the MTEB leaderboard ☆30 · Feb 4, 2025 · Updated last year
- Universal differential equations for ecologists ☆14 · Feb 4, 2026 · Updated 3 weeks ago
- Material for the DataLucence:Images course ☆10 · Jun 14, 2017 · Updated 8 years ago
- User-friendly viewer for Parquet files ☆10 · Jan 10, 2026 · Updated last month
- A Data Mesh demo repository ☆13 · Oct 10, 2024 · Updated last year
- DOS Program Development ☆13 · Nov 9, 2022 · Updated 3 years ago
- GAOKAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models. ☆39 · Jan 7, 2025 · Updated last year
- Documentation and tutorials worth sharing. ☆10 · Dec 7, 2022 · Updated 3 years ago
- The program ranked first in Audio-only track of DCASE2024 Challenge task3. ☆20 · Apr 12, 2025 · Updated 10 months ago
- fine-tuning tutorial ☆18 · Feb 20, 2026 · Updated last week
- A single-cell transcriptomic analysis of endometriosis, endometriomas, eutopic endometrial samples and uninvolved ovary tissues highlight… ☆18 · Jan 12, 2023 · Updated 3 years ago
- MemVerge Netflow Plugin ☆16 · Jun 24, 2025 · Updated 8 months ago
- ☆12 · Jan 11, 2026 · Updated last month
- A collection of actions for working with ROS data ☆14 · Jun 11, 2025 · Updated 8 months ago
- Some tools for working with digraphs, partial orders and topological sorting with Python ☆12 · Sep 7, 2011 · Updated 14 years ago
- Systematic Multi-Trait AAV Capsid Engineering for Efficient Gene Delivery (Eid et al., Nature Communications, 2024) ☆11 · Aug 26, 2024 · Updated last year
- ☆11 · Dec 6, 2023 · Updated 2 years ago
- ☆14 · Dec 12, 2022 · Updated 3 years ago
- Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale ☆17 · Updated this week
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classification ☆21 · Jan 29, 2026 · Updated last month
- The MATLAB source for the DTCWT toolbox (1,2,3)D and keypoints. ☆11 · Feb 10, 2014 · Updated 12 years ago
- False discovery rate regression ☆10 · Nov 12, 2020 · Updated 5 years ago
- Tetris with an interesting twist ☆12 · Mar 5, 2022 · Updated 3 years ago
- A Swedish Natural Language Understanding Benchmark ☆11 · Dec 12, 2025 · Updated 2 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆12 · Jul 14, 2025 · Updated 7 months ago
- Multimodal data loader compatible with pytorch and tensorflow ☆12 · Aug 14, 2024 · Updated last year
- DGCIT: Double Generative Adversarial Networks for Conditional Independence Testing ☆11 · Nov 22, 2023 · Updated 2 years ago
- Causal Mediation analysis ☆10 · Dec 26, 2025 · Updated 2 months ago
- ☆13 · Nov 5, 2024 · Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 · Dec 12, 2024 · Updated last year
- ☆10 · Jan 9, 2024 · Updated 2 years ago
- FiberNavigator - Maxime Chamberland - ☆10 · Feb 9, 2022 · Updated 4 years ago
- This repository will contain links to the most famous available books of ML that are online ☆12 · Oct 15, 2024 · Updated last year
- Implementation of Phase Coupling Estimation in Python and Matlab (see http://arxiv.org/abs/0906.3844 ) ☆16 · Feb 1, 2014 · Updated 12 years ago
- R markdown format and template for light-on-dark beamer presentations—with fussy extras. ☆12 · Nov 1, 2021 · Updated 4 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging ☆11 · Feb 14, 2026 · Updated 2 weeks ago