spcl / CheckEmbedLinks
Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"
☆20Updated last week
Alternatives and similar repositories for CheckEmbed
Users that are interested in CheckEmbed are comparing it to the libraries listed below
Sorting:
- An LLM inference engine, written in C++☆15Updated last week
- MPI Code Generation through Domain-Specific Language Models☆14Updated 7 months ago
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated 3 weeks ago
- Code and data for paper "(How) do Language Models Track State?"☆14Updated 2 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆17Updated last week
- ☆36Updated 2 months ago
- ☆9Updated 2 months ago
- Using FlexAttention to compute attention with different masking patterns☆44Updated 9 months ago
- Utilities for constructing a large dataset of LLVM IR☆21Updated 3 weeks ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- ☆13Updated last week
- Implementation of Spectral State Space Models☆16Updated last year
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 7 months ago
- ☆18Updated 4 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- Make reasoning models scalable☆37Updated 3 weeks ago
- Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025☆23Updated 2 months ago
- ☆18Updated 2 months ago
- Personal solutions to the Triton Puzzles☆19Updated 11 months ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 9 months ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆19Updated 11 months ago
- ☆28Updated 4 months ago
- Compression for Foundation Models☆31Updated 3 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 3 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- LeanAgent is a novel lifelong learning framework for formal theorem proving that continuously generalizes to and improves on ever-expandi…☆26Updated 2 weeks ago
- ☆36Updated last month
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year