maxim-saplin / llm_chess
LLM Chess - Large Language Models Competing in Chess
☆63 Updated this week
Alternatives and similar repositories for llm_chess
Users who are interested in llm_chess are comparing it to the libraries listed below
- ☆163 Updated 8 months ago
- Testing baseline LLMs performance across various models ☆305 Updated last month
- Draw more samples ☆193 Updated last year
- ☆90 Updated 2 months ago
- ☆69 Updated 2 weeks ago
- A repo to evaluate various LLM's chess playing abilities. ☆83 Updated last year
- ☆173 Updated last month
- GRadient-INformed MoE ☆264 Updated 11 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information ☆442 Updated last month
- Benchmarking Agentic LLM and VLM Reasoning On Games ☆188 Updated 2 weeks ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆321 Updated 10 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆312 Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆172 Updated 7 months ago
- ☆457 Updated 3 months ago
- ☆92 Updated 3 weeks ago
- ☆56 Updated last month
- ☆185 Updated 3 weeks ago
- Train your own SOTA deductive reasoning model ☆105 Updated 6 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache ☆123 Updated 3 weeks ago
- Testing LLM reasoning abilities with family relationship quizzes. ☆63 Updated 7 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆328 Updated 9 months ago
- Benchmark environment for evaluating vision-language models (VLMs) on popular video games! ☆301 Updated 3 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model ☆825 Updated this week
- rl from zero pretrain, can it be done? yes. ☆264 Updated 2 weeks ago
- Clue inspired puzzles for testing LLM deduction abilities ☆40 Updated 5 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆450 Updated 11 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆146 Updated 6 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆257 Updated this week
- smolLM with Entropix sampler on pytorch ☆150 Updated 10 months ago
- A benchmark for emotional intelligence in large language models ☆351 Updated last year