ScalingIntelligence/large_language_monkeys

ScalingIntelligence / large_language_monkeys

☆117

Alternatives and similar repositories for large_language_monkeys

Users who are interested in large_language_monkeys are comparing it to the repositories listed below.

jordan-benjamin / pydra
Simple, flexible configuration in pure Python!
☆32 · Jul 1, 2025 · Updated last year
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆207 · Mar 7, 2025 · Updated last year
ScalingIntelligence / hydragen
Hydragen: High-Throughput LLM Inference with Shared Prefixes
☆56 · May 10, 2024 · Updated 2 years ago
edwardmilsom / function-space-learning-rates-paper
Code for the paper "Function-Space Learning Rates"
☆23 · Jun 3, 2025 · Updated last year
shunzh / mcts-for-llm
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆16 · Jun 28, 2024 · Updated 2 years ago
ScalingIntelligence / codemonkeys
☆59 · Jan 28, 2025 · Updated last year
hkust-nlp / model-task-align-rl
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18 · Feb 9, 2026 · Updated 5 months ago
NY1024 / Jailbreak_GPT4o
☆28 · Jun 5, 2024 · Updated 2 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38 · Jul 11, 2023 · Updated 3 years ago
upiterbarg / lintseq
[ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)
☆19 · Feb 11, 2025 · Updated last year
Hritikbansal / jpo
☆13 · Jul 2, 2025 · Updated last year
slp-rl / SpokenStoryCloze
A spoken version of the textual story cloze benchmark
☆22 · Aug 6, 2023 · Updated 2 years ago
Hritikbansal / entigen_emnlp
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
☆13 · Aug 16, 2023 · Updated 2 years ago
kanishkg / boxing-gym
☆11 · Jul 30, 2025 · Updated 11 months ago
ScalingIntelligence / CATS
☆33 · Nov 11, 2024 · Updated last year
Pranjal2041 / AdaptiveConsistency
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs
☆41 · Jan 30, 2024 · Updated 2 years ago
locuslab / T-MARS
Code for T-MARS data filtering
☆35 · Aug 23, 2023 · Updated 2 years ago
mlfoundations / dataset2metadata
☆28 · Mar 21, 2024 · Updated 2 years ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆90 · Oct 1, 2024 · Updated last year
kyrie-23 / linear_task_arithmetic
☆12 · Jul 30, 2025 · Updated 11 months ago
FranxYao / Complexity-Based-Prompting
Complexity Based Prompting for Multi-Step Reasoning
☆17 · Mar 10, 2023 · Updated 3 years ago
flukeskywalker / nanoDD
Simple Scalable Discrete Diffusion for text in PyTorch
☆37 · Sep 27, 2024 · Updated last year
layer6ai-labs / UoMH
☆16 · Aug 7, 2023 · Updated 2 years ago
AlignmentResearch / gpt-4-novel-apis-attacks
☆23 · Dec 28, 2023 · Updated 2 years ago
kyegomez / FastFF
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16 · Nov 11, 2024 · Updated last year
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆224 · Aug 10, 2023 · Updated 2 years ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91 · Jan 23, 2025 · Updated last year
tom-pollak / claudette-pydantic
☆10 · Oct 22, 2024 · Updated last year
tianyu139 / tangent-model-composition
Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…
☆14 · May 14, 2024 · Updated 2 years ago
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15 · Mar 11, 2024 · Updated 2 years ago
SinatrasC / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆17 · Oct 9, 2024 · Updated last year
xf-zhao / Agentic-Skill-Discovery
Official implementation of Zero-Hero paper
☆31 · Feb 13, 2025 · Updated last year
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆92 · Nov 18, 2025 · Updated 8 months ago
packquickly / schedule_free_optx
Schedule free optimiser implemented in JAX using Optimistix
☆15 · May 29, 2024 · Updated 2 years ago
xiwenc1 / DRA-GRPO
Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
☆24 · Jan 6, 2026 · Updated 6 months ago
kimbochen / md-blogs
A blog where I write about research papers and blog posts I read.
☆12 · Nov 20, 2024 · Updated last year
IST-DASLab / QIGen
Repository for CPU Kernel Generation for LLM Inference
☆28 · Jul 13, 2023 · Updated 3 years ago
upiterbarg / diff_history
[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
☆20 · Aug 20, 2024 · Updated last year
Hritikbansal / generative-robustness
Create generated datasets and train robust classifiers
☆36 · Sep 1, 2023 · Updated 2 years ago