mustafamariam / LLM-Connections-Solver
Code for Columbia University COMS 3997 – LLM Ethics and Foundations
☆13Updated 3 months ago
Alternatives and similar repositories for LLM-Connections-Solver:
Users that are interested in LLM-Connections-Solver are comparing it to the libraries listed below
- Evals meant to evaluate language models' ability to reason over long contexts.☆9Updated 7 months ago
- An attribution library for LLMs☆38Updated 7 months ago
- ☆22Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap☆85Updated 6 months ago
- ☆80Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated last week
- llm sampler that only allows words that are in the bible☆26Updated 4 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆77Updated 6 months ago
- ☆20Updated 5 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆77Updated last month
- A dataset of alignment research and code to reproduce it☆77Updated last year
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated 11 months ago
- ☆108Updated 4 months ago
- ☆48Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 2 months ago
- ☆33Updated 9 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- ☆51Updated last week
- ☆128Updated 3 weeks ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 8 months ago
- ☆66Updated 10 months ago
- ☆20Updated 4 months ago
- ☆89Updated last month
- ☆97Updated 6 months ago
- look how they massacred my boy☆63Updated 6 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆43Updated last year
- Scrape and export data from the Open LLM Leaderboard.☆44Updated 4 months ago