saffronh / ccai
Data processing for the Collective Constitutional AI project (a collaboration between The Collective Intelligence Project & Anthropic)
☆24Updated last year
Alternatives and similar repositories for ccai
Users that are interested in ccai are comparing it to the libraries listed below
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆64Updated 10 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 9 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆79Updated 9 months ago
- ☆86Updated last year
- Public Inflection Benchmarks☆68Updated last year
- accompanying material for sleep-time compute paper☆115Updated 5 months ago
- ☆55Updated 3 months ago
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆116Updated 2 years ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆127Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆155Updated 8 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- Chat Markup Language conversation library☆55Updated last year
- ☆61Updated last year
- Data preparation code for Amber 7B LLM☆93Updated last year
- Multimodal computer agent data collection program☆150Updated last year
- Track the progress of LLM context utilisation☆54Updated 5 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆184Updated 7 months ago
- LLM boxing matches☆58Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆19Updated last year
- An automated tool for discovering insights from research paper corpora☆139Updated last year
- ☆41Updated last year
- Code for ExploreTom☆86Updated 3 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- ☆135Updated 6 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆71Updated 5 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆88Updated last year