saffronh / ccai
Data processing for the Collective Constitutional AI project (a collaboration between The Collective Intelligence Project & Anthropic)
☆26 Updated 2 years ago
Alternatives and similar repositories for ccai
Users that are interested in ccai are comparing it to the libraries listed below
- Just a bunch of benchmark logs for different LLMs ☆119 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆89 Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆83 Updated last year
- Track the progress of LLM context utilisation ☆55 Updated 9 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆69 Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆121 Updated 2 years ago
- accompanying material for sleep-time compute paper ☆119 Updated 9 months ago
- Evaluating LLMs with CommonGen-Lite ☆94 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆112 Updated last year
- Multimodal computer agent data collection program ☆161 Updated 2 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆74 Updated 9 months ago
- Interaction-first method for generating demonstrations for web-agents on any website ☆51 Updated 9 months ago
- ☆93 Updated last month
- Public Inflection Benchmarks ☆68 Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis ☆199 Updated last week
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts. ☆48 Updated 3 months ago
- Evaluating LLMs with fewer examples ☆169 Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆189 Updated 11 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆153 Updated last year
- ☆56 Updated 7 months ago
- ☆41 Updated last year
- Code for ExploreTom ☆90 Updated 7 months ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind ☆66 Updated 2 years ago
- ☆74 Updated last year
- ☆87 Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated 2 years ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆112 Updated 9 months ago
- Pre-training code for CrystalCoder 7B LLM ☆57 Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆69 Updated last year
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths ☆35 Updated 2 years ago