bigcode-project / transformers
☆26Updated last year
Alternatives and similar repositories for transformers:
Users that are interested in transformers are comparing it to the libraries listed below
- Pre-training code for CrystalCoder 7B LLM☆54Updated 10 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- ☆44Updated 9 months ago
- LLM finetuning☆42Updated last year
- Github repo for storing LlamaDatasets☆33Updated last year
- Learning to Program with Natural Language☆5Updated last year
- LLM-powered autonomous agent with hierarchical task management☆47Updated last year
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆107Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- ☆34Updated 8 months ago
- Based on the tree of thoughts paper☆47Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- ☆126Updated last year
- Open Implementations of LLM Analyses☆103Updated 5 months ago
- ☆75Updated last week
- The Next Generation Multi-Modality Superintelligence☆71Updated 6 months ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆11Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- Data preparation code for Amber 7B LLM☆86Updated 10 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- Repository for analysis and experiments in the BigCode project.☆117Updated last year
- The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".☆31Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 10 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- Multi-Domain Expert Learning☆67Updated last year
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 2 years ago