ArmelRandy / Self-instruct
A repository to perform self-instruct with a model on HF Hub
☆30Updated 11 months ago
Related projects: ⓘ
- A framework for few-shot evaluation of autoregressive language models.☆98Updated last year
- A repository for transformer critique learning and generation☆84Updated 9 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆109Updated 11 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆118Updated 6 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆81Updated 2 weeks ago
- evol augment any dataset online☆55Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆151Updated 4 months ago
- Evaluating LLMs with fewer examples☆131Updated 5 months ago
- ☆92Updated last year
- ☆105Updated this week
- ☆38Updated 5 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆130Updated 2 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆201Updated 10 months ago
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆76Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆59Updated 10 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆96Updated last week
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents☆55Updated this week
- Steering vectors for transformer language models in Pytorch / Huggingface☆52Updated 2 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆82Updated 2 months ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆209Updated 8 months ago
- ☆114Updated 2 weeks ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web"☆106Updated last week
- A simple unified framework for evaluating LLMs☆121Updated this week
- ☆158Updated last year
- Wrapper to easily generate the chat template for Llama2☆62Updated 6 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆134Updated 6 months ago
- AI Logging for Interpretability and Explainability🔬☆74Updated 3 months ago
- Chain-of-Hindsight, A Scalable RLHF Method☆213Updated 11 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆143Updated 2 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆104Updated 3 months ago