🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge)
☆17Jan 10, 2025Updated last year
Alternatives and similar repositories for Baby-CoThought
Users that are interested in Baby-CoThought are comparing it to the libraries listed below.
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Example for Logging LLM Evaluator Prompt Responses☆18Aug 14, 2023Updated 2 years ago
- A full-text error corrector for English based on transformers and deep learning☆10Jan 8, 2023Updated 3 years ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects☆24Sep 14, 2023Updated 2 years ago
- Introductory Julia Course☆17Dec 6, 2023Updated 2 years ago
- memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B☆21May 26, 2024Updated last year
- [ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages☆106Apr 14, 2026Updated 3 weeks ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆18Sep 15, 2025Updated 7 months ago
- ☆14Jan 4, 2025Updated last year
- An application of tabu-enhanced genetic search to the railway optimization problem introduced in the informatiCup2022 by the German Infor…☆14May 13, 2022Updated 3 years ago
- Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning☆11Jul 20, 2022Updated 3 years ago
- WeChat official account: 机器感知 (Machine Perception) | Tracking the Latest Arxiv Papers☆38Jun 5, 2025Updated 11 months ago
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated last year
- Source code for CoNLL 2021 paper by Huebner et al. 2021☆21Jul 13, 2023Updated 2 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"☆30Aug 20, 2021Updated 4 years ago
- Menagerie of video models trained on various video datasets☆10Oct 13, 2024Updated last year
- ☆14May 3, 2022Updated 4 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- A benchmark for language models based on the UK Linguistics Olympiad☆11Mar 3, 2025Updated last year
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆42Apr 29, 2023Updated 3 years ago
- Recursive Visual Programming (ECCV 2024)☆18Nov 20, 2024Updated last year
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆14Feb 24, 2024Updated 2 years ago
- ☆15May 24, 2022Updated 3 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- The first Chinese dataset for safety detection in psychological counseling dialogues☆23Nov 7, 2023Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ☆11Jun 20, 2022Updated 3 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- larc solving with gpt4☆20May 25, 2023Updated 2 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- ☆12Apr 24, 2024Updated 2 years ago
- ACRE: Abstract Causal REasoning Beyond Covariation☆19Dec 7, 2021Updated 4 years ago
- Machine translation and word embeddings of cuneiform corpuses☆13Nov 17, 2024Updated last year
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago