πΌ Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge)
β17Jan 10, 2025Updated last year
Alternatives and similar repositories for Baby-CoThought
Users that are interested in Baby-CoThought are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π A LaTeX template for LMU Master/Bachelor theses (paper+slides).β16May 22, 2019Updated 6 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Modelsβ11Jan 19, 2024Updated 2 years ago
- Example for Logging LLM Evaluator Prompt Responsesβ18Aug 14, 2023Updated 2 years ago
- β12Aug 14, 2023Updated 2 years ago
- Opensource, personal & local chat interface for language models.β13Jun 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- VSCode extension to displays current CPU stats, Memory, Battery stats, and moreβ14Feb 19, 2026Updated last month
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projectsβ24Sep 14, 2023Updated 2 years ago
- Transform your videos into captivating animations. Processes each frame to create an animation styled with Stable Diffusion. Simply uploaβ¦β16May 7, 2023Updated 2 years ago
- memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7Bβ21May 26, 2024Updated last year
- [ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languagesβ106Updated this week
- β22Sep 25, 2023Updated 2 years ago
- https://arxiv.org/abs/2312.10807β79Dec 29, 2025Updated 3 months ago
- Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)β12Oct 30, 2024Updated last year
- Source code for CoNLL 2021 paper by Huebner et al. 2021β21Jul 13, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- HTMX and NextJS Streaming AI examplesβ24Oct 27, 2023Updated 2 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"β22Jun 26, 2023Updated 2 years ago
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"β30Aug 20, 2021Updated 4 years ago
- Code for "Multi-scale Abstract Reasoning" paperβ12Oct 17, 2022Updated 3 years ago
- β24Feb 5, 2024Updated 2 years ago
- Menagerie of video models trained on various video datasetsβ10Oct 13, 2024Updated last year
- β14May 3, 2022Updated 3 years ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Trainβ¦β12Feb 25, 2023Updated 3 years ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.β50Nov 3, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMCβ13Mar 1, 2026Updated last month
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verificationβ42Apr 29, 2023Updated 2 years ago
- β15May 24, 2022Updated 3 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.β12Mar 6, 2023Updated 3 years ago
- This repo contains code for our ICML 2023 paper: MEWL: Few-shot multimodal word learning with referential uncertaintyβ15Jun 10, 2023Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- β11Jun 20, 2022Updated 3 years ago
- β12Jan 2, 2024Updated 2 years ago
- larc solving with gpt4β20May 25, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text eβ¦β11Dec 27, 2024Updated last year
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Modelsβ12Jul 1, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2β¦β12Feb 19, 2023Updated 3 years ago
- β12Apr 24, 2024Updated last year
- ACRE: Abstract Causal REasoning Beyond Covariationβ19Dec 7, 2021Updated 4 years ago
- Machine translation and word embeddings of cuneiform corpusesβ13Nov 17, 2024Updated last year
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.β13May 11, 2022Updated 3 years ago