oooranz / Baby-CoThoughtView external linksLinks
πΌ Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge)
β17Jan 10, 2025Updated last year
Alternatives and similar repositories for Baby-CoThought
Users that are interested in Baby-CoThought are comparing it to the libraries listed below
Sorting:
- Example for Logging LLM Evaluator Prompt Responsesβ18Aug 14, 2023Updated 2 years ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projectsβ24Sep 14, 2023Updated 2 years ago
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"β30Aug 20, 2021Updated 4 years ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verificationβ41Apr 29, 2023Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends!β12Jul 18, 2025Updated 6 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoderβ10Mar 16, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2β¦β12Feb 19, 2023Updated 2 years ago
- β12Apr 24, 2024Updated last year
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpointsβ38Mar 21, 2021Updated 4 years ago
- Pytorch implementation of standard metrics for clusteringβ10Mar 21, 2023Updated 2 years ago
- This repository contains the implementation code for paper: Mixup Your Own Pairsβ12Oct 1, 2023Updated 2 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.β12Mar 6, 2023Updated 2 years ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paperβ12Oct 19, 2023Updated 2 years ago
- Learning Algebraic Representation for Systematic Generalization in Abstract Reasoningβ11Jul 20, 2022Updated 3 years ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Trainβ¦β13Feb 25, 2023Updated 2 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.β13May 11, 2022Updated 3 years ago
- A context-aware embedding similarity scoreβ11Aug 23, 2023Updated 2 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Modelsβ11Jan 19, 2024Updated 2 years ago
- Menagerie of video models trained on various video datasetsβ10Oct 13, 2024Updated last year
- Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)β12Oct 30, 2024Updated last year
- A repository to get acquainted with basic training tasks in natural language processing and machine learningβ11Dec 27, 2023Updated 2 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text eβ¦β11Dec 27, 2024Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.β12May 31, 2024Updated last year
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.β11Mar 27, 2021Updated 4 years ago
- Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Languβ¦β14Apr 7, 2025Updated 10 months ago
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teamingβ15Feb 24, 2024Updated last year
- This repository contains the PLOD Dataset for Abbreviation Detection released with our LREC 2022 publicationβ12Sep 25, 2022Updated 3 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 2 years ago
- A minimal working example of using undetected-chromedriver on AWS Lambda with Selenium and Dockerβ19Aug 12, 2025Updated 6 months ago
- Claude agent for Deep AI Researchβ23Dec 9, 2025Updated 2 months ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2β¦β14Feb 2, 2026Updated 2 weeks ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificatiβ¦β16Jun 3, 2024Updated last year
- A full-text error corrector for English based on transformers and deep learningβ10Jan 8, 2023Updated 3 years ago
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"β14Sep 9, 2025Updated 5 months ago
- β12Jan 2, 2024Updated 2 years ago
- β11Jun 20, 2022Updated 3 years ago
- LDPC codes for Illumina sequencing-based DNA storageβ11Dec 2, 2020Updated 5 years ago
- Topic Model based on Pretrained Sentence Embeddings (with BERT)β13Feb 8, 2023Updated 3 years ago
- Collection of lines of code for basics of clean plots in Plotly and Matplotlibβ13Feb 5, 2021Updated 5 years ago