neubig / nlp-from-scratch-assignment-spring2024
An assignment for building an NLP system from scratch.
☆27 Updated last year
Alternatives and similar repositories for nlp-from-scratch-assignment-spring2024
Users who are interested in nlp-from-scratch-assignment-spring2024 are comparing it to the repositories listed below
- ☆99 Updated last year
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆68 Updated 8 months ago
- ☆188 Updated last year
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: … ☆88 Updated 2 weeks ago
- ☆79 Updated last year
- ☆158 Updated last month
- ☆86 Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 Updated last year
- a curated list of the role of small models in the LLM era ☆110 Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ☆69 Updated 7 months ago
- ☆70 Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆84 Updated last year
- Website ☆57 Updated 2 years ago
- ☆139 Updated last year
- ☆114 Updated last week
- ☆121 Updated 11 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆253 Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆36 Updated last year
- ☆129 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year
- Notes on Direct Preference Optimization ☆23 Updated last year
- Direct Preference Optimization from scratch in PyTorch ☆120 Updated 8 months ago
- ☆16 Updated last year
- ☆100 Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆126 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆132 Updated 10 months ago
- ☆52 Updated last year
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆60 Updated last year
- Multi-GPU supported kmeans clustering for cluser-clip ☆14 Updated last year