Documentation effort for the BookCorpus dataset
☆34Jun 2, 2021Updated 4 years ago
Alternatives and similar repositories for bookcorpus-datasheet
Users that are interested in bookcorpus-datasheet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- Transformers at any scale☆42Jan 18, 2024Updated 2 years ago
- ☆20Aug 17, 2021Updated 4 years ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Apr 14, 2021Updated 4 years ago
- An author identification system based on recur☆21Dec 13, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for Exploit Clues from Views: Self-Supervised and Regularized Learning for Multiview Object Recognition☆12Jun 17, 2020Updated 5 years ago
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 8 months ago
- Menagerie of video models trained on various video datasets☆10Oct 13, 2024Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆20Oct 12, 2024Updated last year
- Repository for the paper "RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?"☆27May 1, 2025Updated 10 months ago
- 汽车-androidAPP-物联网-蓝牙☆11Nov 29, 2017Updated 8 years ago
- Deformable Convolutional Networks v2 with Pytorch☆10Jul 29, 2020Updated 5 years ago
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago
- ☆13Sep 23, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu…☆16Feb 23, 2021Updated 5 years ago
- Ice is a rapid information extraction customizer☆15Apr 26, 2021Updated 4 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- Versatile Metrics Collection for Python☆20Feb 17, 2026Updated last month
- Search Engine Guided Non-Parametric Neural Machine Translation☆14Oct 23, 2017Updated 8 years ago
- Implementation of "Modeling Past and Future for Neural Machine Translation"☆15Mar 16, 2018Updated 8 years ago
- Trains small LMs. Designed for training on SimpleStories☆12Sep 15, 2025Updated 6 months ago
- ☆15Jun 26, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- GPT-3 attempts to predict & balance chemical reactions☆13Aug 2, 2020Updated 5 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Feb 14, 2018Updated 8 years ago
- ☆12Jun 14, 2021Updated 4 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- ☆14Sep 10, 2021Updated 4 years ago
- Official PyTorch Implementation of Opt-CWM: Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals.☆23Mar 27, 2025Updated last year
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- DEPRECATED version of SoundFile☆14May 26, 2020Updated 5 years ago
- Release code for light-weight calibrator: a separable component for unsupervised domain adaptation☆13Jul 17, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 8 months ago
- Jupyter notebooks showing to implement statistical functions.☆14Jun 14, 2020Updated 5 years ago
- ☆10Aug 14, 2023Updated 2 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 2 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- A library for training crosscoders☆16May 28, 2025Updated 10 months ago