Wikipedia document terms frequency
☆17Apr 27, 2020Updated 6 years ago
Alternatives and similar repositories for wikipedia-idf
Users that are interested in wikipedia-idf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆14Oct 12, 2024Updated last year
- ☆14Jan 16, 2019Updated 7 years ago
- This repository contains code for the paper "Are Pretrained Language Models Symbolic Reasoners over Knowledge?"☆13Mar 23, 2021Updated 5 years ago
- Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View☆10May 17, 2022Updated 4 years ago
- ☆14Apr 27, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- ☆13Oct 28, 2020Updated 5 years ago
- Phish lyric and song finder☆12Feb 22, 2019Updated 7 years ago
- This is the facade for installation and access to the individual components☆16Apr 8, 2026Updated last month
- Active Learning for text classification using scikit-learn☆24Jun 6, 2019Updated 6 years ago
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated last month
- E3C is a freely available multilingual corpus (Italian, English, French, Spanish, and Basque) of semantically annotated clinical narrativ…☆29Jan 18, 2024Updated 2 years ago
- A branch of the boilerpipe project☆15Mar 18, 2011Updated 15 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22May 28, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Computes innocently excludable and includable sets of alternatives☆13Oct 14, 2021Updated 4 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- A lightweight, "pure Python" library for handing Extensible Binary Markup Language (EBML) data☆18Mar 30, 2026Updated last month
- Face Recognition in real-world images [ICASSP 2017]☆38Feb 27, 2017Updated 9 years ago
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆12Aug 30, 2019Updated 6 years ago
- Playing with arithmetic coding and RNNs☆22Nov 23, 2016Updated 9 years ago
- NumPy+Jax with named axes and an uncompromising attitude☆23Mar 4, 2025Updated last year
- Computer Modern font family for the web☆17Jul 21, 2024Updated last year
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆21Jan 10, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Rust implementation of SIF and uSIF: Simple and fast sentence embedding☆19Jan 22, 2025Updated last year
- ☆15Oct 21, 2023Updated 2 years ago
- A list of resources dedicated to compositionality☆14Feb 21, 2019Updated 7 years ago
- Accompanies the paper "Learnability and Semantic Universals" ; trains recurrent neural networks to learn to verify sentences with quantif…☆11Aug 10, 2019Updated 6 years ago
- ☆22Dec 8, 2022Updated 3 years ago
- load word embeddings to Torch.Tensor☆14May 12, 2016Updated 10 years ago
- Signal processing toolbox for Torch 7☆50Jul 2, 2017Updated 8 years ago
- The simplest repository for training medium-sized BackpackLM for cs224n☆25Aug 13, 2023Updated 2 years ago
- Text Similarity Search Application using Modern NLP and Elasticsearch☆29Apr 13, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Mar 24, 2025Updated last year
- Universal text classifier for generative models☆24Jul 25, 2024Updated last year
- Code for ACL2021 long paper: Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases☆29Dec 8, 2021Updated 4 years ago
- French word embeddings from series sub-titles☆22Sep 2, 2018Updated 7 years ago
- A Streaming-Native Serving Engine for TTS/STS Models☆66May 15, 2026Updated last week
- ☆24Mar 24, 2022Updated 4 years ago
- the PyTorch implementation of paper: [Neural Response Generation via GAN with an Approximate Embedding Layer](http://www.aclweb.org/antho…☆10Feb 6, 2018Updated 8 years ago