Sample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding.
☆27Apr 12, 2024Updated last year
Alternatives and similar repositories for word_embedding
Users that are interested in word_embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pipeline for training word embeddings using word2vec on wikipedia corpus.☆73Dec 13, 2018Updated 7 years ago
- ☆12Feb 9, 2019Updated 7 years ago
- Joint Optimization of Cascade Ranking Models (WSDM 19)☆13Jun 21, 2022Updated 3 years ago
- Tokenize and clean strings in Python☆11Jan 11, 2018Updated 8 years ago
- A Persian Word2Vec Model trained by Wikipedia articles☆10Jan 5, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A huge number library for Purescript with emphasis on correctness.☆12Apr 27, 2022Updated 3 years ago
- This repository uses pretrain BERT embeddings for transfer learning in QA domain☆29Dec 18, 2018Updated 7 years ago
- Relational Scheme interpreter, written in miniKanren, with Scheme pattern matcher☆11Mar 17, 2015Updated 11 years ago
- Real-time streaming data pipeline for Twitter Tweets☆10Jan 31, 2022Updated 4 years ago
- ☆10Dec 3, 2018Updated 7 years ago
- Stream processing engine☆13Apr 7, 2021Updated 4 years ago
- Latest version of GoFFish Distributed Graph Processing Platforms☆12Apr 30, 2018Updated 7 years ago
- PureScript Erlang hello world☆13Aug 3, 2018Updated 7 years ago
- PureScript version management in PureScript.☆14Jan 27, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Prolog implemented in Python☆12Sep 6, 2024Updated last year
- In this repository, I try to share some of the little tips and tricks and amazing spiders that I used to work with on the scrapy framewor…☆12Feb 2, 2020Updated 6 years ago
- Persian Word Embedding Using FastText Pre-trained Model☆13Apr 16, 2021Updated 4 years ago
- Visual Studio Code extension for viewing files and folders in the workspace with size and the estimated gzip size.☆16May 23, 2021Updated 4 years ago
- Basic Purescript wrapper for node-sqlite3☆14May 14, 2022Updated 3 years ago
- Low-level interface for asynchronous variables☆15Feb 8, 2025Updated last year
- JavaScript's native date type and corresponding functions.☆15Apr 27, 2022Updated 3 years ago
- Single header, 100 line implementation of 32bit PCG random number generator. Extremely fast RNG with simple API☆14Dec 11, 2019Updated 6 years ago
- Code and data for the paper: Answer-based Adversarial Training for Generating Clarification Questions☆43Jan 4, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Purescript community project ideas. a.k.a. What Purescript needs to take over the world☆16Feb 12, 2021Updated 5 years ago
- Spring Data Neo4j 3 Cineasts Tutorial Example☆12Aug 6, 2015Updated 10 years ago
- VSCode extension to open the Windows Explorer Context Menu☆18Feb 27, 2024Updated 2 years ago
- OpenCyc Ontology or Knowledge Base Data Files☆15Jan 14, 2022Updated 4 years ago
- Example of Minimap for Drawflow☆14Jul 2, 2021Updated 4 years ago
- Snowball Sampling in NetworX☆14Nov 9, 2018Updated 7 years ago
- An example repository for Deploy To Deta button.☆15Jul 22, 2021Updated 4 years ago
- Locally run an Instruction-Tuned Chat-Style LLM☆14Mar 30, 2023Updated 2 years ago
- a large-scale graph database created as a combination of multiple taxonomy backbones extracted from 5 existing knowledge graphs, namely: …☆14Jan 23, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Tools for the TREC CAsT benchmark☆30Dec 15, 2022Updated 3 years ago
- ☆22Jun 22, 2022Updated 3 years ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB)☆19Nov 23, 2024Updated last year
- A Spider that crawls reddit.com/r/cats☆15Aug 28, 2018Updated 7 years ago
- Wikipedia-based Explicit Semantic Analysis, as described by Gabrilovich and Markovitch☆36May 13, 2020Updated 5 years ago
- 利用bert预训练模型生成句向量或词向量☆26Oct 29, 2020Updated 5 years ago
- ☆13Oct 30, 2023Updated 2 years ago