google-deepmind / ithaca
Restoring and attributing ancient texts using deep neural networks
☆541Updated 9 months ago
Related projects: ⓘ
- Cramming the training of a (BERT-type) language model into limited compute.☆1,284Updated 3 months ago
- WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag…☆993Updated 2 months ago
- Tools and Modeling Code for the MASSIVE dataset☆539Updated last year
- ACL 2022☆122Updated 9 months ago
- The world's largest social media toxicity dataset.☆175Updated 2 years ago
- arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors …☆1,156Updated last year
- Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation☆1,431Updated last year
- MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from th…☆475Updated last year
- A platform for managing machine learning experiments☆814Updated last month
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.☆303Updated this week
- maximal update parametrization (µP)☆1,339Updated 2 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆819Updated last year
- The AI Knowledge Editor☆181Updated 2 years ago
- ☆796Updated last month
- 100 exercises to learn JAX☆562Updated 2 years ago
- ☆480Updated 7 months ago
- A library for distributed ML training with PyTorch☆365Updated last year
- ☆367Updated 2 years ago
- ☆495Updated 7 months ago
- The implementation of DeBERTa☆1,966Updated 11 months ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆861Updated 5 months ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,214Updated 10 months ago
- Code release for "Dropout Reduces Underfitting"☆311Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆850Updated 10 months ago
- Language Modeling with the H3 State Space Model☆509Updated 11 months ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆555Updated 10 months ago
- BanditPAM C++ implementation and Python package☆647Updated 7 months ago
- ☆1,240Updated last year
- ML Collections is a library of Python Collections designed for ML use cases.☆879Updated last month
- Multi-angle c(q)uestion answering☆458Updated 2 years ago