Project Name

Toast2Word

Project Description

A word guessing game made with word2vec, played between a computer program and human players.

In Collaboration with Anna Garbier

Year

2019

Tags

  • Word2Vec
  • Machine Learning
  • Python

“Toast” is a word guessing game using Word2Vec(words to vector data). We transformed various collections of themed nouns, based on their semantic meanings, into points plotted in a two-dimensional space. Similar words or words of the same category appear closer together on the map. Words that are semantically different are further apart. For example, "bread" and "toast" appear close together while "motorcycle" and "toast" can be further apart.

The game rule is simple. With any word collection of player’s choice, the computer randomly selects a mystery word. The goal is for the player to guess the target word with the least guesses by asking the computer a series of questions like “Is this target word closer to __ or __?” The computer will calculate the semantic distance between the hidden target word and players’ guesses. It will always respond with the word that’s closer to the target until the target word is found.


toast


So far, we have four collections of words. For some, we scraped from the internet then extracted the special nouns. For some, we found existing noun collections online. We fed the extracted collections into spaCY’s word2vec model to get the multi-dimensional vector data of every single word. Then, we use tSNE to deduct the multi-dimensional vector to two-dimensions so every word will have an x and y coordinate within a two-dimensional vector space.