word2vec embeddings for the top 15,000 english words [300 dimensional]
frequencies of stemmed words based on norvig dataset
measure the salience / importance of words in a text document -- based on the frequency of the words in the document, versus their frequency in English
generate nonsense words and sentences based on letter frequencies