A common technique in linguistics is to group words by similarity - for example, "king" and "queen" have similar connotations, so they would naturally be categorized together. One such method which allows these groups is Word2Vec, a machine learning algorithm which learns these groupings through text samples. 


This work applies the linguistic-based Word2Vec algorithm to chemistry. This novel method allows for similar compounds to be grouped together, showing that compounds with similar function share the same "chemical language".