

Transformer is useful for damn near anything. At the end of the day, what we consider intelligence is the ability to predict what comes next, whether that is what our senses will tell us next or what the next hypothesis to test should be based on the data we have seen so far.





I’m saying there is no “big leap” necessary. As the paper that introduced the transformer said, attention is all you need.