BERT - encoder only
GPT - decoder only
Raychev, V., Bielik, P., and Vechev, M. Probabilistic Model for Code with Decision Trees. In Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications (2016), OOPSLA ’16, ACM
semantic of code (like the operator of Python, the intepreter)
formal of code (like AST, grammar)
Introduction to Neural Network Verification; Aws Albarghouthi