For more projects, please see my GitHub.


A large crowd-sourced dataset for developing natural language interfaces for relational databases. WikiSQL was released alongside our work, "Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning."


A Python library for creating and visualizing experiment logs. It has an optional Firebase adapter for storing and retrieving data, so logs can be visualized in real time.


A database-backed library for quickly loading pretrained word embeddings.

Stanford NLP Stanza

The Stanford NLP group's shared repository for Python infrastructure. The goal of Stanza is not to replace your modeling tools of choice, but to offer implementations of common patterns that are useful in machine learning experiments.