The data set for this course comes from a virtual blog site. We will use it to answer business questions such as: Which countries do your customers come from? Which day of the week receives the most online traffic? Which region contributes the most clickstream data? The course covers basic functions for data cleaning, data analysis, and visualization. It also lays the foundation for programming on distributed systems such as Spark SQL, or with the Alibaba Cloud MaxCompute Python SDK.
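To give a feel for the kind of question involved, here is a minimal sketch in Python using only the standard library. The records, the field layout (timestamp, country code), and the helper names `count_by_country` and `count_by_weekday` are all made up for illustration; they are not the course data set or its schema, and the course itself may use different tools.

```python
from collections import Counter
from datetime import datetime

# Hypothetical clickstream records: (timestamp, country code).
# These values are invented for illustration only.
clicks = [
    ("2023-01-02 09:15:00", "CN"),
    ("2023-01-02 17:40:00", "US"),
    ("2023-01-03 11:05:00", "CN"),
    ("2023-01-07 20:30:00", "SG"),
    ("2023-01-09 08:00:00", "CN"),
]

# Fixed English names so the result does not depend on the system locale.
WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def count_by_country(records):
    """Count how many clicks came from each country."""
    return Counter(country for _, country in records)


def count_by_weekday(records):
    """Count how many clicks fell on each day of the week."""
    counts = Counter()
    for timestamp, _ in records:
        parsed = datetime.strptime(timestamp, "%Y-%m-%d %H:%M:%S")
        # datetime.weekday() returns 0 for Monday through 6 for Sunday.
        counts[WEEKDAYS[parsed.weekday()]] += 1
    return counts


country_counts = count_by_country(clicks)
weekday_counts = count_by_weekday(clicks)

print(country_counts.most_common(1))  # busiest country and its click count
print(weekday_counts.most_common(1))  # busiest weekday and its click count
```

The same group-and-count pattern carries over to a `GROUP BY` query in a distributed SQL engine, which is why it is a reasonable starting point before moving to larger systems.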