The tool I propose for the data transformation task is Python. Python is an interpreted, general-purpose, dynamic programming language. It comes with a wide range of readily available libraries that support tasks from web development to machine learning. Its syntax is simple and intuitive, and it is often easy to learn for programmers and non-programmers alike. Pandas is one of the Python libraries most widely used for data processing.

The operations that were performed with Hive could be done in Python as follows:

    import pandas as pd

    data = pd.read_csv("/../Video_Games_Sales_as_at_22_Dec_2016.csv")
    # Keep only the columns needed for the analysis
    sales_genre = data[['Genre', 'Global_Sales', 'Critic_Score']]
    # Round Global_Sales to whole numbers
    sales_genre = sales_genre.round({'Global_Sales': 0})
    # Keep rows whose rounded sales are greater than zero
    sales_genre = sales_genre.loc[sales_genre['Global_Sales'] > 0]
    # Sort by sales, highest first
    sales_genre = sales_genre.sort_values(by="Global_Sales", ascending=False)

The following are the main advantages of Python for data transformation.

1. Since it is a full-fledged programming language there ar...