You are to analyse a dataset collected from Kaggle. The website offers

You are to analyse a dataset collected from Kaggle. The website offers aggregated datasets and also serves as a community hub that hosts machine learning competitions to solve business problems. The provided dataset contains Amazon’s Top 50 bestselling books from 2009 to 2019. You need to explore and study the key aspects and features of the dataset before answering the following questions. Download the dataset “Amazon_bestsellers.csv” from Canvas.
Variables Dictionary
• Name: Name of the book.
• Author: Author of the book.
• User Rating: User rating of the book on the year it joined the list.
• Reviews: Number of reviews the book received on the year it joined the list.
• Price: Price of the book on the year it joined the list.
• Year: Year the book joined the list.
• Genre: Genre of the book. It can be listed as Fiction or Non-Fiction.
(a) Provide a summary of the dataset in tabular format. It should identify key aspects of the dataset such as data attributes, data types and any useful statistics. (Up to 200 words for part (a))
(15 marks)
(b) Assess quality of the data regarding its completeness, missing values and outliers. Perform any necessary cleaning and preparation of the data and illustrate with example(s). (Up to 200 words for part (b))
(20 marks)
(c) Identify one (1) business problem that can be addressed by analysing the data. Your description should provide details to explain how the data can be used to address the problem identified. (Up to 150 words for part (c))
(15 marks)
(d) Recommend two (2) professional graphical charts that provide interesting information about the business problem identified in Part (c). You may use any software tool (such as Excel, Tableau, PowerBI, etc.) to produce the proposed graphical chart. Provide a screenshot of each produced chart. Use up to 300 words to explain how each chart is produced and discuss why the two charts are recommended.
(30 marks)