Maven02

Maven Market Project - Analyzing Data of a Grocery Chain Firm

Introduction

This Project Case is part of the Microsoft Power BI Desktop for Business Intelligence Course on Udemy online platform by Maven Analytics. In this case, I am being put in the point of view of a Business Intelligence Analyst in Maven Market, a Multinational Grocery Chain company whose business locations are in Canada, Mexico and the United States. My role is to transform raw data into a dashboard report to help the Management Team track the KPI, monitor and compare the performance, as well as identify some insights. I worked through the entire business intelligence workflow, i.e.: connecting and shaping the data, building a relational model, adding calculated fields, and designing an interactive report.

Problem Questions

The Management Team would like to find out the Topline Performance of their products. This must be delivered by:

Datasets

There are 7 Datasets used in this project, i.e.:

1. Customer Table Dataset (MavenMarket_Customers.csv)

Field Name Data Type
customer_id Whole Number
customer_acct_num Text
first_name Text
last_name Text
customer_address Text
customer_city Text
customer_state_province Text
customer_postal_code Text
customer_country Text
birthdate Date
marital_status Text
yearly_income Text
gender Text
total_children Whole Number
num_children_at_home Whole Number
education Text
acct_open_date Date
member_card Text
occupation Text
homeowner Text
full_name Text
birth_year Text
has_children Y/N

2. Products Table Dataset (MavenMarket_Products.csv)

Field Name Data Type
product_id Whole Number
product_brand Text
product_name Text
product_sku Text
product_retail_price Decimal Number
product_cost Decimal Number
product_weight Decimal Number
recyclable Decimal Number
low_fat Decimal Number
discount_price Fixed Decimal Number (Currency)

3. Calendar Table Dataset (MavenMarket_Calendar.csv)

Field Name Data Type
date Date
Start of Week Date
Name of Day Text
Start of Month Date
Name of Month Text
Quarter Whole Number
Year Whole Number

4. Regions Table Dataset (MavenMarket_Regions.csv)

Field Name Data Type
region_id Whole Number
sales_district Text
sales_region Text

5. Stores Table Dataset (MavenMarket_Stores.csv)

Field Name Data Type
store_id Whole Number
region_id Whole Number
store_type Text
store_name Text
store_street_address Text
store_city Text
store_state Text
store_country Text
store_phone Text
first_opened_date Date
last_remodel_date Date
total_sqft Whole Number
grocery_sqft Whole Number
full_address Text
area_code Text

6. Transaction Table Dataset (MavenMarket_Transactions_1997 & MavenMarket_Transactions_1998.csv)

Field Name Data Type
transaction_date Date
stock_date Date
product_id Whole Number
customer_id Whole Number
store_id Whole Number

7. Return Table Dataset (MavenMarket_Returns_1997-1998.csv)

Field Name Data Type
return_date Date
product_id Whole Number
store_id Whole Number
quantity Whole Number

Connecting and Shaping the Data

There are several steps in this phase in order to connect and shape the Data used in this case.

1. Update Power BI options and settings by:

2. Connect to the MavenMarket_Customers csv file by:

3. Connect to the MavenMarket_Products csv file by:

4. Connect to the MavenMarket_Stores csv file by:

5. Connect to the MavenMarket_Regions csv file by:

6. Connect to the MavenMarket_Calendar csv file by:

7. Connect to the MavenMarket_Returns csv file by:

8. Add a new folder on your desktop (or in your documents) named “MavenMarket Transactions”, containing both the MavenMarket_Transactions_1997 and MavenMarket_Transactions_1998 csv files by:

9. With the exception of the two data tables, disable “Include in Report Refresh”, then Close & Apply by:

Building a Relational Model

There are several steps in this phase in order to build a Relational Model.

1. In the MODEL view, arrange the tables so that the lookup tables are above the data tables by:

2. Confirm the following:

3. Hide all foreign keys in both data tables from Report View, as well as “region_id” from the Stores table

4. In the DATA view, complete the following steps by:

All the steps above resulted in the following Relation Model:

Adding Calculated Fields and DAX (Data Analysis eXpression)

There are 2 spots in which I worked to add Calculation Fields and DAX, i.e. the Data View and the Report View.

1. In the DATA view, add the following calculated columns by:

2. In the REPORT view, add the following measures (Assign to tables as you see fit, and use a matrix to match the “spot check” values) by:

Executing all the detailed Steps in 2 spots above, resulted in adding a new Table containing all Measures needed in the modelling.

The overall tables created the complete Relation Model for all the data loaded in the Power Bi

Building Interactive Report

The report dashboard is to provide information in the Problem Questions Section. Below is the Dashboard for the report.

I also built a Note Report Page where its purpose is to provide insights (by utilizing the Bookmark function) so that it can be quickly shown to the management / stakeholders. Later on, for future use, I can add as many insights as can be found based on the data. Yet, in this case I only provided 3 quick insights, i.e.: Portland 1000 Sales, Victoria Hits Positive, and Plato drove the highest profit rate.

Analysis to answer the Problem Questions

Solution:

Based on the picture above, Hermanos peaked the position of the Top 30 Product Brands with the most Total Transaction of 5,342 with Total Profit of $ 21,753 and Margin of 58.64%, as well as Return Rate of 0.95%. Its position is followed by Ebony and Tell Tale in the second and third position. All the Top 30 Product Brand drove a total of 113,668 Transactions, with acumulative Profit Margin of 59.94% that equals to Total Profit of $ 449,627, and total Return Rate of 1.00%.

Solution:

Based on the scorecards displayed above, the Total Transactions in the current month had reached 18,325 number of transaction. This has surpassed the target of 17,339 transaction (around 5.69% increment) above the last month transaction. This drove a 5.61% increment of profit which equals of a Total Profit of $ 71,682 in the current month. However, it turned out that in the same period, the current Monthly Returned also increased by 2.9% or equals 14 additional returns from the previous month return (482). This means along with the increasing number of transaction, there was also an increasing number of goods returned. It is advisable to examine the cause as well of the type of the returned goods.

Solution:

If we expand the map on the Dashboard, we will see from the map above that most of the stores are located in USA, while the country with the second most store number is Mexico. Canada is in the 3rd place for number of store in the country. This number of store drove the number of transaction, as well as total profit gained (representated by the bubble size).

A quick spot check for Total Transactions and Total Profit for each country has confirmed that.

Solution:

The Treemap above implies that most of the Transaction happened in the USA. This is because most stores are located in USA. This is aligned with the information provided in the previous solution. The more stores are open, the more Transaction will take place, which eventually would drive the Total Profit.

Solution:

Based on the visuals above, the revenue experienced fluctuation over the time during year 1998. The highest revenue ever achieved was $ 120K, a slightly different from the corresponding target of around $ 119K.

The live report dashboard can be found here