A complete end-to-end data generation, cleaning, and visualization project comparing two business years (2023 vs 2024) through custom synthetic data. Built entirely in Python (Jupyter Notebook) and visualized using Power BI โ€” combining storytelling with interactivity using dynamic slicers, maps, and custom tooltips.

๐Ÿ› ๏ธ Tools & Technologies Used

  • Python (pandas, Faker, NumPy) โ€“ Generated and cleaned 2023 & 2024 datasets using random logic and libraries.
  • Power BI โ€“ Designed interactive dashboards with city/state maps, slicers, donut charts, KPI cards, tooltips, and bar graphs.
  • Generative AI (ChatGPT) โ€“ Used as a co-pilot to debug scripts, organize logic, rewrite markdown, and brainstorm layout/design enhancements.

๐Ÿ“ Dataset Description

This project doesnโ€™t use real-world business sales data. Instead:

๐Ÿ”น I first generated synthetic sales data for 2023 and 2024 using Pythonโ€™s Faker and NumPy libraries.
๐Ÿ”น Then saved them into two datasets:

๐ŸŽฏ Problem Statements / Goals

This project wasnโ€™t just about revenue per year โ€” it was about answering questions like:

  • ๐Ÿ›’ Are certain product categories more dominant by sales or by total orders?
  • ๐Ÿ™๏ธ Which cities contributed the most to overall revenue?
  • ๐Ÿ“ˆ How do trends change month-by-month across KPIs like Sales, Orders, Quantity Sold?
  • ๐Ÿ‘ค Who are the top customers and how much are they contributing to our growth?
  • ๐Ÿงญ What is the business growth direction comparing 2023 and 2024?

๐Ÿ”„ Project Workflow

Python (Jupyter Notebook)

  • Used Faker, random, and NumPy to simulate customer/product/order behaviour.
  • Created logic blocks for 2023 and 2024 (product-category mappings, ID formats, etc.)
  • Saved final outputs as .csv files for Power BI import.

Power BI Dashboard

Built an interactive dashboard with 3 report pages covering:

๐Ÿ“„ Page 1: KPI Dashboard

  • ๐Ÿ“Œ KPI cards: Sales, quantity sold, orders, customers
  • ๐Ÿ“ˆ Line charts: Month-wise trends
  • ๐ŸŽฏ Gauges: Year-end vs target KPIs

KPI Dashboard

๐Ÿ“„ Page 2: Comparison Analysis

  • ๐Ÿ“Š Sales by product category (bar)
  • ๐Ÿงฎ Orders by product category (donut)
  • ๐Ÿ—บ๏ธ City-wise sales (map)
  • ๐Ÿง‘ Top 100 customers by sales and quantity

Comparison Analysis

๐Ÿ“„ Page 3: Custom Tooltip View

  • Tooltip with Qtr-wise breakdown
  • Appears on visual hover in dashboards

Tooltip Summary

๐Ÿ’ก Key Insights

  • ๐Ÿ“ˆ 2024 saw a spike in both total customers and sales.
  • ๐Ÿงข Clothing & Electronics dominated in sales and orders.
  • ๐Ÿ—บ๏ธ San Francisco, New York, Houston led city-wise performance.
  • ๐Ÿ’Ž Top 100 customers made up over 60% of revenue.
  • ๐ŸŽฏ Most KPIs surpassed their original 2024 targets.

๐Ÿš€ Things I Learned

  • How to create realistic synthetic datasets with Python.
  • Balancing logic complexity with data readability.
  • How to format and clean data to be dashboard-ready.
  • Designing Power BI dashboards that tell stories visually.
  • Using AI as a fast-thinking assistant in exploratory and design stages.

๐Ÿ“ฆ How to Explore This Project

  1. ๐Ÿ“ฅ Download the datasets from the repo.
  2. ๐Ÿ Explore the logic in the Jupyter Notebook.
  3. ๐Ÿ“Š Open and analyze the Power BI .pbix file.
  4. ๐Ÿงญ Use filters/slicers in Power BI to explore dynamic trends.

๐Ÿ™ THANK YOU