Data Mining and Statistical Analysis

FIND A SOLUTION AT Academic Writers Bay

There are 2 deliverables for this assignment! You will use part 1 to complete part 2.
Part #1: Northwind Data Mining and Statistical Analysis – Data Warehouse
The purpose of this milestone assignment is to complete the tasks described below in preparation for your final project delivery.
1. Data Warehouse:
· Create a data warehouse database, including the fact and dimension tables (star schema).
· Create the schema for each table.
· Populate the tables using either ETL (Pentaho) or SQL (PostgreSQL).
2. Preprocessing for SAS:
· Extract data from the data warehouse, creating a file for input into SAS. The format of the file is your choice. Ensure SAS University Edition accepts your selected format.
You should use the plan formulated in Milestone 1 of Module 3 for the detailed steps you intend to follow.
For this milestone assignment, you are expected to submit:
Screenshots of the populated data warehouse
Star schema design, either a drawing or screenshot
Row counts for the fact and dimension tables
Brief description of your key learnings from completing this assignment
Your assignment must meet the following requirements:
Be 2-4 pages in length, not including the cover and references pages.
Follow the APA Writing Guidelines including references and citations. Your paper should include an introduction, a body with at least two fully developed paragraphs, and a conclusion.
Be clearly and well written using excellent grammar and style techniques. Be concise and logical. You are being graded, in part, on the quality of your writing
Be supported with at least two scholarly journal articles (at least one of which is peer-reviewed dated within 5 years).
Part #2: Northwind Data Mining and Statistical Analysis Project – Planning
The objective of this Portfolio Project is mining data from a data warehouse, which contains data from the Northwind database that was constructed during your installation of PostgreSQL.
Below are the summarized tasks for this Portfolio Project.
Data Warehouse:
Create a data warehouse database, including the fact and dimension tables (star schema).
Create the schema for each table.
Populate the tables using either ETL (Pentaho) or SQL (PostgreSQL).
Preprocessing for SAS:
Extract data from the data warehouse, creating a file for input into SAS. The format of the file is your choice. Ensure SAS University Edition accepts your selected format.
Statistical Analysis Using SAS:
Import data created in the preprocessing step.
Conduct statistical analysis using the appropriate statistics from each category:
Summary statistics
Classification
Clustering
Association

YOU MAY ALSO READ ...  Auditing HA3032

Prepare an analysis report.
Using your plan prepared in Module 3, Milestone 1, and leveraging the data warehouse and preprocessing steps in Module 6, Milestone 2, complete the tasks under Statistical Analysis Using SAS.
Your analysis report must include:
An analysis of each variable in the data set
An analysis to determine which variables could serve as appropriate classifier variables
An analysis to determine if any variables are candidates for clustering
An analysis to determine if any variables have associations
Any tables, histograms, or scatterplot graphs necessary to support your analyses
A recommendation as to the suitability of this data set for meeting your organization’s business goal
Your project must meet the following requirements:
Be 6-8 pages in length, not including the cover and references pages.
Follow the APA Writing Guidelines including references and citations. Your paper should include an introduction, a body with at least four fully developed paragraphs, and a conclusion.
Be clearly and well written using excellent grammar and style techniques. Be concise and logical. You are being graded, in part, on the quality of your writing.
Be supported with at least three peer-reviewed, scholarly references dated within the past 5 years, and one citation from the course textbooks

Order from Academic Writers Bay
Best Custom Essay Writing Services

QUALITY: 100% ORIGINAL PAPERNO PLAGIARISM – CUSTOM PAPER