The purpose of this course is for students to gain a solid foundation in the most important tools and strategies for addressing the 3 most common challenges in enterprise business intelligence: 1) Reducing the time needed to produce insightful metrics and reports. 2) Freeing data trapped inside of legacy tools and federated datasources. 3) Providing a centralized framework for user interaction and consumption of analysis, reports and curated data.
I have adressed two business problems,
The professional deliverable includes the following sections,
A real estate agent was asked to help the Sellers in selling their houses in Boston at reasonable prices based on given details of the houses such as,
Seller’s housing information:
Based on above details project forecasts incorporates immense, valuable research,
Data analysis and visualization on past Boston Housing data to find valuable insights from raw data and given data by Sellers
Building predictive algorithms to best recommend the house prices
Justification of recommended housing prices
Other necessary factors to consider for selling the house
Linear Regression
Ordinary Least Square Algorithm
Based on predicitive modeling, a real estate agent can provide the reasonable house prices which would be very useful for the sellers to sell their houses at good value in Boston area.
Boston Housing Dataset: Boston Housing Prices
PIMA Indians are a group Native American people who lives in the Phoenix, Arizona. So many years, these people are are living with poor diet which where carbohydrate deficiency seems more and in turn, they are exposed to type 2 diabetes among children as well as adults.
To deal with such a huge and deadly disease, many Medicare organizations are trying to achieve their best possible solutions to diagnose the diabetes among children as well as adults to mitigate the risk of diabetes in the future, to reduce the period required for diagnosis with exact identification of people having diabetes as fast as possible, and to provide proper treatment for diabetic people for a reduction in the severity of complications associated with this disease.
Client wants to make the best decision on a business problem of classifying the right number of people who have diabetes and who do not have diabetes to invest valuable time towards the proper treatment of diabetic patients rather than in diabetes testing.
Based on the business problem, project forecasts incorporates valuable research,
Data analysis and visualization on the PIMA Indians diabetes data to find valuable insights to provide more exposure to various important factors and their contribution
Building predictive algorithms to identify a person has diabetes or not
Justification and necessary medical attributes contributing to identifying a person as a diabetic
Recommendation for Client
Based on predicitive modeling, a client should be able to make a confident decision based on the performance of final model for identifying diabetic people from non-diabetic people.
PIMA Indians Diabetes Dataset: PIMA Indians Diabetes Dataset
1) Introduction to Statistical Learning (supervised learning models) by James, Witten, Hastie, and Tibshirani Available for free online: http://www-bcf.usc.edu/~gareth/ISL
2) Elements of Statistical Learning (deeper mathematical explanations): by Hastie, Tibshirani, and Friedman Also available online at : http://www.stat.stanford.edu/ElemStatLearn
3) R for Data Science Wicham and Grolemund: Available for free online: : https://r4ds.had.co.nz/
4) Automate the Boring Stuff with Python Al Sweigart Available for free online: https://automatetheboringstuff.com/
5) Think Python Allen B Downey Available for free online: http://greenteapress.com/thinkpython/html/index.html