The Wayback Machine - https://web.archive.org/web/20210812102915/https://github.com/topics/data-exploration
Skip to content
#

data-exploration

Here are 202 public repositories matching this topic...

zblz
zblz commented Aug 15, 2017

Currently all of the metrics computed are independent of a target variable or column, but if lens.summarise took the name of a column as the target variable, the output of some metrics could be more interpretable even if the target variable is not used in any kind of predictive modelling.

A good example of this could be PCA (see #14), which could plot the different categories of the target va

odd-platform

Understand the relationships between various features in relation with the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross validation. Performed parameter tuning, compared the test scores and suggested a best model to predict the final sale price of a house. Seaborn is used to plot graphs and scikit learn package is used for statistical analysis.
  • Updated Jan 19, 2018
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-exploration topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-exploration topic, visit your repo's landing page and select "manage topics."

Learn more