The Wayback Machine - https://web.archive.org/web/20220320131532/https://github.com/topics/data-preprocessing
Skip to content
#

data-preprocessing

Here are 507 public repositories matching this topic...

“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumPy; SciPy; a helping hand from Python's Standard Library.
  • Updated Apr 15, 2020
  • Jupyter Notebook

Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data exploration, cleaning, preprocessing and model tuning are performed on the dataset
  • Updated Dec 5, 2019
  • Jupyter Notebook

This is a project based on Data Science Bowl 2017. I did my best to propose a solution for the problem but I am still new to Deep Learning so my solution is not the optimal one but it can definitely be improved with some fine tuning and better resources.
  • Updated Sep 16, 2018
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-preprocessing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-preprocessing topic, visit your repo's landing page and select "manage topics."

Learn more