One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
Updated Jun 10, 2025 - Python
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Always know what to expect from your data.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Automatically visualize your pandas dataframe via a single print! 📊 💡
Visualize and compare datasets, target values and associations, with one line of code.
Beautiful visualizations of how language differs among document types.
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
Collection of must-read papers for data science, machine learning, and deep learning engineers
Interactively explore unstructured datasets from your dataframe.
Automatically find issues in image datasets and practice data-centric computer vision.
Compilation of R and Python code from the Data Professor YouTube channel.
Developer-first embedded analytics
Build 12 Data Apps in Python with Streamlit
Kernel Density Estimation in Python
Complete-Life-Cycle-of-a-Data-Science-Project
Ways of doing Data Science Engineering and Machine Learning in R and Python
Code review for data in dbt
skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.
Preliminary Exploratory Visualisation of Data