Build software better, together

zzw922cn / Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

audio deep-learning tensorflow paper end-to-end evaluation cnn lstm speech-recognition rnn automatic-speech-recognition feature-vector data-preprocessing phonemes timit-dataset layer-normalization rnn-encoder-decoder chinese-speech-recognition

Updated Feb 9, 2022
Python

machinelearnjs / machinelearnjs

Star

Open

Refactor all dataSync with arraySync

2

JasonShin commented Apr 8, 2019

I'm submitting a ...
[/] enhancement
Summary
As a result of upgrading the Tensorflow version to 0.15.1, we should refactor all the dataSycn with arraySync. This will greatly improve the overall readability of the code.

msamogh / nonechucks

Star

Open

Write tests

msamogh commented Mar 18, 2019

Write unit test coverage for SafeDataset and SafeDataLoader, along with the functions in utils.py.

Route next batch through step_to_index_fn in _SafeDataLoaderIter

akanz1 / klib

Sponsor

Star

Easy to use Python library of customized functions for cleaning and analyzing data.

python data-science data-visualization feature-selection data-analysis klib data-preprocessing data-cleaning

Updated Mar 12, 2022
Python

harunurrashid97 / 100-Days-Of-ML-Code

Star

Open

License.md

harunurrashid97 commented Aug 8, 2018

Today i add a license for this repository.

iTechArt / convtools-ita

Star

convtools is a python library to declaratively define conversions for processing collections, doing complex aggregations and joins.

python functional-programming transformations conversions code-generation data-preprocessing data-processing data-preparation

Updated Oct 5, 2021
Python

HasnainRaz / SemSegPipeline

Star

A simpler way of reading and augmenting image segmentation data into TensorFlow

python deep-learning pipeline tensorflow labels data-preprocessing masks semantic-segmentation data-augmentation image-augmentation augmentation image-preprocessing input-pipeline data-augmentations

Updated Jun 15, 2020
Python

thepanacealab / SMMT

Star

Social Media Mining Toolkit (SMMT) main repository

tweets annotation twitter-api data-acquisition spacy data-preprocessing gathering data-annotation

Updated Mar 2, 2021
Python

TensorMSA / tensormsa

Star

Deep learning GUI frame work for enterprise

docker machine-learning deep-learning docker-compose tensorflow gpu microservices-architecture data-preprocessing

Updated Mar 8, 2018
Python

dansuh17 / segan-pytorch

Star

SEGAN pytorch implementation https://arxiv.org/abs/1703.09452

audio pytorch data-preprocessing mir source-separation speech-enhancement segan segan-pytorch

Updated Mar 11, 2019
Python

nursnaaz / 25DaysInMachineLearning

Star

I will update this repository to learn Machine learning with python with statistics content and materials

python data-science machine-learning statistics random-forest numpy linear-regression machine-learning-algorithms python3 logistic-regression machinelearning modelling data-preprocessing practise decision-tree descriptive-statistics bias covariance bagging machinelearning-python

Updated Nov 22, 2020
Jupyter Notebook

asavinov / prosto

Star

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

python workflow data-science spark pandas map-reduce business-intelligence olap data-wrangling data-preprocessing feature-engineering data-processing data-preparation

Updated Nov 21, 2021
Python

HypoX64 / candock

Star

A time series signal analysis and classification framework

deep-learning eeg classification data-preprocessing data-augmentation series-signal-analysis

Updated Jan 27, 2021
Python

triton-inference-server / dali_backend

Star

The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.

python deep-learning gpu image-processing dali data-preprocessing nvidia-dali fast-data-pipeline

Updated Mar 17, 2022
C++

danielhanchen / sciblox

Star

sciblox - Easier Data Science and Machine Learning

python data-science machine-learning data-mining sklearn data-visualization imputation data-analysis data-preprocessing boosting

Updated Jul 28, 2017
HTML

soumyadip007 / Data-Science-Using-Python-University-Course-Module

Star

“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumPy; SciPy; a helping hand from Python's Standard Library.

python data-science numpy jupyter-notebook data-visualization plotting data-preprocessing panda data-processing data-preparation knn

Updated Apr 15, 2020
Jupyter Notebook

repetere / modelscript

Star

REPO MOVED TO https://repetere.github.io/jsonm-data - Data Science and Machine learning in JavaScript

javascript data-science machine-learning data-mining data-preprocessing

Updated Dec 2, 2019
JavaScript

ammsa / DTCleaner

Star

DTCleaner: data cleaning using multi-target decision trees.

data-science data-mining data-wrangling data-preprocessing data-cleaning data-quality

Updated Jun 21, 2016
Java

maet3608 / nuts-ml

Star

Flow-based data pre-processing for deep learning

data-science deep-learning deep-learning-library data-preprocessing deep-learning-framework

Updated Jan 6, 2021
Python

mdkearns / automated-data-preprocessing

Star

A command-line utility program for automating the trivial, frequently occurring data preparation tasks: missing value interpolation, outlier removal, and encoding categorical variables.

python data-science machine-learning automation interpolation pandas data-engineering imputation argparse outlier-detection command-line-tool data-preprocessing data-processing outlier-removal one-hot-encode

Updated Jan 4, 2019
Python

KwokHing / YandexCatBoost-Python-Demo

Star

Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data exploration, cleaning, preprocessing and model tuning are performed on the dataset

visualization python data-science pandas seaborn feature-selection data-analysis data-preprocessing python27 gradient-boosting-classifier gradient-boosting pearson-correlation one-hot-encode catboost variance-analysis yandex-catboost

Updated Dec 5, 2019
Jupyter Notebook

ELToulemonde / dataPreparation

Star

Data preparation for data science projects.

data-science r variable-selection speed data-preprocessing data-preparation date-conversion variable-elimination

Updated Feb 11, 2022
R

LaureBerti / Learn2Clean

Star

Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning

reinforcement-learning data-preprocessing automated data-cleaning data-curation data-cleaning-pipeline

Updated Nov 15, 2021
Python

abrazinskas / machine-learning-data-pipeline

Star

Pipeline module for parallel real-time data processing for machine learning models development and production purposes.

python data-science machine-learning natural-language-processing deep-learning algorithms parallel data-preprocessing data-processing computing data-preparation data-pipeline

Updated Nov 13, 2019

buabaj / xplore

Star

A python package built for data scientist/analysts, AI/ML engineers for exploring features of a dataset in minimal number of lines of code for quick analysis before data wrangling and feature extraction.

data-science machine-learning artificial-intelligence data-wrangling data-preprocessing

Updated Apr 29, 2021
Python

suraj-maniyar / Stock-Trading-Using-Machine-Learning

Star

A comprehensive approach for stock trading implemented using Neural Network and Reinforcement Learning separately.

reinforcement-learning neural-network pca-analysis data-preprocessing

Updated Jun 25, 2018
Python

Kukuster / SumStatsRehab

Star

GWAS summary statistics files QC tool

bioinformatics gwas data-preprocessing data-preparation summary-statistics data-prep gwas-pipeline bioinformatics-tool gwas-summary-statistics sumstats

Updated Jan 22, 2022
Python

ISTE-VESIT-ORG / Machinera-2020

Star

This is an AI Series where we will cover Machine Learning and Deep Learning topics from the very basics.

python machine-learning web-scraping data-visualisation supervised-learning data-preprocessing data-manipulation unsupervised-learning

Updated Mar 7, 2021

Pooja-Bhojwani / linked-eed

Star

Aim is to come up with a job recommender system, which takes the skills from LinkedIn and jobs from Indeed and throws the best jobs available for you according to your skills.

python text-mining data-mining data-preprocessing jaccard-similarity social-network-backend job-recommendation skill-algorithm

Updated Oct 9, 2021
Python

priyanshu1210 / lung-cancer-detection

Star

This is a project based on Data Science Bowl 2017. I did my best to propose a solution for the problem but I am still new to Deep Learning so my solution is not the optimal one but it can definitely be improved with some fine tuning and better resources.

data-science machine-learning deep-learning neural-network tensorflow lung-cancer-detection convolutional-neural-networks data-preprocessing data-science-bowl-2017

Updated Sep 16, 2018
Jupyter Notebook

Oct	MAR	Apr
	20
2020	2022	2023

data-preprocessing

Here are 507 public repositories matching this topic...

zzw922cn / Automatic_Speech_Recognition

machinelearnjs / machinelearnjs

Refactor all dataSync with arraySync

Remove stopword module usage

feature/elastic_net

msamogh / nonechucks

Write tests

Route next batch through step_to_index_fn in _SafeDataLoaderIter

akanz1 / klib

harunurrashid97 / 100-Days-Of-ML-Code

License.md

iTechArt / convtools-ita

HasnainRaz / SemSegPipeline

thepanacealab / SMMT

TensorMSA / tensormsa

dansuh17 / segan-pytorch

nursnaaz / 25DaysInMachineLearning

asavinov / prosto

HypoX64 / candock

triton-inference-server / dali_backend

danielhanchen / sciblox

soumyadip007 / Data-Science-Using-Python-University-Course-Module

repetere / modelscript

ammsa / DTCleaner

maet3608 / nuts-ml

mdkearns / automated-data-preprocessing

KwokHing / YandexCatBoost-Python-Demo

ELToulemonde / dataPreparation

LaureBerti / Learn2Clean

abrazinskas / machine-learning-data-pipeline

buabaj / xplore

suraj-maniyar / Stock-Trading-Using-Machine-Learning

Kukuster / SumStatsRehab

ISTE-VESIT-ORG / Machinera-2020

Pooja-Bhojwani / linked-eed

priyanshu1210 / lung-cancer-detection

Improve this page

Add this topic to your repo