# data-transformation
Here are 112 public repositories matching this topic...
data-science
machine-learning
spark
bigdata
data-transformation
pyspark
data-extraction
data-analysis
data-wrangling
dask
data-exploration
data-preparation
data-cleaning
data-profiling
data-cleansing
big-data-cleaning
data-cleaner
cudf
dask-cudf
Updated Jan 26, 2021 - Jupyter Notebook
A block-based API for NSValueTransformer, with a growing collection of useful examples.
Updated Mar 23, 2020 - Objective-C
library
framework
asynchronous
php-development
scalability
porter
data-import
data-transformation
abstraction
durability
Updated Dec 16, 2020 - PHP
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Examples SDK.
microsoft
sdk
csharp
dotnet
examples
prose
data-transformation
program-synthesis
synthesis
data-wrangling
Updated Jan 26, 2021 - C#
Logical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
subscription
replication
etl
zero-downtime
postgresql
data-transformation
publish-subscribe
cdc
logical-decoding
data-transport
database-replication
Updated Jan 25, 2021 - C
Like Awk but with SQL and table joins
Updated Nov 9, 2020 - Tcl
Updated May 27, 2020 - TypeScript
Advanced and Fast Data Transformation in R
data-science
cran
r
statistics
time-series
high-performance
data-transformation
scientific-computing
econometrics
rstats
data-analysis
data-manipulation
data-processing
weights
panel-data
weighted
data-aggregation
Updated Jan 27, 2021 - R
Data transformation and utility functions for R
Updated Jan 7, 2021 - R
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
spark
hadoop
algorithms
data-transformation
pyspark
partitioning-algorithms
mapreduce
data-algorithms
data-partition
mapreduce-algorithm
santa-clara-university
mapreduce-python
pyspark-algorithms-book
Updated Jan 27, 2021 - HTML
A simple Spark-powered ETL framework that just works 🍺
data-science
machine-learning
framework
scala
big-data
spark
pipeline
etl
data-transformation
data-engineering
dataset
data-analysis
modularization
setl
etl-pipeline
Updated Jan 22, 2021 - Scala
Reference Architectures for Datalakes on AWS
glue
amazon-emr
data-transformation
data-lake
data-catalog
data-analytics
hive-metastore
emr-cluster
ingest-data
Updated May 13, 2020 - HTML
machine-learning
deep-learning
data-transformation
data-visualization
machine-learning-library
machine-learning-api
datasets
data-cleaning
ludwig
data-augmentation
automl
tpot
machine-learning-models
model-compression
model-deployment
autokeras
voice-computing
data-cleaning-pipeline
autopytorch
Updated Jan 7, 2021 - Python
A curated list of Clojure resources for dealing with domain-specific languages.
Updated Jan 6, 2021
Serialize PHP variables, including objects, in any format. Supports unserializing them too.
api
php
yaml
serialization
json
php7
json-api
xml
data-transformation
yml
jsonapi
transformer
hal
hal-api
xml-transformation
marshaller
json-transformation
array-transformer
yaml-transformer
jsend-transformer
Updated Jul 19, 2018 - PHP
Wrangler Transform: A DMD system for transforming Big Data
data-science
big-data
parsing
avro
data-transform
data-transformation
project
transform-data
preparation
transform
wrangle
manipulate-data
cdap
cdap-plugin
data-prep
data-cleansing
Updated Jan 21, 2021 - Java
object flow treatment, data transformation
Updated Jan 13, 2021 - JavaScript
Data transformation toolkit
Updated Jan 16, 2021 - Ruby
Daany - DAta ANalYtics C# library with implementations of DataFrame, time series decomposition, and various statistical parameters. It is closely integrated with ML.NET so that transformed data can be loaded into an ML.NET pipeline.
Updated Nov 27, 2020 - C#
Foofah: programming-by-example data transformation program synthesizer
data-transformation
data-wrangling
data-preparation
data-cleaning
combinatorial-search
programming-by-example
inductive-program-synthesis
heursitic
Updated Apr 23, 2018 - CSS
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
excel
csv-files
tabular-data
data-transformation
power-bi
data-acquisition
data-visualization
open-data
data-visualisation
data-analytics
data-analysis
power-query
tabular-data-package
data-package
powerbi
json-table-schema
frictionlessdata
datapackage
Updated Apr 21, 2020 - R
A PHP serialization component focused on performance
Updated May 28, 2020 - PHP
A tool to read CSV files with CSVW metadata and transform them into other formats.
Updated Apr 30, 2019 - Python
bamboolib - template for creating your own binder notebook
docker
data-science
data-transformation
data-visualization
data-visualisation
data-viz
data-exploration
binder-jupyter-notebook
Updated Jan 18, 2021 - Jupyter Notebook
DEPRECATED: YAML-based data transformations
Updated Oct 11, 2019 - Python
A synchronous data pipeline, data sharing, and stream processing library for Node.js and JavaScript. Actionable and transformable pipeline data processing.
nodejs
javascript
flow
data-stream
data-transformation
pipeline-framework
data-flow
synchronous
data-pipeline
streaming-data
data-processor
pipe-data
Updated Aug 8, 2017 - JavaScript
Short programming tutorials pertaining to data analysis.
Updated Apr 19, 2017 - Jupyter Notebook
Right now the tutorial is coherently designed, tested, and even documented. However, it doesn't build up in a very beginner-friendly way. It establishes glom's value and then immediately uses it at an intermediate level.
I'd like it if it were a bit more drawn out, using basic features first and then adding a multi-line Coalesce as the