# sparksql
Here are 189 public repositories matching this topic...
Geo Spatial Data Analytics on Spark
big-data, spark, geojson, geospatial, sparksql, shapefile, magellan, geometric-algorithms, geospatial-processing, geospatial-analytics, geospatial-analysis
Updated Oct 5, 2019 - Scala
Scala examples for learning to use Spark
Updated Sep 17, 2020 - Scala
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Updated Aug 3, 2018 - Scala
Process Common Crawl data with Python and Spark
Updated May 17, 2021 - Python
vpipkt commented Sep 10, 2020
This code:
val df = spark.read.raster //....
  .select(rf_tile($"red"), rf_extent($"red") as "red_extent", rf_crs($"red") as "red_crs")
  .toLayer(tlm)
fails with an error about column crs as crs not found. Pinpointed this to https://github.com/locationtech/rasterframes/blob/develop/core/src/main/scala/org/locationtech/rasterframes/extensions/RasterJoin.scala#L44
Spring-Shiro-Spark is an attempt at integrating Spring-Boot, Hibernate, Spark, Spark-SQL, Shiro, iView, VueJs... ...
Updated Dec 18, 2017 - Java
A prototype of a big data platform; the source code for the book Big Data Platform Architecture and Prototype
redis, demo, kafka, spark, prototype, bigdata, spark-streaming, quickstart, sparksql, oozie, sqoop, spark-sql, spark-streaming-examples, sqoop-import, spark-demo, middle-end, middle-office, spark-examples
Updated Aug 12, 2020 - Java
Type-class-based data cleansing library for Apache Spark SQL
Updated Jun 23, 2019 - Scala
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
Updated Mar 14, 2021 - Scala
Google Spreadsheets datasource for SparkSQL and DataFrames
Updated May 21, 2021 - Scala
ACL management for Apache Spark SQL with Apache Ranger. This library has been contributed to https://github.com/apache/submarine as a sub-module, and that module can still be used on its own. This project will no longer be updated. If you have questions, see https://github.com/apache/submarine/tree/master/docs/submarine-security/spark-security/README.md to learn how to use it, and give feedback to the Apache Submarine community by following https://submarine.apache.org/community/contributors.html
Updated Apr 1, 2020 - Scala
Demo applications that show how to move offline feature engineering solutions online in one minute with fedb and nativespark
Updated Jun 19, 2021 - Python
Cloud-based SQL engine using Spark, where data is accessible as a JDBC/ODBC data source via Spark ThriftServer.
Updated Jul 12, 2017 - Java
This repository contains Spark, MLlib, PySpark, and DataFrames projects
Updated Oct 22, 2017 - Jupyter Notebook
PostgreSQL and Greenplum Data Source for Apache Spark
Updated Jan 30, 2021 - Scala
Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer. It provides load balancing and rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.
Updated Apr 26, 2021 - Scala
Spark DataFrames for earth observation data
Updated May 1, 2018 - Scala
PySpark functions and utilities with examples. Assists the ETL process of data modeling.
Updated Dec 3, 2020 - Jupyter Notebook
Materials for Oxford Software Engineering Programme CLO course
Updated Jul 23, 2021 - Jupyter Notebook
Toy Hadoop cluster combining various SQL-on-Hadoop variants
spark, yarn, presto, hive, hadoop, impala, hbase, zookeeper, sparksql, hadoop-cluster, hue, hadoop-filesystem, hadoop-mapreduce, tez, hadoop-docker, hbase-client, hadoop-framework, yarn-hadoop-cluster, hadoop-hdfs, zookeeper-deployment
Updated Nov 16, 2017 - Shell
Is your feature request related to a problem? Please describe.
Today the user needs to manually deploy UDF jars and reference data CSVs to the blob location.
Describe the solution you'd like
Enable the user to choose a file on a local disk, which the web portal will then upload to the right location.