Apache Spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Here are 5,878 public repositories matching this topic...
-
Updated
May 13, 2021 - Python
-
Updated
Aug 20, 2021 - Python
-
Updated
Jul 4, 2021 - Go
-
Updated
Aug 20, 2021 - Python
Describe the bug
Serverless: Deprecation warning: Variables resolver reports following resolution errors:
- Cannot resolve variable at "provider.environment.CUBEJS_APP": Value not found at "self" source,
- Cannot resolve variable at "functions.cubejsProcess.events.0.event.resource": Value not found at "self" source
From a next major this will be
-
Updated
Aug 14, 2021 - Java
-
Updated
Aug 3, 2021
-
Updated
Oct 31, 2019
-
Updated
Aug 20, 2021 - Java
-
Updated
Aug 18, 2021 - Java
-
Updated
Dec 23, 2020 - Python
-
Updated
Aug 21, 2021 - Jupyter Notebook
-
Updated
Aug 21, 2021 - Java
-
Updated
Apr 24, 2020 - Jsonnet
-
Updated
Jul 28, 2021 - Scala
-
Updated
Jul 14, 2021 - Python
Hi,
I am running the deltaTable = DeltaTable.convertToDelta(spark, f"parquet.
{data_path}")
to read a DeltaTable from the parquet files but it doesn't return one as suggested in the documents. However, it successfully converts them. If I read them exactly after that line again using forPath, it will give me the DeltaTable.
 behavior
I can't get config, when post a job with 'sync=true'. I got it:
http://localhost:8090/jobs/ff99479b-e59c-4215-b17d-4058f8d97d25/config
{"status":"ERROR","result":"No such job ID ff99479b-e59c-4215-b17d-4058f8d97d25"
I have a simple regression task (using a LightGBMRegressor) where I want to penalize negative predictions more than positive ones. Is there a way to achieve this with the default regression LightGBM objectives (see https://lightgbm.readthedocs.io/en/latest/Parameters.html)? If not, is it somehow possible to define (many example for default LightGBM model) and pass a custom regression objective?
Created by Matei Zaharia
Released May 26, 2014
- Repository
- apache/spark
- Website
- spark.apache.org
- Wikipedia
- Wikipedia
At this moment relu_layer op doesn't allow threshold configuration, and legacy RELU op allows that.
We should add configuration option to relu_layer.