StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for zero323

Rating: 1616.55 (1,196th)
Reputation: 187,486 (265th)
Title Δ
Finding medians or approximate medians on many elements efficiently 0.00
Saving a spark dataframe in multiple parts without repartitioning 0.00
How to disable scientific notation in spark-xml -0.48
Custom processing on column in Apache Spark (Java) 0.00
Convert comma separated string to array in pyspark dataframe 0.00
unionAll resulting in StackOverflow 0.00
Using futures in Spark-Streaming & Cassandra (Scala) 0.00
How to decode HTML entities in Spark? 0.00
PySpark (Python 2.7): loading multiline records via SparkContext.ne... 0.00
Aggregation of multiple values using scala/spark 0.00
Why are Spark Parquet files for an aggregate larger than the origin... 0.00
Comparing datetime object to 8601 strings gives wrong result, why i... 0.00
Avoid redundant computation of new columns in Spark -2.53
Pyspark user defined aggregate calculation on columns 0.00
PySpark (Python 2.7): How to flatten values after reduce 0.00
How to do outer joins : Spark Scala SQLContext 0.00
Trouble getting aggregate sum function to correctly count elements 0.00
Dividing two columns of a different DataFrames +0.35
how to implement an iteratation in flatMap function 0.00
PySpark: read, map and reduce from multiline record textfile with n... 0.00
Spark: How to parse multiple json with List of arrays of Struct? +0.35
Spark Scala: How to convert Dataframe[vector] to DataFrame[f1:Doubl... 0.00
groupBy cannot handle large RDDs 0.00
Finding sum-of-square fractions in an aggregated dataframe 0.00
Coalescing has no effect on number of partitions in spark 0.00
PySpark Evaluation +0.34
Spark (streaming) RDD foreachPartitionAsync functionality/working 0.00
Why is my Spark DataFrame much slower than RDD? 0.00
How to run 2 functions doing completely independent transformations... 0.00
Save a spark RDD using mapPartition with iterator +0.36
Spark Random Forest Cross-Validation error 0.00
How is the Spark select-explode idiom implemented? 0.00
PySpark: How to select * from the left table during rdd join 0.00
Is it possible to iteratively collect each partition of rdd? 0.00
Unclosed character class using punctuation in Spark +0.35
Why quantile computation using hiveContex in spark is very slow? 0.00
Applying withColumn function with regular expression patterns in Sp... 0.00
How to modify numpy arrays in Spark dataframe? 0.00
What is the difference between cube and groupBy for operating on Da... 0.00
How to write a wrapper using existing built-in UDFs in Hive? +0.35
LinearRegression scala.MatchError: 0.00
Spark SQL performance - JOIN on value BETWEEN min and max 0.00
spark dataframe grouping, sorting, and selecting top rows for a set... 0.00
Efficient method of Count of distinct values in the column of dataf... 0.00
Select specific columns in a PySpark dataframe to improve performance -0.52
pyspark. Transformer that generates a random number generates alway... 0.00
How can I add a value to a row in pyspark? 0.00
Using Spark Dataframes, example of using interval in window functions 0.00
Inconsistent results in pyspark combineByKey (as opposed to groupBy... 0.00
How to divide dataset in two parts based on filter in Spark-scala 0.00