StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for Glennie Helles Sindholt

Rating: 1533.58 (15,614th)
Reputation: 7,385 (21,590th)
Title Δ
Pyspark Multiple JOINS Column <> Row values: Reducing Actions 0.00
createDataFrame from List<Row> result throws NullPointerExcep... 0.00
How to split Comma-separated multiple columns into multiple rows? -0.23
Element-wise sum of array across rows of a dataset - Spark Scala 0.00
Can we replicate Spark's .cache() behavior but by saving parque... 0.00
How to outputs the first row with a matching key in Scala spark 0.00
reduce RDD having key as (String,String) 0.00
Gremlin query via HTTP is extremely slow 0.00
Understanding JavaPairRDD.reduceByKey function +0.45
Summing n columns in Spark in Java using dataframes -2.19
Unfair split of workload among spark executors -0.55
How can I read from S3 in pyspark running in local mode? +0.12
How to load huge no of small files in spark on EMR 0.00
How to tune the spark application in order to avoid OOM exception 0.00
how to improve performance by avoiding flatmap operation in apache... 0.00
Getting out of memory error while reading parquet file in spark sub... +0.45
AWS EMR- Yarn Container 0.00
GroupByKey faster than CombineByKey 0.00
Spark Multiple Output Paths result in Multiple Input Reads 0.00
How spark read a large file (petabye) when file can not be fit in s... 0.00
Joining pairs of key-value with pairs of key-map +0.44
filter dataframe based on a condition on two columns -0.29
Balanced RDD partition among workers - Spark 0.00
INSERT IF NOT EXISTS ELSE UPDATE in Spark SQL 0.00
Spark doesn't read columns with null values in first row -0.15
spark giving incorrect output for some value and correct output for... 0.00
Cassandra Spark : how to compare elements of two tables? 0.00
Spark memory limit exceeded issue 0.00
Query Amazon S3 Object Metadata via Spark 0.00
Cartesian product using spark 0.00
alternate way to proceed without list in scala -2.33
Break big spark sql query into smaller queries and merge it 0.00
Remove duplicate in an array[string] +1.08
spark df.write.partitionBy run very slow 0.00
write a spark Dataset to json with all keys in the schema, includin... 0.00
Process big data using hadoop parquet to CSV output +0.44
Issues running Spark application on ASW with compute optimized inst... 0.00
emr-5.4.0 (Spark executors memory allocation issue) +0.45
map in RDD with cluster +0.44
Fill Nan with mean of the row in Scala-Spark -0.19
Spark: subset a few columns and remove null rows -1.43
How to efficiently distribute and use partitions in spark? 0.00
Spark : Modify CSV file and write to other folder 0.00
applying a function of every element of an RDD -0.31
FileAlreadyExists pyspark +0.45
How to Execute sql queries in Apache Spark -0.07
How to Group By many keys in Spark RDD? 0.00
Partition a Spark Dataframe based on a specific column and dump the... 0.00
In Spark how can I create a tuple from row as (Col1 , Col2,Col3 ,(C... +0.45
How to load only the data of the last partition 0.00