StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

Sim

Rating
1528.33 (20,138th)
Reputation
7,026 (22,842nd)
Page: 1 2 3 4
Title Δ
Jaccard Similarity between lines of text Apache Spark 0.00
Customize Apache Spark implementation of TF-IDF 0.00
Spark - Reading many small parquet files gets status of each file b... 0.00
Choosing between U-SQL and Spark / Databricks 0.00
Databricks Create a list of dataFrames with their size 0.00
When creating a table from a folder of csv files header information... 0.00
Databricks - Displaying a Dataframe and printing a string 0.00
Create sample value for failure records spark 0.00
Spark - Mixed case sensitivity in Spark DataFrame, Spark SQL, and/o... 0.00
Read a bytes column in spark 0.00
Spark Dataframe join Exception thrown in awaitResult 0.00
Spark: get number of cluster cores programmatically 0.00
How to aggregate data in Spark using Scala? +3.87
Remove duplicates within Spark array column +3.66
How to apply custom data formatting/map to each event before loadin... -4.13
Is UNCACHE table a lazy operation in Spark SQL? 0.00
How to reuse broadcast variable in Spark? +3.48
Apache Spark 2.1 - Scala Lengthy/Heavy attributes for Row Object 0.00
Spark SQL queries on partitioned data using Date Ranges +3.86
Reducing shuffle disk usage in Spark aggregations 0.00
Spark SQL read a JSON file which has already escaped double quote 0.00
Spark Dataset Loading multiple CSV files with headers inside a fold... 0.00
Conditional application of `filter`/`where` to a Spark `Dataset`/`D... +4.01
Dataset.groupByKey + untyped aggregation functions -0.10
How to repartition a dataframe in Spark scala on a skewed column? +3.78
Spark Timestamp - Millis and RFC3339 nano 0.00
broadcast() multiple times the same df. Is it cached? 0.00
spark: dataframe.count yields way more rows than printing line by l... +3.94
Does master node execute actual tasks in Spark? 0.00
What is an efficient way to partition by column but maintain a fixe... -0.17
Reading multiple files from S3 in Spark by date period 0.00
Spark Dataset select with typedcolumn 0.00
How to get separate RDD for each key entry +3.98
Validating against a variable number of columns in Spark 0.00
How to determine which apis to use for the code to be time efficien... 0.00
How to unnest data with SparkR? -3.97
How can I efficiently send data in a parallelized way to a REST end... 0.00
Spark standalone cluster behavior Query 0.00
Sort in descending order, using hive table in spark scala 0.00
spark: case sensitive partitionBy column 0.00
Spark DataFrame Zeppelin read folders 0.00
How to name aggregate columns? +0.11
How to use orderby() with descending order in Spark window functions? 0.00
Is it possible to do an update using SparkSQL? -0.46
Partition Location of RDD/Dataframe 0.00
Transform a column of json strings to structs 0.00
Spark SQL UDF returning scala immutable Map with df.WithColumn() 0.00
Can I use SELECT from dataframe instead of creating this temp table? 0.00
Spark: Read an inputStream instead of File 0.00
Overwrite specific partitions in spark dataframe write method 0.00