StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

thebluephantom

Rating
1474.42 (4,512,370th)
Reputation
3,105 (54,452nd)
Page: 1 2 3 4 ... 14
Title Δ
Alternate or better approach to aggregateByKey in pyspark RDD +0.53
Pyspark Array Column - Replace Empty Elements with Default Value +0.31
Hand selecting parquet partitions vs filtering them in pyspark 0.00
Defining Schemas with Struct and Array Types 0.00
Losing entries when inner-joining data to a left-joined DataFrame i... 0.00
Spark group by Key and partitioning the data 0.00
Function to find overlapping data in Spark DataFrame -0.07
Count a column based on distinct value of another column pyspark 0.00
JSON schema inference in Structured Streaming with Kafka as source 0.00
Pyspark cannot load from pathlib object 0.00
Is there a way to add a column with range of values to a Spark Data... +0.55
Huge time gap between spark jobs 0.00
Is it possible to change SparkContext.sparkUser() AFTER the SparkCo... 0.00
Pyspark SQL: Transform table with array of struct to columns 0.00
How to distribute specific data to each cluster node in spark? +0.02
Understanding scala binary compatibility on my example 0.00
Databricks Spark conditional pull from Azure SQL 0.00
How to configure where spark spills to disk? 0.00
Between statement is not working on Hive Map column - Spark SQL 0.00
Spark avoid execution of the entire query each time 0.00
Get Yarn application id before SparkSession is instantiated 0.00
Kryo encoder v.s. RowEncoder in Spark Dataset -0.47
spark jdbc - multiple connections to source? 0.00
Spark aggregation / group by so as to determine a new column's... -1.69
Processing .txt file using wholeTextFiles & wanting to extract... 0.00
Monitor the executors of Spark Application 0.00
How to clean up the checkpoint files accumulated in spark structure... 0.00
Errors querying Hive table from PySpark 0.00
Combining Rows that link together in a Spark Dataframe 0.00
Reading in multiline text files 0.00
Using regexp to join two dataframes in spark +0.32
Add filtered RDD to another RDD 0.00
How to use salting technique for joining data frames having skewed... 0.00
Unable to write PySpark Dataframe created from two zipped dataframes +0.25
Spark SQL - Check for a value in multiple columns -0.88
Scala 2.12.10 with Spark 3.0.0 : What does "data.map(Tuple1.ap... 0.00
Window Overload method cannot resolve in spark structured streaming... 0.00
Late data handling in Spark past watermark 0.00
Does spark structured streaming job fails if dependency jars are up... 0.00
Spark SQL Merge query 0.00
Can two executors / drivers from different Spark applications run o... 0.00
How Spark ensures data consistency if a node/partition fails? 0.00
Spark SQL : subtract respective rows of one dataframe from another -1.24
Spark SQL : subtract respective rows of one dataframe from another +1.01
Transform tuple to matrix in Spark 0.00
Joining a stream and a static dataframe in pyspark with Complete Mode 0.00
Training Random Forest in XGBoost 4J Spark 0.00
Pyspark Convert PipelinedRDD to Spark DataFrame -0.48
Spark row_number partitionBy without order by to keep natural order +2.19
How to stream only new data (newly appended) from old file in spark... 0.00