StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

thebluephantom

Rating
1474.42 (4,512,370th)
Reputation
3,105 (54,452nd)
Page: 1 ... 8 9 10 11 12 13 14
Title Δ
Select specific rows from Spark dataframe per grouping 0.00
sortBy in Spark / Scala 0.00
How to fix empty output for the textfilestream code 0.00
is it more efficient to cache a dataframe in on partition or more p... 0.00
How to convert teradata recursive query to spark sql 0.00
mapR Kafka cannot start second time round 0.00
QueueStream for Structured Streaming possible? 0.00
spark structured streaming exception : Append output mode not suppo... 0.00
What is the difference between dynamic.partition=True and dynamic.p... 0.00
How to perform group by and aggregate operation on spark sql 0.00
Spark repartition is not working as expected +0.21
Spark Window functions: is it possible to get other values directly... 0.00
Parallel API requests using Spark and scala 0.00
How to implement Slowly Changing Dimensions (SCD2) Type 2 in Spark 0.00
Case Class within foreachRDD causes Serialization Error 0.00
Spark QueueStream never exhausted 0.00
How to use foreachRDD in legacy Spark Streaming +0.54
Is GroupByKey function in Spark that bad? 0.00
Spark - How to join current and previous records in a DataFrame and... +0.31
Using AWS EMRFS in apache spark hosted on ec2 +1.95
Breaking SQL query up to improve Spark efficiency 0.00
Need spark vs tez vs MR comparsion 0.00
Difference in running a spark application with sbt run or with spar... +0.54
Can Apache Spark worker nodes be different machines than HDFS data... 0.00
Spark copying dataframe columns best practice in Python/PySpark? +0.04
Number of dataframe partitions after sorting? -0.44
HDFS File replace while other applications are accessing data 0.00
In Scala, how would I take a Spark RDD, and output to different fil... +0.54
pyspark - Join two RDDs - Missing third column 0.00
PySpark - Filter RDD based on another RDD - broadcast an RDD 0.00
How to use correctly mapPartitions function 0.00
Amazon EMR vs EC2 for Off loading BI & Analytics anno 2018 0.00
Spark java.lang.NullPointerException Error when filter spark data f... 0.00
Creating a new dataframe with many rows for each row in existing da... 0.00
Does size of part files play a role for Spark SQL performance +1.98
How to remove words that have less than three letters in PySpark? -0.45
How to extract values from key value map, spark dataframe 0.00
Spark SQL query Group By value followed by list 0.00
Exactly Once Semantics KAFKA Possible Claim 0.00
Spark RDD Windowing using pyspark 0.00
spark get minimum value in column that satisfies a condition -1.23
spark get minimum value in column that satisfies a condition +1.27
How to count the number of missing values in each row of a data fra... -2.29
Ambiguous Spark DataFrame schema - non JOINed scenario 0.00
Writing data in hive warehouse directory in two separate tables usi... 0.00
Run multiple spark queries in parallel in a multi-user environment... 0.00
Spark - Reading partitioned data from S3 - how does partitioning ha... 0.00
SparkSQL subquery and performance 0.00
unable to create dataframe from sequence file in Spark created by S... +0.05
How to filter Dataframe Rows not containing any of a list of Substr... -0.21